EADST

CONTINUE READING
CONTINUE READING
CONTINUE READING

Quick Review: ZeroQuant-FP

ZeroQuant-FP: A Leap Forward in LLMs Post-Training W4A8 Quantization Using Floating-Point Formats

Paper: https://arxiv.org/abs/2307.09782

Code: https://github.com/microsoft/DeepSpeed

Organization: Microsoft

CONTINUE READING
CONTINUE READING
CONTINUE READING
CONTINUE READING
CONTINUE READING
CONTINUE READING
  • 1
  • About Me
    XD
    Goals determine what you are going to be.
    Category
    标签云
    Hotel WebCrawler 多线程 算法题 Math 净利润 uWSGI NameSilo RGB Firewall SVR Jupyter FlashAttention BTC printf DeepStream Michelin Bert Baidu GPTQ Tiktoken 公式 图形思考法 Conda 搞笑 CSV Disk Dataset Pandas Distillation transformers FastAPI Quantize Datetime VSCode BF16 git-lfs HaggingFace Mixtral BeautifulSoup WAN Ptyhon Domain OCR CTC Claude tqdm Land FP16 Pillow GGML Plate SPIE TTS IndexTTS2 Gemma Tracking SQLite FP64 VGG-16 Safetensors PyTorch Llama 云服务器 PyCharm VPN FP32 Video Nginx Qwen Password Input Tensor 报税 git uwsgi CC 递归学习法 Attention Vim Translation Numpy Pytorch PDF Proxy LoRA Data Statistics Breakpoint Search Django scipy Food 顶会 Animate hf LLM tar 阿里云 JSON Excel 继承 Jetson Card Streamlit 签证 v2ray Quantization COCO PIP mmap PDB Algorithm 证件照 Miniforge Diagram XGBoost Knowledge Paper UNIX 飞书 Hilton Permission QWEN Zip Crawler Qwen2 音频 Color 腾讯云 TensorRT Google Bipartite AI Windows Review Web DeepSeek Hungarian Freesound MD5 LLAMA ModelScope UI llama.cpp Python 强化学习 OpenAI ChatGPT NLP v0.dev Anaconda RAR CLAP Image2Text Logo API OpenCV LaTeX Pickle Magnet Git diffusers Qwen2.5 HuggingFace YOLO Github Base64 Clash CAM Markdown ResNet-50 torchinfo 财报 版权 CEIR LeetCode Bin Paddle Ubuntu FP8 Linux Bitcoin Interview SAM Agent CUDA InvalidArgumentError GoogLeNet Template Transformers Cloudreve GIT Sklearn Website Shortcut C++ XML 多进程 Plotly TSV NLTK GPT4 News TensorFlow Heatmap CV Use Random 第一性原理 Vmess ONNX Augmentation 域名 SQL EXCEL logger 关于博主 Docker
    站点统计

    本站现有博文321篇,共被浏览772964

    本站已经建立2461天!

    热门文章
    文章归档
    回到顶部