EADST

Quick Review: ZeroQuant-FP

ZeroQuant-FP: A Leap Forward in LLMs Post-Training W4A8 Quantization Using Floating-Point Formats

Highlights:

  • FP4 Weight Quantization: Implements 4-bit floating-point (FP4) quantization for model weights.
  • FP8 Activation Quantization: Utilizes 8-bit floating-point (FP8) quantization for activations, optimizing the balance between performance and precision.
相关标签
About Me
XD
Goals determine what you are going to be.
Category
标签云
Attention 腾讯云 logger UNIX git ONNX IndexTTS2 关于博主 CTC Baidu News Google 证件照 第一性原理 Michelin Pickle Knowledge Agent Plotly Paddle icon Website 音频 CSV Markdown SQL Django Vim RGB OpenCV Pillow Data uWSGI Firewall Hungarian Pandas DeepStream 净利润 transformers Tensor PyTorch Sklearn GIT git-lfs FP16 Python Tracking CV FP32 Miniforge TTS hf v0.dev Pytorch Windows FlashAttention printf Image2Text TensorFlow XGBoost NameSilo Algorithm NLP Datetime 继承 Random ModelScope uwsgi Disk Paper Plate Streamlit HuggingFace Rebuttal Use PIP Quantize Bert GoogLeNet 签证 SVR Docker Qwen2.5 SQLite TSV Jupyter Math Bipartite Web Qwen2 Transformers Quantization Heatmap Qwen UI Llama HaggingFace PyCharm FP64 财报 Permission GPT4 强化学习 SPIE C++ Diagram mmap Shortcut Hilton OpenAI Animate FP8 LeetCode VGG-16 Distillation 域名 Excel Ptyhon Freesound AI Clash 报税 阿里云 Domain 多线程 SAM JSON Nginx Logo Claude Land Proxy Password 公式 BF16 Gemma Bin ChatGPT Template 图标 scipy diffusers LLAMA Bitcoin Dataset CAM TensorRT 算法题 FastAPI WebCrawler Breakpoint Statistics LaTeX v2ray CLAP CC 飞书 Mixtral Review Ubuntu VPN OCR DeepSeek 递归学习法 CUDA 图形思考法 tqdm 多进程 YOLO Augmentation Safetensors Jetson Zip COCO Card ResNet-50 Crawler GGML API 搞笑 QWEN Tiktoken Input InvalidArgumentError EXCEL Vmess Git PDF MD5 tar PDB 云服务器 Magnet llama.cpp Food Base64 BTC XML torchinfo Color Numpy VSCode LoRA BeautifulSoup GPTQ Anaconda RAR Video Translation 顶会 Conda 版权 Hotel WAN Search NLTK Github Linux Interview Cloudreve LLM CEIR
站点统计

本站现有博文323篇,共被浏览796714

本站已经建立2494天!

热门文章
文章归档
回到顶部