EADST

Quick Review: ZeroQuant-FP

ZeroQuant-FP: A Leap Forward in LLMs Post-Training W4A8 Quantization Using Floating-Point Formats

Highlights:

  • FP4 Weight Quantization: Implements 4-bit floating-point (FP4) quantization for model weights.
  • FP8 Activation Quantization: Utilizes 8-bit floating-point (FP8) quantization for activations, optimizing the balance between performance and precision.
相关标签
About Me
XD
Goals determine what you are going to be.
Category
标签云
icon logger FP8 Cloudreve News Qwen2.5 Agent 版权 Rebuttal Vim CV Windows Google Vmess uwsgi LeetCode Pytorch UI Shortcut GPT4 ResNet-50 CC Quantization SAM 论文 Tensor Hotel Safetensors Crawler ChatGPT WebCrawler Jetson Michelin CSV Claude Anaconda ModelScope Use Animate HuggingFace GPTQ Proxy Tracking Markdown Gemma Mixtral 算法题 Search Permission NameSilo Statistics SQL Math Numpy Streamlit 顶会 GIT 净利润 BeautifulSoup Augmentation JSON Excel 域名 RGB tar BF16 TensorFlow Magnet 腾讯云 Domain 搞笑 Website llama.cpp OpenAI SPIE XGBoost 图标 Miniforge VGG-16 Bert Sklearn Breakpoint 图形思考法 Tiktoken Translation OCR PIP Ubuntu LLAMA CLAP Hilton 云服务器 Ptyhon Firewall Paper Clash Transformers WAN Pillow InvalidArgumentError DeepStream VPN Conda AI CUDA Random Baidu 财报 Land 公式 BTC Card Knowledge v0.dev UNIX hf Bitcoin LoRA MD5 torchinfo 论文速读 Image2Text 签证 ONNX 多线程 Dataset Docker FP32 多进程 OpenCV Plotly Input Heatmap FP16 Zip transformers IndexTTS2 Base64 RAR EXCEL 递归学习法 Pandas 第一性原理 Django 关于博主 uWSGI Qwen YOLO Logo CAM SQLite Diagram Bipartite Llama TTS Bin 阿里云 PyCharm SVR Video LaTeX Nginx Jupyter NLTK 报税 FastAPI COCO Python Password CTC HaggingFace FP64 Algorithm Paddle mmap Data Qwen2 Interview LLM DeepSeek Plate API VSCode 强化学习 TensorRT GGML PDB Food Attention NLP Disk Git 音频 Template PyTorch Quantize C++ Datetime Color PDF Freesound FlashAttention scipy v2ray Github 继承 CEIR tqdm Review printf Linux git-lfs TSV Hungarian Web QWEN Pickle GoogLeNet diffusers Distillation 飞书 证件照 git XML
站点统计

本站现有博文328篇,共被浏览858390

本站已经建立2566天!

热门文章
文章归档
回到顶部