Quick Review: SmoothQuant: Accurate and Efficient Post-Training Quantization for LLMs| 东毅居士

Quick Review: SmoothQuant: Accurate and Efficient Post-Training Quantization for LLMs

作者：XD / 发表： 2023年12月7日 00:45 / 更新： 2023年12月7日 00:57 / 科研学习 / 阅读量：1707

SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models

Paper: SmoothQuant on arXiv
Code: SmoothQuant on GitHub
Organization: MIT

Highlight:

Hyper-parameter for Outliers: Implements a novel approach using a specific hyper-parameter to manage outliers effectively during the quantization process.

本文作者：XD 转载请标明出处：http://www.eadst.com/blog/229

本站采用知识共享署名-非商业性使用-相同方式共享 4.0 国际许可协议进行许可。

上一篇
Quick Review: AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

下一篇
Setting Up v2rayNG with Tencent Cloud Silicon Valley Lighthouse

Category

标签云

InvalidArgumentError printf Bin Food Michelin 版权 uwsgi EXCEL WebCrawler Streamlit VSCode BTC scipy Nginx Safetensors Plate TensorRT PIP 音频 Markdown DeepSeek Git Cloudreve Jupyter VPN Quantization GIT Distillation Dataset GoogLeNet Claude RGB HaggingFace git PDB tqdm Shortcut CSV Datetime Augmentation 阿里云 LLM Quantize Excel Qwen Jetson Knowledge Magnet HuggingFace Diagram API GPT4 Tracking logger Color Google Ubuntu Input XGBoost ChatGPT SPIE Hungarian CEIR Translation PyTorch Zip Baidu torchinfo Hotel 飞书 CTC FastAPI Pandas NLP FP32 财报 NameSilo LoRA Github mmap Permission Interview Domain Pytorch Statistics Password VGG-16 PyCharm 净利润 v2ray git-lfs Card Sklearn diffusers Logo 多线程 COCO Attention RAR JSON GPTQ Llama Algorithm UNIX Bert 继承 OpenAI Use Paddle QWEN SQL hf Base64 v0.dev Clash FP16 Review ModelScope ResNet-50 LeetCode llama.cpp Vmess Bipartite CV Python transformers Docker ONNX Pillow Hilton Vim SQLite tar BeautifulSoup LLAMA Image2Text TTS Heatmap MD5 Freesound OCR Transformers 签证公式 Website DeepStream Ptyhon UI BF16 Tiktoken Firewall LaTeX FlashAttention CAM 关于博主 Math OpenCV Gemma Bitcoin Conda Linux 多进程 Data Crawler Breakpoint FP64 FP8 证件照 CC Qwen2.5 Qwen2 搞笑 uWSGI Plotly 算法题 XML Tensor AI C++ 报税 Numpy Pickle Windows Mixtral CUDA CLAP Disk PDF TensorFlow YOLO Random 域名 Template Web Django SVR TSV Proxy GGML NLTK Video Paper Anaconda Land 腾讯云

站点统计

本站现有博文304篇,共被浏览707335次

本站已经建立2327天!

原 Quick Review: SmoothQuant: Accurate and Efficient Post-Training Quantization for LLMs

作者：XD / 发表： 2023年12月7日 00:45 / 更新： 2023年12月7日 00:57 / 科研学习 / 阅读量：1707

SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models

Highlight:

Quick Review: SmoothQuant: Accurate and Efficient Post-Training Quantization for LLMs