EADST

Quick Review: SmoothQuant: Accurate and Efficient Post-Training Quantization for LLMs

SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models

Highlight:

  • Hyper-parameter for Outliers: Implements a novel approach using a specific hyper-parameter to manage outliers effectively during the quantization process.
相关标签
About Me
XD
Goals determine what you are going to be.
Category
标签云
mmap Transformers FP64 多进程 音频 ChatGPT Safetensors WAN git-lfs Shortcut Datetime Website UI 强化学习 FlashAttention DeepSeek COCO 顶会 SQLite Claude BF16 Jupyter Excel Disk git tqdm Base64 Data Domain 递归学习法 Markdown Hilton 域名 LeetCode torchinfo printf Pandas 搞笑 Git News diffusers Pytorch Permission Paddle PIP Quantization 净利润 Paper Translation ONNX hf Miniforge Pillow Gemma 多线程 VGG-16 VSCode Vim Zip Breakpoint CEIR Windows Firewall GPTQ Tensor FP32 Qwen2.5 Qwen 算法题 版权 LLM Jetson logger transformers FP8 Bert TensorRT Streamlit Algorithm Nginx Cloudreve Conda Docker Hotel Proxy 报税 OpenCV Ptyhon Magnet FastAPI v2ray tar Random CLAP LoRA CAM Tiktoken CUDA InvalidArgumentError Food Augmentation SAM 证件照 阿里云 JSON Github Llama GoogLeNet Ubuntu XML TSV Django Quantize Math HuggingFace 关于博主 Linux NLP Color 第一性原理 OpenAI uwsgi PyTorch scipy Crawler BTC Review QWEN Card Google Python Sklearn SQL Clash Land PyCharm DeepStream NLTK Anaconda 继承 Qwen2 公式 Baidu Freesound Use AI Statistics Mixtral Heatmap Agent Video SPIE LaTeX MD5 PDF TensorFlow Template CC Knowledge HaggingFace Tracking Search TTS GIT XGBoost RAR VPN Input 飞书 财报 Web UNIX 签证 Bipartite uWSGI Plotly Diagram v0.dev PDB Plate CSV WebCrawler 图形思考法 FP16 YOLO Michelin Bitcoin C++ Distillation Interview llama.cpp Hungarian API Animate Bin Numpy Vmess BeautifulSoup Logo NameSilo Image2Text 腾讯云 CTC Pickle CV ModelScope ResNet-50 EXCEL IndexTTS2 GGML Password GPT4 OCR SVR Attention RGB Dataset LLAMA
站点统计

本站现有博文320篇,共被浏览759200

本站已经建立2427天!

热门文章
文章归档
回到顶部