EADST

Quick Review: SmoothQuant: Accurate and Efficient Post-Training Quantization for LLMs

SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models

Highlight:

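  • Hyper-parameter for Outliers: SmoothQuant introduces a migration-strength hyper-parameter (α) that controls how much of the quantization difficulty caused by activation outliers is shifted onto the weights. It does this with a per-channel scaling applied before quantization, which leaves the layer's output mathematically unchanged.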
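
The outlier handling above can be sketched in a few lines of NumPy. This is a minimal illustration rather than the paper's reference implementation: the function names `smooth_scales` and `smooth` are hypothetical, the weight layout `(in_features, out_features)` is an assumption, and the per-channel factor `s_j = max|X_j|^α / max|W_j|^(1-α)` is the smoothing formula as I understand it from the SmoothQuant paper.

```python
import numpy as np


def smooth_scales(act_absmax, weight, alpha=0.5, eps=1e-5):
    """Per-input-channel smoothing factors (hypothetical helper).

    act_absmax: shape (in_features,), max |activation| per channel,
                typically collected on calibration data.
    weight:     shape (in_features, out_features) -- assumed layout.
    alpha:      the hyper-parameter; larger values move more of the
                quantization difficulty from activations to weights.
    """
    w_absmax = np.abs(weight).max(axis=1)
    # Clamp to eps so all-zero channels do not cause division by zero.
    num = np.power(np.maximum(act_absmax, eps), alpha)
    den = np.power(np.maximum(w_absmax, eps), 1.0 - alpha)
    return num / den


def smooth(X, W, alpha=0.5):
    """Return (X / s, diag(s) @ W); their product equals X @ W."""
    s = smooth_scales(np.abs(X).max(axis=0), W, alpha)
    return X / s, W * s[:, None]


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    X = rng.normal(size=(8, 4))
    X[:, 2] *= 50.0  # inject one outlier activation channel
    W = rng.normal(size=(4, 3))

    X_s, W_s = smooth(X, W, alpha=0.5)
    print("activation abs-max before:", np.abs(X).max(axis=0))
    print("activation abs-max after: ", np.abs(X_s).max(axis=0))
    print("output preserved:", np.allclose(X_s @ W_s, X @ W))
```

With α = 0.5 the activation and weight ranges of each channel end up equal, so neither side carries all of the outlier magnitude. In practice the scales would be computed offline from calibration activations and folded into the preceding layer, which this sketch does not show.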