EADST

Quick Review: SmoothQuant: Accurate and Efficient Post-Training Quantization for LLMs

SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models

Highlight:

  • Hyper-parameter for Outliers: Implements a novel approach using a specific hyper-parameter to manage outliers effectively during the quantization process.
相关标签
About Me
XD
Goals determine what you are going to be.
Category
标签云
Conda Agent Vmess Tracking COCO 递归学习法 FP16 ResNet-50 Paddle Pytorch QWEN GGML Crawler Markdown Vim 腾讯云 WAN GIT Baidu Miniforge Domain torchinfo Qwen Qwen2.5 SVR Datetime Video Python 继承 Gemma Interview 第一性原理 多进程 LLM GoogLeNet BF16 Michelin PyTorch GPTQ FlashAttention Food Augmentation BeautifulSoup PDF transformers TensorFlow Transformers FP8 Streamlit 净利润 Template 阿里云 Cloudreve CEIR Numpy 证件照 tqdm Password LoRA 财报 OpenCV Animate SQLite MD5 v0.dev Claude Disk uWSGI 音频 Django CC Tiktoken TensorRT Math Pickle Knowledge Linux Jupyter LLAMA FP32 ModelScope Google Translation DeepStream Quantization VGG-16 CLAP 算法题 YOLO UI 关于博主 NLP llama.cpp XML SPIE tar Statistics hf Algorithm Ubuntu SQL Attention Excel Clash IndexTTS2 Bert git OCR Base64 Sklearn 签证 Random Pandas Pillow Hungarian EXCEL AI 图形思考法 Nginx HuggingFace mmap PyCharm API SAM Diagram TSV GPT4 LaTeX Quantize Magnet Github JSON Safetensors TTS Plate Card BTC Data Hilton Logo Zip CSV Freesound PIP VPN 多线程 飞书 OpenAI Firewall NLTK Review Windows 搞笑 Image2Text Git CUDA printf Shortcut 域名 VSCode Bin Dataset git-lfs DeepSeek Anaconda Mixtral PDB Ptyhon CTC UNIX 报税 Website scipy Breakpoint RAR diffusers Plotly Input Color Heatmap Llama Docker CAM XGBoost ONNX 公式 Bitcoin Jetson CV uwsgi Qwen2 RGB Distillation Paper logger WebCrawler Use Tensor FP64 InvalidArgumentError C++ 版权 Permission Proxy LeetCode HaggingFace Web Land FastAPI v2ray ChatGPT Bipartite NameSilo Hotel
站点统计

本站现有博文316篇,共被浏览748355

本站已经建立2398天!

热门文章
文章归档
回到顶部