EADST

Quick Review: SmoothQuant: Accurate and Efficient Post-Training Quantization for LLMs

SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models

Highlight:

  • Hyper-parameter for Outliers: Introduces a single hyper-parameter (the migration strength, α in the paper) that controls how much of the quantization difficulty caused by activation outliers is shifted onto the weights before quantization.
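
To make the highlight above concrete, here is a minimal NumPy sketch of the smoothing step as I understand it from the paper: each input channel j gets a scale s_j = max|X_j|^α / max|W_j|^(1-α), activations are divided by it, and weights are multiplied by it. The function names (`smooth_scales`, `smooth`), the `eps` guard, and the toy shapes are my own assumptions for illustration, not the authors' implementation.

```python
import numpy as np

def smooth_scales(X, W, alpha=0.5, eps=1e-5):
    """Per-input-channel smoothing scales (hypothetical helper).

    X: activations, shape (tokens, in_features)
    W: weights, shape (in_features, out_features)
    alpha: migration strength; larger values move more difficulty to the weights
    """
    act_max = np.abs(X).max(axis=0)   # max |X_j| over tokens, per input channel
    w_max = np.abs(W).max(axis=1)     # max |W_j| over outputs, per input channel
    # eps guards against division by zero; it is an assumption of this sketch.
    s = act_max ** alpha / np.maximum(w_max, eps) ** (1.0 - alpha)
    return np.maximum(s, eps)

def smooth(X, W, alpha=0.5):
    """Return (X / s, s * W); the product X @ W is unchanged mathematically."""
    s = smooth_scales(X, W, alpha)
    return X / s, W * s[:, None]

# Toy example: channel 1 of the activations is an outlier channel.
X = np.ones((4, 3))
X[:, 1] = 100.0
W = np.ones((3, 2))

X_s, W_s = smooth(X, W, alpha=0.5)
print(np.abs(X).max(), np.abs(X_s).max())   # activation range before and after
print(np.allclose(X_s @ W_s, X @ W))        # the layer output is preserved
```

With α = 0.5 the outlier is split evenly between activations and weights. In this toy case the largest activation drops from 100 to 10 while the largest weight rises from 1 to 10, so both tensors end up with a similar range to quantize.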