EADST

Quick Review: AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Highlight:

  • Optimal Alpha Scaling: Focuses on determining the optimal alpha value for scaling weights prior to quantization.
相关标签
About Me
XD
Goals determine what you are going to be.
Category
标签云
OpenCV Agent NameSilo EXCEL Domain Anaconda Dataset git 搞笑 Bitcoin Miniforge PDF LoRA 音频 多线程 域名 Hungarian 腾讯云 Password 报税 Vim Pillow 强化学习 uwsgi Review Excel 版权 Color LLAMA Website GPT4 News Clash 财报 证件照 阿里云 NLTK BeautifulSoup LeetCode 签证 Video mmap SVR Heatmap tar Llama Mixtral Git Search SPIE 算法题 BF16 API llama.cpp Augmentation Logo 论文 Knowledge Zip UNIX Paper 公式 Datetime ModelScope Transformers Firewall FastAPI MD5 Markdown CV 净利润 Magnet Cloudreve Template Paddle FP32 HuggingFace GoogLeNet 多进程 transformers Github PyTorch printf PIP YOLO Plate FP8 TensorRT diffusers Hotel CTC C++ Disk 云服务器 v0.dev FlashAttention Ubuntu ResNet-50 Statistics Ptyhon Hilton Vmess VSCode PDB CC FP64 Attention Plotly SQLite IndexTTS2 ChatGPT Quantization BTC CLAP git-lfs Google Random Bin RAR Conda Rebuttal 图标 CUDA Distillation Nginx 递归学习法 AI Qwen2 TTS Tensor CSV Windows Pytorch Claude QWEN Data OpenAI HaggingFace 顶会 VGG-16 LaTeX 论文速读 GGML Sklearn Permission Image2Text Gemma Use Proxy Pandas Jupyter Streamlit PyCharm Interview SQL Tiktoken CAM Algorithm Qwen2.5 Quantize FP16 GIT icon Base64 Pickle Card InvalidArgumentError Crawler Breakpoint LLM OCR scipy Django Diagram Numpy Translation XGBoost VPN uWSGI Safetensors 关于博主 hf Math Food TensorFlow UI SAM v2ray NLP Qwen tqdm 第一性原理 Baidu RGB 飞书 Freesound ONNX Bert XML COCO DeepStream Web Michelin CEIR Input Shortcut TSV Jetson DeepSeek WebCrawler Bipartite 继承 Tracking Python Animate Linux Land GPTQ torchinfo 图形思考法 Docker logger WAN JSON
站点统计

本站现有博文327篇,共被浏览835447

本站已经建立2540天!

热门文章
文章归档
回到顶部