EADST

Quick Review: AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Highlight:

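  • Optimal Alpha Scaling: Before quantization, AWQ scales each input channel of the weights by s = s_X^α, where s_X is that channel's average activation magnitude. The method focuses on searching for the α that minimizes the layer's output error after quantization.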
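
The alpha search can be illustrated with a small NumPy sketch. It is not the authors' implementation: the function names `pseudo_quantize` and `search_alpha`, the symmetric per-row 4-bit quantizer, the 21-point grid over [0, 1], and the mean-squared output error are all assumptions chosen for illustration. The weights are multiplied by s = (mean |activation|)^alpha per input channel, quantized, then divided by s again, which is equivalent to dividing the activations by s at run time.

```python
import numpy as np


def pseudo_quantize(w, n_bits=4):
    """Symmetric round-to-nearest quantization per output row, then dequantize.

    Assumption: a simple per-row scheme, not the grouped scheme real kernels use.
    """
    q_max = 2 ** (n_bits - 1) - 1
    step = np.abs(w).max(axis=1, keepdims=True) / q_max
    step = np.maximum(step, 1e-8)  # avoid division by zero for all-zero rows
    return np.clip(np.round(w / step), -q_max - 1, q_max) * step


def search_alpha(w, x, n_bits=4, n_grid=20):
    """Grid-search the scaling exponent alpha in [0, 1].

    w: (out_features, in_features) weight matrix.
    x: (n_tokens, in_features) calibration activations.
    Returns (best_alpha, best_error).
    """
    # Average activation magnitude per input channel.
    act_mag = np.maximum(np.abs(x).mean(axis=0), 1e-8)
    reference = x @ w.T  # full-precision layer output

    best_alpha, best_err = 0.0, np.inf
    for alpha in np.linspace(0.0, 1.0, n_grid + 1):
        s = act_mag ** alpha  # alpha = 0 means no scaling
        # Scale weights up, quantize, then fold the inverse scale back in.
        w_q = pseudo_quantize(w * s, n_bits) / s
        err = np.mean((x @ w_q.T - reference) ** 2)
        if err < best_err:
            best_alpha, best_err = float(alpha), float(err)
    return best_alpha, best_err


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    x = rng.normal(size=(64, 16))
    x[:, 3] *= 20.0  # one salient channel with large activations
    w = rng.normal(size=(8, 16))
    alpha, err = search_alpha(w, x)
    print(f"best alpha = {alpha:.2f}, output MSE = {err:.6f}")
```

Because alpha = 0 is part of the grid, the selected alpha can never do worse on the calibration data than plain round-to-nearest quantization without scaling.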