Quick Review: AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
Author: XD / Published: December 7, 2023 00:38 / Research & Learning / Views: 2411
AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
Paper: https://arxiv.org/abs/2306.00978
Code: https://github.com/mit-han-lab/llm-awq/
Organization: MIT