EADST

Quick Review: AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Highlight:

  • Optimal Alpha Scaling: Focuses on determining the optimal alpha value for scaling weights prior to quantization.
相关标签
About Me
XD
Goals determine what you are going to be.
Category
标签云
tar ModelScope GPTQ TSV DeepSeek Docker Image2Text Quantize Quantization Sklearn TensorRT Firewall 继承 Datetime WebCrawler Web OpenCV v2ray CV Clash DeepStream Numpy BF16 BTC Jetson ResNet-50 Search Tracking Dataset Math Random Input Python git-lfs 公式 搞笑 YOLO uWSGI PDF 递归学习法 API Color Cloudreve QWEN WAN Pandas PyCharm NLP Hungarian Bipartite TTS GIT Magnet 版权 Github 净利润 Crawler Nginx BeautifulSoup Shortcut NameSilo GPT4 顶会 多进程 Bin 域名 HaggingFace Baidu Qwen2.5 Base64 Card FP32 FP8 CC LaTeX Bert XML scipy CTC Paddle 证件照 FlashAttention Statistics Transformers Algorithm Template 关于博主 财报 Hotel SPIE 报税 logger Paper CLAP 强化学习 Tensor InvalidArgumentError XGBoost 图形思考法 MD5 Claude Ubuntu Domain FP64 Excel Review 签证 JSON Augmentation Translation Michelin Knowledge Django Miniforge Bitcoin Ptyhon Tiktoken Qwen News UI uwsgi PIP Freesound RGB Linux Plate LLAMA Diagram Pickle Website printf git PyTorch 音频 Markdown transformers Breakpoint FastAPI mmap CAM C++ OCR Hilton ChatGPT 算法题 Mixtral Disk llama.cpp NLTK Qwen2 FP16 CEIR 云服务器 ONNX 阿里云 VPN Conda v0.dev hf LoRA Llama Pillow Permission RAR Pytorch Animate tqdm Logo SQLite Vim Heatmap CUDA Agent HuggingFace Google GGML Land Plotly Safetensors Zip 多线程 LLM Vmess 第一性原理 Video EXCEL Distillation 腾讯云 Git Data IndexTTS2 torchinfo Jupyter PDB Anaconda Interview Password SAM Food COCO Windows OpenAI SQL Proxy GoogLeNet LeetCode AI Attention Gemma TensorFlow SVR VSCode 飞书 diffusers UNIX Use Streamlit VGG-16 CSV
站点统计

本站现有博文321篇,共被浏览767788

本站已经建立2451天!

热门文章
文章归档
回到顶部