Quick Review: Norm Tweaking: High-performance Low-bit Quantization of Large Language Models| 东毅居士

Quick Review: Norm Tweaking: High-performance Low-bit Quantization of Large Language Models

作者：XD / 发表： 2023年12月6日 23:44 / 更新： 2023年12月6日 23:52 / 科研学习 / 阅读量：1433

Norm Tweaking: High-performance Low-bit Quantization of Large Language Models

Paper: Norm Tweaking on arXiv
Code: None available
Organization: Meituan

Steps for Implementation:

Generate Data: Prepare and preprocess the dataset suitable for training the model.
GPTQ: Apply GPTQ method for optimizing the quantization precision of model parameters.
Train LayerNorm Only: Focus on training the Layer Normalization component of the model for fine-tuning and optimization.

本文作者：XD 转载请标明出处：http://www.eadst.com/blog/223

本站采用知识共享署名-非商业性使用-相同方式共享 4.0 国际许可协议进行许可。

上一篇
Review: H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models

下一篇
Quick Review: Optimize Weight Rounding via Signed Gradient Descent for the Quantization of LLMs

相关标签

LLM Quantization

About Me

XD

Goals determine what you are going to be.

Category

标签云

Base64 Distillation LaTeX DeepStream Diagram git Windows FP16 Domain Crawler 证件照 Tensor Llama TensorRT ONNX Vim Attention llama.cpp Qwen Sklearn GoogLeNet OpenAI CSV UI Land Quantization C++ RGB Pillow Translation Datetime transformers MD5 Magnet Clash Ubuntu Heatmap LLAMA PDB Paper Anaconda CEIR Zip CUDA Git 签证 Streamlit Docker 报税 torchinfo NameSilo Paddle v0.dev Web Michelin Bin Color Website SPIE EXCEL SQL API Logo mmap Qwen2.5 uWSGI CV RAR Use Excel Review Pandas Shortcut tar LoRA Hotel Google Permission scipy BTC OpenCV CAM ChatGPT hf Baidu PyCharm Random Mixtral Pickle Video 腾讯云 JSON logger v2ray Input Password Hungarian Github BeautifulSoup TensorFlow PDF VPN Statistics Dataset FastAPI Template Markdown FlashAttention Bert GPTQ FP32 Gemma Algorithm VGG-16 搞笑 Nginx SQLite FP8 Image2Text Tiktoken Quantize 阿里云 COCO OCR printf Numpy XML ResNet-50 Interview Card Breakpoint VSCode DeepSeek Linux git-lfs FP64 Plate YOLO PIP GGML 公式 GIT UNIX WebCrawler LLM Firewall AI TSV ModelScope 飞书关于博主 tqdm uwsgi Transformers CTC GPT4 算法题 PyTorch Ptyhon BF16 Conda Data Claude Cloudreve LeetCode QWEN XGBoost 域名 Disk Knowledge InvalidArgumentError Proxy Qwen2 Food Bipartite SVR Plotly NLTK HuggingFace Augmentation Bitcoin Vmess Python Tracking Pytorch NLP Math HaggingFace Hilton Jetson diffusers Safetensors Django

站点统计

本站现有博文295篇,共被浏览644664次

本站已经建立2224天!

热门文章

文章归档