LLAMA Model Save with INT8 Format | 东毅居士

Author: XD / Published: July 31, 2023 02:51 / Updated: July 31, 2023 02:51 / Programming Notes / Views: 1795

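from transformers import BitsAndBytesConfig
from transformers import AutoModelForCausalLM

# Ask bitsandbytes to quantize the model weights to INT8 at load time.
config = BitsAndBytesConfig(
    load_in_8bit=True,
)
# Folder containing the original LLaMA checkpoint.
path = "/home/llm/model/path/"
# Note: bitsandbytes INT8 generally needs a CUDA GPU; if loading on "cpu" fails, try device_map="auto".
model = AutoModelForCausalLM.from_pretrained(path, device_map="cpu", quantization_config=config)
# Write the quantized weights and the config to a new folder.
model.save_pretrained("model_save_folder-8bit")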
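
After saving, it can be useful to confirm that the output folder really records an 8-bit configuration. The sketch below is only a sanity check, not part of the original post. It assumes that `save_pretrained` writes a `config.json` containing a `quantization_config` entry with a `load_in_8bit` flag, which is how recent transformers versions appear to serialize bitsandbytes settings. The helper name `saved_as_8bit` is hypothetical.

```python
import json
from pathlib import Path


def saved_as_8bit(folder: str) -> bool:
    """Return True if config.json in `folder` records an 8-bit quantization config.

    Assumption: the config file stores the bitsandbytes settings under the
    "quantization_config" key with a boolean "load_in_8bit" field.
    """
    config_file = Path(folder) / "config.json"
    if not config_file.is_file():
        # No config.json means nothing was saved here (or the path is wrong).
        return False
    with config_file.open(encoding="utf-8") as f:
        config = json.load(f)
    # A missing or null "quantization_config" is treated as "not quantized".
    quant = config.get("quantization_config") or {}
    return bool(quant.get("load_in_8bit", False))


if __name__ == "__main__":
    # Folder name taken from the save_pretrained call above.
    print(saved_as_8bit("model_save_folder-8bit"))
```

If the check passes, the folder can usually be reloaded by passing it to `AutoModelForCausalLM.from_pretrained`, provided the installed bitsandbytes version supports loading serialized 8-bit weights.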

Author: XD. Please credit the source when reposting: http://www.eadst.com/blog/199

This site is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
