EADST

LLAMA Model Save with INT8 Format

LLAMA Model Save with INT8 Format

from transformers import BitsAndBytesConfig
from transformers import AutoModelForCausalLM

config = BitsAndBytesConfig(
    load_in_8bit=True,
)
path = "/home/llm/model/path/"
model = AutoModelForCausalLM.from_pretrained(path, device_map="cpu", quantization_config=config)
model.save_pretrained("model_save_folder-8bit")
相关标签
About Me
XD
Goals determine what you are going to be.
Category
标签云
Qwen2.5 SQLite 公式 FlashAttention VGG-16 FP32 Search Jetson Pillow 继承 RAR Streamlit TTS GIT Qwen DeepSeek Proxy uWSGI HaggingFace 关于博主 NLTK ChatGPT Python Knowledge Windows 阿里云 Ubuntu 图形思考法 Color Video Base64 Crawler SQL 财报 CEIR Ptyhon Hilton DeepStream API Safetensors OCR IndexTTS2 LLAMA Anaconda Freesound GPT4 报税 顶会 Input printf Magnet TensorFlow InvalidArgumentError 音频 logger Statistics YOLO Vmess Numpy Pickle Data Website LaTeX Translation 腾讯云 diffusers v0.dev XGBoost Datetime VSCode ONNX JSON NameSilo QWEN Miniforge PDF FP8 Gemma OpenCV GGML Google Michelin Land UNIX Diagram PyTorch Conda Domain VPN PIP git-lfs Template 云服务器 Math 算法题 RGB Markdown Vim 搞笑 WebCrawler Bert Tracking Nginx Web Excel transformers SPIE hf FP16 Password Sklearn torchinfo Image2Text 递归学习法 Pandas 多进程 WAN CC CV Tiktoken ResNet-50 LeetCode 第一性原理 Augmentation SAM Bin Heatmap Docker Interview Plotly icon BTC Github CLAP MD5 Shortcut Django Linux Algorithm CSV Card ModelScope COCO PDB AI Attention Qwen2 Distillation Zip LLM CAM Breakpoint Bitcoin Paddle SVR 强化学习 GoogLeNet EXCEL 版权 Mixtral Hungarian Agent Clash FP64 TensorRT BeautifulSoup Cloudreve CUDA Git Paper NLP Pytorch tar XML Random News Quantize Rebuttal FastAPI Baidu Disk 净利润 域名 Jupyter Llama 证件照 Tensor LoRA Permission 飞书 BF16 HuggingFace v2ray git 图标 PyCharm 多线程 CTC scipy 签证 UI Logo Animate Review TSV C++ tqdm Quantization Dataset Claude Plate Firewall Food GPTQ llama.cpp Bipartite Use uwsgi mmap OpenAI Hotel Transformers
站点统计

本站现有博文324篇,共被浏览822404

本站已经建立2527天!

热门文章
文章归档
回到顶部