EADST

Save a Hugging Face Model as a Single Bin File

max_shard_size (int or str, optional, defaults to "10GB") — Only applicable for models. The maximum size for a checkpoint before being sharded. Checkpoints shard will then be each of size lower than this size. If expressed as a string, needs to be digits followed by a unit (like "5MB").
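
To make the effect of max_shard_size concrete, here is a minimal sketch of how a size limit could split a checkpoint into shards. It is not the Transformers implementation: the helper names parse_size and plan_shards, the decimal unit table, and the tensor sizes are all assumptions made for illustration.

```python
import re

# Decimal units, matching the "5MB" style in the description above (assumption:
# 1 MB = 10**6 bytes; binary units such as MiB are not handled in this sketch).
_UNITS = {"KB": 10**3, "MB": 10**6, "GB": 10**9}


def parse_size(size):
    """Convert an int (bytes) or a string such as "5MB" into a byte count."""
    if isinstance(size, int):
        return size
    match = re.fullmatch(r"(\d+)\s*(KB|MB|GB)", size.strip().upper())
    if match is None:
        raise ValueError(f"size must be digits followed by a unit, got {size!r}")
    return int(match.group(1)) * _UNITS[match.group(2)]


def plan_shards(tensor_sizes, max_shard_size):
    """Greedily group tensors (name -> size in bytes) into shards under the limit."""
    limit = parse_size(max_shard_size)
    shards, current, current_bytes = [], [], 0
    for name, nbytes in tensor_sizes.items():
        # Start a new shard when adding this tensor would exceed the limit.
        if current and current_bytes + nbytes > limit:
            shards.append(current)
            current, current_bytes = [], 0
        current.append(name)
        current_bytes += nbytes
    if current:
        shards.append(current)
    return shards


# Hypothetical tensor sizes, for illustration only (18 GB in total).
sizes = {"embed": 4 * 10**9, "layer0": 5 * 10**9, "layer1": 5 * 10**9, "head": 4 * 10**9}
print(len(plan_shards(sizes, "10GB")))   # 2
print(len(plan_shards(sizes, "100GB")))  # 1
```

With the 10GB limit the 18 GB of hypothetical weights split into two shards; raising the limit above the total size leaves a single shard, which is the idea the rest of this post relies on.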

As the description above indicates, a model can be saved as a single bin file by setting "max_shard_size" to a value larger than the model itself.

base_model.save_pretrained(output_dir, max_shard_size="100GB")  # base_model is a loaded LlamaForCausalLM; writes a single weights file if the model is smaller than 100GB
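
After the save call, it can be worth checking whether the output directory really holds one weights file. The sketch below does that using only the standard library. The helper name describe_checkpoint is hypothetical. The file names it looks for are the ones Transformers typically writes: pytorch_model.bin or model.safetensors for a single file, and numbered shards such as pytorch_model-00001-of-00002.bin otherwise. Which format you get depends on the library version and the safe_serialization argument, so both are checked.

```python
import os
import re
import tempfile

# Names used for sharded weights, e.g. pytorch_model-00001-of-00002.bin
# or model-00001-of-00002.safetensors.
_SHARD_PATTERN = re.compile(r"^(pytorch_model|model)-\d{5}-of-\d{5}\.(bin|safetensors)$")
_SINGLE_FILES = {"pytorch_model.bin", "model.safetensors"}


def describe_checkpoint(output_dir):
    """Return ("sharded", files), ("single", files), or ("missing", [])."""
    names = sorted(os.listdir(output_dir))
    shards = [n for n in names if _SHARD_PATTERN.match(n)]
    if shards:
        return "sharded", shards
    singles = [n for n in names if n in _SINGLE_FILES]
    if singles:
        return "single", singles
    return "missing", []


# Demo on a temporary directory with empty placeholder files (not real weights).
with tempfile.TemporaryDirectory() as tmp:
    for name in ("pytorch_model.bin", "config.json"):
        open(os.path.join(tmp, name), "w").close()
    print(describe_checkpoint(tmp))  # ('single', ['pytorch_model.bin'])
```

In practice, pass the same output_dir that was given to save_pretrained. A "sharded" result means the model was larger than the chosen max_shard_size.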

Reference

PreTrainedModel
