EADST

Save a Hugging Face Model as a Single Bin File

max_shard_size (int or str, optional, defaults to "10GB") — Only applicable for models. The maximum size for a checkpoint before being sharded. Checkpoints shard will then be each of size lower than this size. If expressed as a string, needs to be digits followed by a unit (like "5MB").

Based on this description, a model can be saved as a single bin file by setting "max_shard_size" to a value larger than the model's total checkpoint size.
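To choose a value, it can help to compare a rough estimate of the checkpoint size with the limit. The sketch below is not the library's implementation. The helper names `parse_size` and `fits_in_one_file` are hypothetical, the unit table (decimal "KB/MB/GB", binary "KiB/MiB/GiB") is an assumption, and the estimate counts only parameters times bytes per parameter, ignoring buffers and file overhead.

```python
import re

# Assumed unit table: decimal units are powers of 10, binary units powers of 2.
_UNITS = {
    "KB": 10**3, "MB": 10**6, "GB": 10**9,
    "KIB": 2**10, "MIB": 2**20, "GIB": 2**30,
}


def parse_size(size):
    """Convert an int or a string such as "5MB" into a number of bytes."""
    if isinstance(size, int):
        return size
    match = re.fullmatch(r"(\d+)\s*([A-Za-z]+)", size.strip())
    if match is None or match.group(2).upper() not in _UNITS:
        raise ValueError(f"size must be digits followed by a unit, got {size!r}")
    return int(match.group(1)) * _UNITS[match.group(2).upper()]


def fits_in_one_file(num_params, bytes_per_param, max_shard_size):
    """Rough check: does the estimated weight size stay within the shard limit?"""
    estimated_bytes = num_params * bytes_per_param
    return estimated_bytes <= parse_size(max_shard_size)


# A 7B-parameter model in FP16 (2 bytes per parameter) is about 14GB.
print(fits_in_one_file(7_000_000_000, 2, "10GB"))   # False
print(fits_in_one_file(7_000_000_000, 2, "100GB"))  # True
```

Under these assumptions, a 7B FP16 model would be sharded with a "10GB" limit but not with "100GB".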

# base_model is an already loaded LlamaForCausalLM instance
base_model.save_pretrained(output_dir, max_shard_size="100GB")  # saves a single weights file if the model is smaller than 100GB
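After saving, a quick way to confirm the result is to list the weight files in the output directory. This is a minimal sketch using only the standard library. The helper names are hypothetical, and the file name patterns ("pytorch_model*.bin", "model*.safetensors") are assumptions based on the names Transformers commonly writes. Depending on the installed version, the single file may be "model.safetensors" rather than "pytorch_model.bin".

```python
from pathlib import Path

# Assumed naming patterns for weight files written by save_pretrained.
WEIGHT_PATTERNS = ("pytorch_model*.bin", "model*.safetensors")


def list_weight_files(output_dir):
    """Return the sorted names of weight files found in output_dir."""
    out = Path(output_dir)
    names = set()
    for pattern in WEIGHT_PATTERNS:
        names.update(p.name for p in out.glob(pattern) if p.is_file())
    return sorted(names)


def is_single_file_checkpoint(output_dir):
    """True when exactly one weight file is present (no shards)."""
    return len(list_weight_files(output_dir)) == 1
```

For example, `is_single_file_checkpoint(output_dir)` should return True when the save above produced one file, and False when the directory contains shards such as "pytorch_model-00001-of-00002.bin".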

Reference

PreTrainedModel
