EADST

Save a Hugging Face Model as a Single Bin File

max_shard_size (int or str, optional, defaults to "10GB") — Only applicable for models. The maximum size for a checkpoint before being sharded. Checkpoints shard will then be each of size lower than this size. If expressed as a string, needs to be digits followed by a unit (like "5MB").

According to this description from the documentation, a model can be saved as a single bin file by setting "max_shard_size" to a value larger than the model's total size.

# base_model is an already loaded LlamaForCausalLM instance
base_model.save_pretrained(output_dir, max_shard_size="100GB")  # saves one weight file if the model is smaller than 100GB
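
To check in advance whether a given limit will produce a single file, the model's size can be estimated from its parameter count. The sketch below is only an illustration: `size_to_bytes` and `fits_in_one_shard` are hypothetical helpers (not part of transformers), decimal units are assumed, and the 7-billion-parameter figure is a made-up example.

```python
import re

# Hypothetical helper, not a transformers API: convert a size given as an int
# (bytes) or a string of digits plus a unit, such as "5MB", into bytes.
# Decimal units are assumed here (1 GB = 10**9 bytes).
_UNITS = {"KB": 10**3, "MB": 10**6, "GB": 10**9, "TB": 10**12}


def size_to_bytes(size):
    if isinstance(size, int):
        return size
    match = re.fullmatch(r"(\d+)\s*(KB|MB|GB|TB)", size.strip().upper())
    if match is None:
        raise ValueError(f"size must be digits followed by a unit, got {size!r}")
    return int(match.group(1)) * _UNITS[match.group(2)]


def fits_in_one_shard(num_params, bytes_per_param, max_shard_size):
    # A checkpoint is only split when its weights exceed the limit, so a model
    # whose total size is within the limit ends up in one file.
    return num_params * bytes_per_param <= size_to_bytes(max_shard_size)


# Illustrative numbers: 7 billion parameters stored in FP16 (2 bytes each).
num_params = 7_000_000_000
print(fits_in_one_shard(num_params, 2, "10GB"))   # False: 14 GB exceeds 10 GB
print(fits_in_one_shard(num_params, 2, "100GB"))  # True: 14 GB fits in 100 GB
```

Note that recent transformers releases write safetensors files by default, so passing safe_serialization=False to save_pretrained may also be needed to get a .bin file.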

Reference

PreTrainedModel
