EADST

Save Hugging Face Model with One Bin

max_shard_size (int or str, optional, defaults to "10GB") — Only applicable for models. The maximum size for a checkpoint before being sharded. Checkpoints shard will then be each of size lower than this size. If expressed as a string, needs to be digits followed by a unit (like "5MB").

Based on the introduction, one bin model can be saved by changing the "max_shard_size".

LlamaForCausalLM.save_pretrained(base_model, output_dir, max_shard_size="100GB") # save one bin if the model is less than 100GB

Reference

PreTrainedModel

About Me
XD
Goals determine what you are going to be.
Category
标签云
Linux XML printf Heatmap Food Random API BF16 Logo RAR tqdm Paddle Land Permission Pickle Review Statistics UNIX Bipartite TTS Tracking Magnet Nginx 财报 Markdown Qwen2 Git Disk Input Password SQLite Qwen Distillation QWEN 顶会 多进程 Pillow LLM SQL Quantize IndexTTS2 Base64 阿里云 Shortcut EXCEL OpenAI 强化学习 域名 Interview logger ResNet-50 Hungarian v2ray OCR PyTorch FP64 算法题 Cloudreve Card Translation TensorRT 飞书 Jupyter 净利润 Qwen2.5 Hotel LoRA Augmentation RGB Tensor Tiktoken NameSilo HaggingFace 论文速读 报税 DeepStream HuggingFace mmap LeetCode Google Plate GPTQ 公式 Gemma Search Website PIP CTC Firewall News VGG-16 Github CEIR git TensorFlow torchinfo 腾讯云 Claude 签证 GPT4 Rebuttal 第一性原理 icon uwsgi Mixtral Bin Plotly ModelScope Template Numpy GoogLeNet Python 图标 Jetson Ubuntu 音频 Ptyhon Safetensors Windows WAN ONNX MD5 Bert LaTeX Video COCO CSV 版权 Breakpoint XGBoost OpenCV transformers llama.cpp InvalidArgumentError DeepSeek SVR GGML 图形思考法 LLAMA Dataset 搞笑 CC SPIE 继承 GIT Sklearn FP32 AI FastAPI Use 多线程 PDF 证件照 FP8 PyCharm Pytorch Paper CV Vim hf 云服务器 NLTK FP16 ChatGPT VSCode 递归学习法 Baidu Streamlit JSON Vmess Pandas YOLO Conda 关于博主 Zip Quantization NLP BTC Diagram Domain Llama scipy Knowledge Hilton Algorithm Anaconda TSV git-lfs uWSGI BeautifulSoup Agent C++ Data v0.dev Image2Text Django CAM Freesound Crawler WebCrawler SAM Bitcoin Datetime Docker Excel Attention Michelin CUDA Web diffusers UI Clash Transformers VPN Proxy PDB Color Animate tar Math Miniforge FlashAttention CLAP
站点统计

本站现有博文326篇,共被浏览825494

本站已经建立2531天!

热门文章
文章归档
回到顶部