EADST

Save Hugging Face Model with One Bin

max_shard_size (int or str, optional, defaults to "10GB") — Only applicable for models. The maximum size for a checkpoint before being sharded. Checkpoints shard will then be each of size lower than this size. If expressed as a string, needs to be digits followed by a unit (like "5MB").

Based on the introduction, one bin model can be saved by changing the "max_shard_size".

LlamaForCausalLM.save_pretrained(base_model, output_dir, max_shard_size="100GB") # save one bin if the model is less than 100GB

Reference

PreTrainedModel

About Me
XD
Goals determine what you are going to be.
Category
标签云
Ptyhon uWSGI HaggingFace Interview logger v0.dev BeautifulSoup XGBoost icon Bert Paddle LLM Bin FlashAttention Numpy FP8 Plotly CC Animate Jupyter Magnet hf Template GPT4 Algorithm v2ray Distillation AI Video GoogLeNet 关于博主 BF16 RAR Agent Logo git 递归学习法 XML Disk Math Statistics 论文速读 mmap 阿里云 Sklearn 公式 Rebuttal 论文 CV 净利润 scipy 域名 NLTK Github Heatmap CEIR SVR GGML WAN Hilton ResNet-50 JSON Linux News ModelScope llama.cpp Vmess torchinfo DeepSeek 搞笑 腾讯云 Image2Text Claude TSV Cloudreve 强化学习 Baidu Pytorch Python GPTQ DeepStream MD5 WebCrawler PIP ChatGPT Domain API TTS Shortcut InvalidArgumentError uwsgi Food PyTorch Safetensors Tiktoken LLAMA Qwen 证件照 Docker Proxy Color PDB Website Freesound tar OpenCV Windows Plate Web Paper Jetson GIT Google Augmentation 算法题 UI Mixtral TensorRT FP64 C++ diffusers OpenAI git-lfs UNIX tqdm 第一性原理 NLP 多线程 Breakpoint 图形思考法 Dataset HuggingFace EXCEL 报税 Markdown ONNX PDF CSV 云服务器 Qwen2.5 Tracking Permission Excel 图标 Zip 顶会 SAM transformers 财报 BTC FastAPI 音频 Attention Firewall VGG-16 Gemma Anaconda Pandas Datetime Hotel TensorFlow 多进程 VSCode OCR Tensor Input Hungarian Password Random Land Data Review Crawler Pickle VPN FP16 Miniforge Search Nginx SQL SQLite LeetCode 飞书 Ubuntu Card CLAP YOLO Base64 Clash Bitcoin COCO Michelin Llama 签证 CTC FP32 Use SPIE printf Translation Git Conda Vim 继承 RGB Pillow IndexTTS2 PyCharm NameSilo CUDA QWEN LaTeX Quantization Knowledge Bipartite Quantize Qwen2 CAM Diagram Streamlit Transformers Django 版权 LoRA
站点统计

本站现有博文328篇,共被浏览847305

本站已经建立2553天!

热门文章
文章归档
回到顶部