EADST

Save Hugging Face Model with One Bin

max_shard_size (int or str, optional, defaults to "10GB") — Only applicable for models. The maximum size for a checkpoint before being sharded. Checkpoints shard will then be each of size lower than this size. If expressed as a string, needs to be digits followed by a unit (like "5MB").

Based on the introduction, one bin model can be saved by changing the "max_shard_size".

LlamaForCausalLM.save_pretrained(base_model, output_dir, max_shard_size="100GB") # save one bin if the model is less than 100GB

Reference

PreTrainedModel

About Me
XD
Goals determine what you are going to be.
Category
标签云
Color GPT4 递归学习法 Pandas GPTQ UI Use LLM IndexTTS2 Github RGB Random 财报 Baidu Bert FP16 CSV Docker CUDA v2ray Logo tar Linux Tiktoken PDF Pytorch 顶会 关于博主 强化学习 Django Review VPN AI 净利润 WAN TTS UNIX Cloudreve ChatGPT Excel printf CV Bitcoin Tensor torchinfo TSV Plotly BTC Mixtral FP64 SQLite mmap uWSGI Breakpoint Heatmap Transformers Plate VGG-16 ResNet-50 证件照 Anaconda Domain Ptyhon XML Statistics PyCharm FastAPI CLAP NLTK Google FP8 Distillation Augmentation InvalidArgumentError CC transformers YOLO Land MD5 XGBoost Math Safetensors ModelScope Numpy Freesound 多进程 Streamlit PyTorch DeepStream Food BeautifulSoup CAM Llama Magnet PDB TensorFlow SQL 阿里云 Animate DeepSeek Disk 公式 Data SVR Zip BF16 Attention Hungarian 第一性原理 COCO LaTeX Nginx scipy Paddle 签证 Permission Bipartite Tracking Qwen2 Image2Text GGML Crawler C++ Web Michelin Video Knowledge Algorithm Hilton diffusers Windows Gemma QWEN VSCode FlashAttention Ubuntu Hotel Pillow 飞书 Git Quantization 报税 GoogLeNet SAM OCR Diagram HaggingFace FP32 Quantize llama.cpp 算法题 音频 HuggingFace JSON Markdown Qwen2.5 OpenCV hf 多线程 Vmess Pickle Firewall uwsgi LoRA ONNX Dataset Interview RAR tqdm Card 图形思考法 Paper logger Jupyter GIT git Claude WebCrawler Password Jetson NLP 继承 Miniforge PIP SPIE OpenAI v0.dev Shortcut 版权 git-lfs 域名 CTC Sklearn Conda Vim Translation Base64 Qwen Agent EXCEL LLAMA 搞笑 Input 腾讯云 NameSilo CEIR Python Proxy LeetCode Bin TensorRT Search Website News Clash Template Datetime API
站点统计

本站现有博文320篇,共被浏览756684

本站已经建立2421天!

热门文章
文章归档
回到顶部