EADST

Save a Hugging Face Model as a Single Bin File

max_shard_size (int or str, optional, defaults to "10GB") — Only applicable for models. The maximum size for a checkpoint before being sharded. Checkpoints shard will then be each of size lower than this size. If expressed as a string, needs to be digits followed by a unit (like "5MB").

According to this description, a model can be saved as a single bin file by setting "max_shard_size" to a value larger than the model's total size.

base_model.save_pretrained(output_dir, max_shard_size="100GB")  # base_model is a loaded LlamaForCausalLM; writes a single bin file if the model is smaller than 100GB
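To check in advance whether a given "max_shard_size" is large enough, the rule above can be sketched in plain Python. This is only an illustration, not the library's own code: `parse_size` and `fits_in_one_file` are hypothetical helpers, and the unit table (decimal "KB/MB/GB", binary "KiB/MiB/GiB") is an assumption about how size strings are interpreted.

```python
import re

# Assumed unit multipliers: decimal for KB/MB/GB, binary for KiB/MiB/GiB.
_UNITS = {
    "KB": 10**3, "MB": 10**6, "GB": 10**9,
    "KIB": 2**10, "MIB": 2**20, "GIB": 2**30,
}


def parse_size(size):
    """Convert an int, or a string such as "5MB", into a number of bytes."""
    if isinstance(size, int):
        return size
    match = re.fullmatch(r"(\d+)\s*([A-Za-z]+)", size.strip())
    if match is None or match.group(2).upper() not in _UNITS:
        raise ValueError(f"expected digits followed by a unit, got {size!r}")
    return int(match.group(1)) * _UNITS[match.group(2).upper()]


def fits_in_one_file(num_params, bytes_per_param, max_shard_size):
    """Return True if the estimated weight size does not exceed the shard limit."""
    total_bytes = num_params * bytes_per_param
    return total_bytes <= parse_size(max_shard_size)


# A 7B-parameter model stored in float16 (2 bytes per parameter) is about 14 GB.
print(fits_in_one_file(7_000_000_000, 2, "10GB"))   # False: would be sharded
print(fits_in_one_file(7_000_000_000, 2, "100GB"))  # True: one file is enough
```

Note that recent transformers releases save weights in safetensors format by default. As far as I know, passing `safe_serialization=False` to `save_pretrained` is what produces a `.bin` file there, so it may be needed alongside `max_shard_size`.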

Reference

PreTrainedModel
