EADST

Save a Hugging Face Model as a Single Bin File

max_shard_size (int or str, optional, defaults to "10GB") — Only applicable for models. The maximum size for a checkpoint before being sharded. Checkpoints shard will then be each of size lower than this size. If expressed as a string, needs to be digits followed by a unit (like "5MB").

According to this description, a model can be saved as a single bin file by setting "max_shard_size" to a value larger than the total size of the checkpoint.

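base_model.save_pretrained(output_dir, max_shard_size="100GB") # base_model is an already loaded LlamaForCausalLM; saves one weights file if the model is smaller than 100GB
# Recent transformers 4.x releases write .safetensors by default; pass safe_serialization=False to get a .bin file.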
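
To see why a large "max_shard_size" produces a single file, here is a minimal sketch of the sharding arithmetic. It is not the transformers implementation: parse_size and plan_shards are hypothetical helpers, the tensor names and sizes are made up, and the greedy packing and the unit convention ("GB" = 10**9 bytes, "GiB" = 2**30 bytes) are assumptions made for illustration.

```python
import re

# Assumed unit convention: "GB" is decimal (10**9), "GiB" is binary (2**30).
_UNITS = {"KB": 10**3, "MB": 10**6, "GB": 10**9,
          "KIB": 2**10, "MIB": 2**20, "GIB": 2**30}


def parse_size(size):
    """Convert an int or a string such as "5MB" into a number of bytes."""
    if isinstance(size, int):
        return size
    match = re.fullmatch(r"(\d+)\s*([A-Za-z]+)", size.strip())
    if match is None or match.group(2).upper() not in _UNITS:
        raise ValueError(f"expected digits followed by a unit, got {size!r}")
    return int(match.group(1)) * _UNITS[match.group(2).upper()]


def plan_shards(tensor_sizes, max_shard_size="10GB"):
    """Greedily pack tensors (name -> bytes) into shards no larger than the limit.

    A single tensor larger than the limit still gets a shard of its own.
    """
    limit = parse_size(max_shard_size)
    shards = [[]]
    current = 0
    for name, nbytes in tensor_sizes.items():
        # Start a new shard when adding this tensor would exceed the limit.
        if shards[-1] and current + nbytes > limit:
            shards.append([])
            current = 0
        shards[-1].append(name)
        current += nbytes
    return shards


# Hypothetical 15 GB checkpoint made of three tensors.
sizes = {"embed": 6 * 10**9, "layers": 7 * 10**9, "lm_head": 2 * 10**9}
print(plan_shards(sizes, "10GB"))   # two shards
print(plan_shards(sizes, "100GB"))  # one shard holding every tensor
```

With the 10GB limit the 15 GB checkpoint is split into two shards; raising the limit above the total size leaves everything in one shard, which is the effect used in the save_pretrained call above.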

Reference

PreTrainedModel
