EADST

Sharding and SafeTensors in Hugging Face Transformers

In the Hugging Face transformers library, managing large models efficiently is crucial, especially when working with limited disk space or specific file size requirements. Two key features that help with this are sharding and the use of SafeTensors.

Sharding

Sharding is the process of splitting a large model's weights into smaller files or "shards." This is particularly useful when dealing with large models that exceed file size limits or when you want to manage storage more effectively.

Usage

To shard a model during the saving process, you can use the max_shard_size parameter in the save_pretrained method. Here's an example:

# Save the model with sharding, setting the maximum shard size to 1GB
model.save_pretrained('./model_directory', max_shard_size="1GB")

In this example, the model's weights will be divided into multiple files, each not exceeding 1GB. This can make storage and transfer more manageable, especially when dealing with large-scale models.

SafeTensors

The safetensors library provides a new format for storing tensors in a safe and efficient way. Unlike traditional formats like PyTorch's .pt files, SafeTensors ensures that the tensor data cannot be accidentally executed as code, offering an additional layer of security. This is particularly important when sharing models across different systems or with the community.

Usage

To save a model using SafeTensors, simply specify the safe_serialization parameter when saving:

# Save the model using SafeTensors format
model.save_pretrained('./model_directory', safe_serialization=True)

This will create files with the .safetensors extension, ensuring the saved tensors are stored safely.

Combining Sharding and SafeTensors

You can combine both sharding and SafeTensors to save a large model securely and efficiently:

# Save the model with sharding and SafeTensors
model.save_pretrained('./model_directory', max_shard_size="1GB", safe_serialization=True)

This setup splits the model into shards, each in the SafeTensors format, offering both manageability and security.

Conclusion

By leveraging sharding and SafeTensors, Hugging Face transformers users can handle large models more effectively. Sharding helps manage file sizes, while SafeTensors ensures the safe storage of tensor data. These features are essential for anyone working with large-scale models, providing both practical and security benefits.

相关标签
About Me
XD
Goals determine what you are going to be.
Category
标签云
LeetCode Plotly NameSilo Bert 论文速读 Augmentation v2ray Knowledge JSON ONNX diffusers Sklearn XGBoost printf Image2Text Docker GoogLeNet Firewall Web uwsgi C++ Dataset LLAMA PDB Color 图形思考法 Qwen HuggingFace VSCode Interview Hotel 版权 UI VPN hf CUDA GGML Baidu MD5 Math Numpy Review Pillow Qwen2 PyTorch FP8 Excel Template EXCEL IndexTTS2 llama.cpp BTC Windows 飞书 Video Shortcut UNIX PDF 签证 NLTK Git TensorRT Markdown Python VGG-16 云服务器 GPT4 mmap Qwen2.5 多进程 ResNet-50 Heatmap 净利润 FP64 Gemma CEIR 域名 CC Vmess Translation LLM Pandas Pytorch Base64 CLAP XML uWSGI scipy News Crawler Quantize Tensor Algorithm AI QWEN 证件照 tqdm tar Password v0.dev NLP RAR Vim FP32 Miniforge 音频 递归学习法 第一性原理 Claude Agent LoRA TSV Bin torchinfo OCR 继承 COCO FastAPI WAN 财报 Breakpoint Distillation 多线程 论文 RGB HaggingFace Statistics Google Plate CTC Tiktoken LaTeX BF16 Data Card GIT Jetson PyCharm Transformers Cloudreve Anaconda CV YOLO CSV Ptyhon Search Bitcoin Hungarian FP16 Bipartite SQL logger Llama Tracking InvalidArgumentError Zip Proxy ChatGPT Diagram Permission TensorFlow ModelScope Rebuttal Paper SAM Disk Pickle Hilton DeepStream Safetensors SQLite 关于博主 图标 git Website Conda WebCrawler CAM git-lfs FlashAttention 强化学习 OpenCV DeepSeek 顶会 GPTQ Michelin SVR icon Jupyter PIP Logo Streamlit API Mixtral Random Ubuntu Linux Use transformers 腾讯云 Input Nginx TTS 算法题 搞笑 Clash Animate Attention Food Github 阿里云 Freesound Magnet Quantization 报税 SPIE Paddle Land Django OpenAI 公式 Datetime Domain BeautifulSoup
站点统计

本站现有博文327篇,共被浏览832711

本站已经建立2537天!

热门文章
文章归档
回到顶部