EADST

CONTINUE READING
CONTINUE READING
CONTINUE READING

Quick Review: ZeroQuant-FP

ZeroQuant-FP: A Leap Forward in LLMs Post-Training W4A8 Quantization Using Floating-Point Formats

Paper: https://arxiv.org/abs/2307.09782

Code: https://github.com/microsoft/DeepSpeed

Organization: Microsoft

CONTINUE READING
CONTINUE READING
CONTINUE READING
CONTINUE READING
CONTINUE READING
CONTINUE READING
  • 1
  • About Me
    XD
    Goals determine what you are going to be.
    Category
    标签云
    Pytorch Linux CAM Plotly Attention TSV Qwen2.5 Hungarian 算法题 UNIX WAN Anaconda DeepStream Qwen torchinfo OpenAI CTC Vim 净利润 Statistics TTS SQL scipy Use Magnet Vmess Web Tensor PyCharm Gemma SQLite Domain QWEN API tqdm Baidu BF16 Distillation CV CSV Paper AI HuggingFace CLAP VSCode Animate Hotel LeetCode 多线程 Knowledge BeautifulSoup Ptyhon Interview logger 阿里云 Math NLP Python Pandas Numpy Pickle SAM HaggingFace llama.cpp MD5 RAR Input InvalidArgumentError 域名 NameSilo Safetensors Miniforge Bitcoin Website Land GPT4 FP16 EXCEL 搞笑 Crawler Claude Nginx transformers Food Hilton Clash Plate JSON Zip Jupyter Docker Image2Text Datetime Qwen2 Mixtral WebCrawler 多进程 LLAMA UI 腾讯云 FP64 Sklearn PIP 飞书 Card GPTQ COCO Paddle Windows Quantization uWSGI Template Google BTC Streamlit Llama VGG-16 Excel VPN printf 证件照 ModelScope TensorRT Jetson Password Review ONNX OpenCV Random v2ray LoRA v0.dev Bipartite Quantize Color Algorithm Disk Permission Shortcut tar PDF Base64 财报 IndexTTS2 hf FlashAttention Tracking git-lfs Firewall 关于博主 Markdown Dataset Cloudreve Bin Ubuntu TensorFlow git 报税 Git YOLO DeepSeek Pillow CC NLTK 版权 Transformers SPIE 公式 Augmentation GIT SVR Bert FP32 Breakpoint ResNet-50 FP8 C++ OCR diffusers mmap Tiktoken CUDA CEIR Freesound Translation Django XGBoost 音频 签证 XML RGB Heatmap Proxy FastAPI ChatGPT LaTeX GGML Logo LLM Michelin Video Conda Diagram GoogLeNet PyTorch PDB 继承 uwsgi Github Data
    站点统计

    本站现有博文311篇,共被浏览743013

    本站已经建立2383天!

    热门文章
    文章归档
    回到顶部