EADST

CONTINUE READING
CONTINUE READING
CONTINUE READING

Quick Review: ZeroQuant-FP

ZeroQuant-FP: A Leap Forward in LLMs Post-Training W4A8 Quantization Using Floating-Point Formats

Paper: https://arxiv.org/abs/2307.09782

Code: https://github.com/microsoft/DeepSpeed

Organization: Microsoft

CONTINUE READING
CONTINUE READING
CONTINUE READING
CONTINUE READING
CONTINUE READING
CONTINUE READING
  • 1
  • About Me
    XD
    Goals determine what you are going to be.
    Category
    标签云
    Food Google OCR Augmentation COCO Use FP32 Vmess Baidu 域名 OpenAI Algorithm CV Interview FastAPI Cloudreve XGBoost Numpy Pillow Hungarian 算法题 Safetensors 报税 Plotly logger Base64 BF16 净利润 Vim TensorFlow Github Claude SPIE DeepSeek Windows Paddle git Zip v2ray 音频 C++ Sklearn Attention NLTK Heatmap mmap UI SQLite VGG-16 Python LLAMA Logo Proxy FP16 GGML uWSGI transformers Tracking printf OpenCV 飞书 Qwen2.5 GIT 公式 News PDB Search Git RGB BTC Distillation CAM Shortcut 签证 Jupyter Image2Text tar 证件照 TTS Translation Hilton Ubuntu CTC YOLO Excel Paper Video API git-lfs NameSilo FP8 QWEN HuggingFace Pytorch SAM Input Random CC Agent Bert AI LeetCode GoogLeNet PIP RAR Quantize Dataset PDF Math WAN Nginx ONNX TensorRT ResNet-50 v0.dev Template ChatGPT CLAP CEIR XML Quantization Crawler Bitcoin Permission Qwen2 Web LLM NLP Website Markdown Tiktoken VSCode uwsgi Jetson LoRA 多线程 Knowledge Clash Land hf HaggingFace Gemma Qwen 搞笑 PyTorch Anaconda Miniforge 多进程 InvalidArgumentError scipy CSV 顶会 Pandas 腾讯云 Disk CUDA Statistics GPTQ Animate LaTeX IndexTTS2 Streamlit Bin EXCEL Django Michelin FP64 Bipartite Firewall 版权 SQL torchinfo Pickle VPN TSV 财报 Conda Password BeautifulSoup PyCharm Llama SVR 图形思考法 Review Breakpoint DeepStream MD5 强化学习 继承 GPT4 递归学习法 Color Ptyhon Card Magnet UNIX Hotel 阿里云 llama.cpp Docker Linux Transformers tqdm Datetime FlashAttention Tensor Mixtral WebCrawler Plate diffusers Freesound Diagram 关于博主 Data Domain ModelScope JSON 第一性原理
    站点统计

    本站现有博文320篇,共被浏览759629

    本站已经建立2428天!

    热门文章
    文章归档
    回到顶部