EADST

CONTINUE READING
CONTINUE READING
CONTINUE READING

Quick Review: ZeroQuant-FP

ZeroQuant-FP: A Leap Forward in LLMs Post-Training W4A8 Quantization Using Floating-Point Formats

Paper: https://arxiv.org/abs/2307.09782

Code: https://github.com/microsoft/DeepSpeed

Organization: Microsoft

CONTINUE READING
CONTINUE READING
CONTINUE READING
CONTINUE READING
CONTINUE READING
  • 1
  • About Me
    XD
    Goals determine what you are going to be.
    Category
    标签云
    Bert CUDA SVR 域名 算法题 Interview v0.dev CV Anaconda Vim AI PDB Jetson 证件照 printf ChatGPT Gemma Google HaggingFace 图形思考法 Plate SAM Math CTC Pickle transformers Michelin Hotel CEIR SQLite git PyTorch Numpy FP64 NLP Input Food Card Firewall Shortcut Windows Hungarian 阿里云 Pandas Data logger OpenAI ONNX VSCode BF16 Conda LeetCode GGML 顶会 Markdown Quantize FP16 Zip TensorRT Website TSV Distillation Pytorch Ubuntu torchinfo Image2Text WAN git-lfs CAM OpenCV Search Review Freesound Safetensors Git 多进程 TTS Breakpoint IndexTTS2 GPTQ 递归学习法 Bipartite Miniforge Land Bin Baidu Clash LaTeX NameSilo Attention JSON 音频 GoogLeNet Translation Datetime Python Color 关于博主 FastAPI PyCharm Agent RAR 签证 scipy LLM SPIE Linux QWEN Knowledge NLTK VPN RGB Disk Jupyter Docker Use Dataset OCR Vmess ResNet-50 PIP Hilton Permission C++ MD5 Web Algorithm XGBoost HuggingFace FlashAttention 腾讯云 WebCrawler COCO Magnet FP32 Statistics Crawler VGG-16 Llama InvalidArgumentError Random Django Animate Nginx GIT SQL UNIX EXCEL Ptyhon DeepStream tqdm Tiktoken Tensor Qwen Base64 Tracking Transformers BeautifulSoup hf FP8 tar uwsgi Diagram Mixtral Sklearn DeepSeek Qwen2.5 Proxy llama.cpp Template Qwen2 mmap XML GPT4 PDF UI ModelScope 公式 CLAP 多线程 News 净利润 报税 LLAMA v2ray API Augmentation CSV Streamlit 第一性原理 Paddle diffusers 搞笑 Pillow LoRA 继承 Paper 财报 飞书 Quantization uWSGI Password CC Plotly Cloudreve BTC 强化学习 Claude Domain TensorFlow Logo Video Github YOLO Heatmap Excel 版权 Bitcoin
    站点统计

    本站现有博文320篇,共被浏览759196

    本站已经建立2427天!

    热门文章
    文章归档
    回到顶部