EADST

CONTINUE READING
CONTINUE READING
CONTINUE READING

Quick Review: ZeroQuant-FP

ZeroQuant-FP: A Leap Forward in LLMs Post-Training W4A8 Quantization Using Floating-Point Formats

Paper: https://arxiv.org/abs/2307.09782

Code: https://github.com/microsoft/DeepSpeed

Organization: Microsoft

CONTINUE READING
CONTINUE READING
CONTINUE READING
CONTINUE READING
CONTINUE READING
CONTINUE READING
  • 1
  • About Me
    XD
    Goals determine what you are going to be.
    Category
    标签云
    Transformers Input Google Diagram SAM SQL Password Windows CTC Cloudreve Jetson Llama PDB Data API 音频 证件照 NLP SVR Pickle Pytorch Clash Hungarian LoRA Miniforge WAN transformers Plotly RGB Firewall OpenAI Bert FP32 SPIE 腾讯云 git-lfs Ubuntu Disk PyTorch XGBoost Pandas VPN CC InvalidArgumentError Logo Augmentation DeepSeek Color Quantization tar ResNet-50 CEIR HaggingFace BF16 LeetCode IndexTTS2 Translation logger Qwen2 GPTQ printf Animate TTS ChatGPT uwsgi git Base64 净利润 Michelin Math WebCrawler scipy Video YOLO Mixtral Food Freesound 版权 v0.dev LLM Permission CV Web Markdown Zip Hilton Card OCR UNIX 关于博主 Breakpoint Crawler Website Vmess llama.cpp Domain DeepStream Interview XML LaTeX JSON 签证 Tiktoken RAR Attention TensorRT Linux CAM Sklearn 公式 Tracking 财报 Vim Use PIP UI Anaconda Nginx Knowledge Proxy 域名 Review Plate Github uWSGI Streamlit hf diffusers torchinfo 飞书 Django Image2Text Statistics FP8 Gemma EXCEL VSCode LLAMA Template Docker ONNX CLAP NLTK Distillation Shortcut v2ray mmap CSV SQLite FP64 tqdm PyCharm Claude NameSilo Qwen BTC VGG-16 FlashAttention Magnet Jupyter Pillow MD5 Hotel Bin C++ Baidu Excel FP16 Heatmap Tensor 搞笑 算法题 GIT ModelScope Qwen2.5 Dataset 多线程 Ptyhon Land Algorithm Numpy Random GoogLeNet GPT4 Bitcoin FastAPI TSV PDF 继承 QWEN Python HuggingFace Quantize TensorFlow Conda AI COCO 多进程 阿里云 OpenCV GGML BeautifulSoup Datetime Git Paddle Paper Safetensors 报税 CUDA Bipartite
    站点统计

    本站现有博文309篇,共被浏览732831

    本站已经建立2370天!

    热门文章
    文章归档
    回到顶部