EADST

CONTINUE READING
CONTINUE READING
CONTINUE READING

Quick Review: ZeroQuant-FP

ZeroQuant-FP: A Leap Forward in LLMs Post-Training W4A8 Quantization Using Floating-Point Formats

Paper: https://arxiv.org/abs/2307.09782

Code: https://github.com/microsoft/DeepSpeed

Organization: Microsoft

CONTINUE READING
CONTINUE READING
CONTINUE READING
CONTINUE READING
CONTINUE READING
  • 1
  • About Me
    XD
    Goals determine what you are going to be.
    Category
    标签云
    Tensor Dataset GGML Qwen2 LaTeX git COCO SQL Website CSV uWSGI WAN RGB CC Anaconda Search LLAMA Hilton UI Bert Shortcut WebCrawler Algorithm BTC TTS Bitcoin TensorRT HaggingFace Distillation mmap GPTQ tar Heatmap Bin CV 报税 Plate Michelin VGG-16 Ptyhon 图形思考法 Clash Streamlit CLAP Django Review MD5 版权 CTC OpenCV Markdown Gemma 阿里云 TensorFlow YOLO CUDA Diagram Tiktoken Augmentation BF16 第一性原理 PIP Food Pickle Magnet Password FP8 uwsgi llama.cpp PDF 继承 域名 ONNX QWEN v0.dev Pillow Paper Breakpoint Conda InvalidArgumentError torchinfo 证件照 Numpy VSCode Logo Statistics 多进程 v2ray 搞笑 FP32 Data SVR Card Web BeautifulSoup Bipartite Template EXCEL Pytorch Video XGBoost LeetCode tqdm Baidu Mixtral Qwen scipy NLTK HuggingFace Math Transformers JSON FP16 Animate GPT4 Crawler Input DeepStream XML Git Land TSV transformers 顶会 Base64 Agent Plotly News Tracking Vim GIT IndexTTS2 C++ Firewall Windows Permission Jupyter SPIE Hotel Cloudreve Github Interview Excel PDB Knowledge 多线程 git-lfs Sklearn 腾讯云 Attention Zip 签证 Safetensors Use Hungarian CAM Translation LLM FastAPI Quantization SQLite printf Proxy UNIX Claude Vmess Llama 强化学习 Ubuntu Nginx NLP PyTorch ChatGPT 财报 logger NameSilo 公式 Python API Google OpenAI Miniforge CEIR hf FlashAttention Quantize OCR Image2Text diffusers 音频 Color Qwen2.5 Linux ModelScope GoogLeNet Paddle PyCharm Random 飞书 RAR ResNet-50 递归学习法 Pandas FP64 DeepSeek Disk Domain 算法题 Datetime AI Freesound 关于博主 Jetson Docker LoRA SAM 净利润 VPN
    站点统计

    本站现有博文320篇,共被浏览759161

    本站已经建立2427天!

    热门文章
    文章归档
    回到顶部