EADST

CONTINUE READING
CONTINUE READING
CONTINUE READING
CONTINUE READING

Quick Review: ZeroQuant-FP

ZeroQuant-FP: A Leap Forward in LLMs Post-Training W4A8 Quantization Using Floating-Point Formats

Paper: https://arxiv.org/abs/2307.09782

Code: https://github.com/microsoft/DeepSpeed

Organization: Microsoft

CONTINUE READING
CONTINUE READING
CONTINUE READING
CONTINUE READING
CONTINUE READING
  • 1
  • About Me
    XD
    Goals determine what you are going to be.
    Category
    标签云
    证件照 Datetime FP32 图形思考法 Hotel Numpy llama.cpp Review logger FP8 Llama Shortcut Vim API Image2Text RGB Statistics Quantize Translation Qwen2 YOLO SQLite Distillation tqdm Ptyhon 净利润 Markdown Tracking UNIX 继承 BeautifulSoup OCR CSV Nginx JSON LLAMA VPN printf DeepSeek Mixtral FP64 Git VSCode ChatGPT Use Template Claude Baidu Michelin TensorFlow Pillow ModelScope diffusers Augmentation Linux Excel GIT SPIE Land Python CC Hilton Dataset Tensor Permission mmap MD5 FastAPI Plotly Windows PyTorch GoogLeNet Hungarian Zip transformers 阿里云 音频 CAM Firewall Cloudreve 多线程 Plate CTC Input 顶会 HaggingFace 搞笑 Animate 递归学习法 Django WebCrawler Card 多进程 Docker LeetCode Streamlit Bipartite Heatmap torchinfo Pytorch FlashAttention Video PIP v0.dev 关于博主 Clash DeepStream Vmess Disk PDF git-lfs HuggingFace Crawler Conda scipy Qwen2.5 报税 IndexTTS2 Jetson 版权 公式 BTC Knowledge 域名 Bin Proxy LoRA XGBoost Base64 签证 Gemma 第一性原理 Bert Domain PyCharm EXCEL Logo TTS uWSGI Algorithm Food ResNet-50 Qwen Paper Quantization OpenAI NLP Magnet TSV git UI SVR WAN Diagram Jupyter GPT4 COCO Attention Paddle 强化学习 Google TensorRT Website RAR Breakpoint SQL VGG-16 算法题 Miniforge LLM Safetensors CLAP 财报 Anaconda BF16 Random Password hf Transformers Ubuntu InvalidArgumentError CV CEIR NLTK LaTeX Color AI ONNX Tiktoken OpenCV SAM 飞书 QWEN Pickle Web PDB 腾讯云 Agent Pandas Bitcoin GPTQ v2ray Math Github FP16 Data tar Interview C++ Freesound Search uwsgi NameSilo CUDA Sklearn XML GGML
    站点统计

    本站现有博文319篇,共被浏览750145

    本站已经建立2403天!

    热门文章
    文章归档
    回到顶部