EADST

Quick Review: ZeroQuant-FP

ZeroQuant-FP: A Leap Forward in LLMs Post-Training W4A8 Quantization Using Floating-Point Formats

Paper: https://arxiv.org/abs/2307.09782

Code: https://github.com/microsoft/DeepSpeed

Organization: Microsoft
