EADST

Quick Review: ZeroQuant-FP

ZeroQuant-FP: A Leap Forward in LLMs Post-Training W4A8 Quantization Using Floating-Point Formats

Paper: https://arxiv.org/abs/2307.09782

Code: https://github.com/microsoft/DeepSpeed

Organization: Microsoft
