EADST

FP8 Bit Layout Explained

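As AI models keep growing, the challenge is no longer compute alone: bandwidth, energy consumption, and model deployment have become bottlenecks too. That is why more efficient numeric representations have become a way forward, and the one attracting the most attention is the FP8 (8-bit floating-point) format.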

CONTINUE READING

Quick Review: ZeroQuant-FP

ZeroQuant-FP: A Leap Forward in LLMs Post-Training W4A8 Quantization Using Floating-Point Formats

Paper: https://arxiv.org/abs/2307.09782

Code: https://github.com/microsoft/DeepSpeed

Organization: Microsoft

CONTINUE READING
About Me
XD
Goals determine what you are going to be.
Site Statistics

This site currently has 289 posts with 579228 total views.

This site has been online for 2160 days!
