EADST

CONTINUE READING
CONTINUE READING

FP8位数解析

在 AI 模型越来越庞大的今天,我们面临的不仅是算力挑战,更有带宽、能耗和模型部署的瓶颈。正因如此,更高效的数值表示方式成为突破口,其中最受关注的就是 FP8(8位浮点数)格式。

CONTINUE READING
CONTINUE READING
CONTINUE READING
CONTINUE READING
CONTINUE READING
CONTINUE READING
CONTINUE READING

Quick Review: ZeroQuant-FP

ZeroQuant-FP: A Leap Forward in LLMs Post-Training W4A8 Quantization Using Floating-Point Formats

Paper: https://arxiv.org/abs/2307.09782

Code: https://github.com/microsoft/DeepSpeed

Organization: Microsoft

CONTINUE READING
CONTINUE READING
CONTINUE READING
CONTINUE READING
CONTINUE READING
CONTINUE READING
CONTINUE READING
About Me
XD
Goals determine what you are going to be.
Category
标签云
Magnet Jetson Logo llama.cpp Miniforge NLP XML Domain Heatmap Freesound Qwen2 FP8 净利润 Input BTC Data Distillation uWSGI 算法题 Safetensors SAM Google 云服务器 VPN git-lfs Shortcut Card 飞书 TensorFlow LLAMA Sklearn FP64 COCO hf Website GPTQ Statistics Algorithm NLTK Quantization Interview uwsgi CV GoogLeNet 阿里云 Windows 版权 CC printf Image2Text OpenCV GGML VGG-16 Animate 音频 搞笑 递归学习法 OCR Django DeepStream Bert Random Base64 Zip SQLite Linux transformers Excel NameSilo Jupyter IndexTTS2 Conda Search QWEN Hotel SQL GIT torchinfo Permission Proxy OpenAI RGB 证件照 Rebuttal Markdown Numpy ONNX YOLO MD5 PyTorch TSV GPT4 Mixtral Hungarian Git BeautifulSoup CAM Bin UI Python Cloudreve FlashAttention Transformers Use v0.dev BF16 Github 顶会 Gemma Breakpoint 公式 多线程 多进程 Disk git WAN Michelin Baidu Paddle SPIE DeepSeek Pandas ChatGPT API ModelScope JSON Translation News Color 关于博主 CEIR Bipartite 第一性原理 Knowledge LLM 继承 tqdm SVR v2ray 腾讯云 InvalidArgumentError 强化学习 Ubuntu Nginx Review LoRA Bitcoin PDF XGBoost Food Ptyhon Hilton Pytorch Augmentation Claude Attention Crawler tar mmap Agent Vmess 域名 Pickle 图标 CTC CSV Template C++ RL PDB icon Land Video WebCrawler Llama Qwen2.5 RAR FastAPI FP32 论文 LeetCode UNIX CUDA Qwen FP16 HuggingFace Password Firewall Tensor Web Math 签证 VSCode PIP Diagram 报税 AI EXCEL TensorRT Anaconda Tracking Clash logger HaggingFace ResNet-50 LaTeX 论文速读 Dataset Paper diffusers Vim 图形思考法 TTS Datetime Docker Tiktoken scipy 财报 Plotly CLAP Quantize ms-swift Pillow Plate PyCharm Streamlit
站点统计

本站现有博文332篇,共被浏览869851

本站已经建立2578天!

热门文章
文章归档
回到顶部