FP8 Bit Format Explained
Author: XD / Published: May 6, 2025, 02:15 / Research & Learning
As AI models keep growing, the challenge is no longer just compute: bandwidth, energy consumption, and deployment have become bottlenecks as well. This is why more efficient numerical representations have become a key breakthrough, and the one drawing the most attention is the FP8 (8-bit floating point) format.
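To make the format concrete, here is a minimal Python sketch that decodes one FP8 value in the common E4M3 layout (1 sign bit, 4 exponent bits, 3 mantissa bits, exponent bias 7, no infinities). The decode_fp8_e4m3 helper and the conventions it assumes are illustrative only, not the API of any particular library.

```python
def decode_fp8_e4m3(byte: int) -> float:
    """Decode one FP8 E4M3 value (1 sign, 4 exponent, 3 mantissa bits).

    Hypothetical helper for illustration; assumes the common E4M3
    convention with exponent bias 7 and no infinity encoding.
    """
    sign = -1.0 if (byte >> 7) & 0x1 else 1.0
    exp = (byte >> 3) & 0xF   # 4 exponent bits
    mant = byte & 0x7         # 3 mantissa bits

    if exp == 0:              # subnormal: no implicit leading 1
        return sign * (mant / 8.0) * 2.0 ** (1 - 7)
    if exp == 0xF and mant == 0x7:
        return float("nan")   # E4M3 reserves only this pattern for NaN
    return sign * (1.0 + mant / 8.0) * 2.0 ** (exp - 7)

# Example: 0x44 -> sign 0, exponent 8, mantissa 4 -> 1.5 * 2^(8-7) = 3.0
print(decode_fp8_e4m3(0x44))
```

The other widely used FP8 variant, E5M2, trades one mantissa bit for an extra exponent bit, giving a wider dynamic range at the cost of precision.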
Understanding FP32 and FP64: Single and Double Precision Floating Point
Understanding BF16: Brain Floating Point Format
Understanding FP16: Half-Precision Floating Point
Lucid Plugin in ChatGPT for Creating Diagrams
SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
Paper: https://arxiv.org/abs/2211.10438
Code: https://github.com/mit-han-lab/smoothquant
Organization: MIT
AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
Paper: https://arxiv.org/abs/2306.00978
Code: https://github.com/mit-han-lab/llm-awq/
Organization: MIT
ZeroQuant-FP: A Leap Forward in LLMs Post-Training W4A8 Quantization Using Floating-Point Formats
Paper: https://arxiv.org/abs/2307.09782
Code: https://github.com/microsoft/DeepSpeed
Organization: Microsoft
QUIK: Towards End-to-end 4-Bit Inference on Generative Large Language Models
Paper: https://arxiv.org/abs/2310.09259
Code: https://github.com/IST-DASLab/QUIK
Organization: ETH Zurich
SpQR: A Sparse-Quantized Representation for Near-Lossless LLM Weight Compression
Paper: https://arxiv.org/abs/2306.03078
Code: https://github.com/Vahe1994/SpQR
Organization: University of Washington
Optimize Weight Rounding via Signed Gradient Descent for the Quantization of Large Language Models
Paper: https://arxiv.org/abs/2309.05516
Code: https://github.com/intel/neural-compressor
Organization: Intel
Norm Tweaking: High-performance Low-bit Quantization of Large Language Models
Paper: https://arxiv.org/abs/2309.02784
Code: None
Organization: Meituan
Review: H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models
Level: Average, Not Recommended
QWEN7B to LLAMA GPTQ Model Structure
QWEN7B to LLAMA7B Model Structure
GGML Q4_0 Quantization Analysis in llama.cpp