EADST

Quick Review: SpQR: A Sparse-Quantized Representation for Near-Lossless LLM Weight Compression

SpQR: A Sparse-Quantized Representation for Near-Lossless Large Language Model Weight Compression

Core Approach:

  • GPTQ without Outliers: Focuses on eliminating outliers during the GPTQ process, enabling more efficient and accurate weight compression for large language models.
相关标签
About Me
XD
Goals determine what you are going to be.
Category
标签云
Gemma Permission YOLO LLAMA Use Qwen2 净利润 NLP SPIE Excel 域名 Video COCO Markdown printf 腾讯云 Quantization Google WebCrawler HuggingFace ONNX FP8 Domain VSCode ModelScope Michelin LoRA Bin OpenAI hf Windows 多进程 BF16 FP64 Vim Jupyter Magnet CSV OCR Docker Django IndexTTS2 scipy QWEN Safetensors Password Paddle 关于博主 第一性原理 Paper Statistics News UI CEIR Zip logger GGML Hilton EXCEL Random Nginx 继承 Tracking icon XML Bipartite uWSGI VPN diffusers Color RGB Tensor Firewall Linux Image2Text Crawler HaggingFace Ubuntu RAR Git Interview Land Pillow AI SQLite Food Base64 C++ Proxy Input Bitcoin OpenCV Quantize Jetson Anaconda WAN DeepSeek SAM CV TensorFlow Translation v2ray Cloudreve Freesound uwsgi Pickle Shortcut 证件照 搞笑 Baidu transformers Disk SVR Attention Math Card Web Transformers Hungarian Mixtral 强化学习 GPTQ Llama Pytorch git Clash Tiktoken TensorRT Website mmap Agent InvalidArgumentError v0.dev Python GoogLeNet git-lfs Ptyhon TTS 飞书 torchinfo PDF Algorithm NameSilo Template Augmentation Heatmap NLTK 签证 XGBoost FP32 云服务器 图形思考法 UNIX SQL 财报 VGG-16 tar 阿里云 LeetCode FP16 Animate LLM CTC 音频 ChatGPT Datetime Pandas CC Numpy Knowledge Plate Github Qwen 多线程 Miniforge llama.cpp Dataset PyTorch FastAPI 顶会 TSV tqdm GPT4 Search 报税 版权 Vmess Bert Hotel Data Distillation Logo PIP API 算法题 MD5 PDB DeepStream Claude CLAP BeautifulSoup CUDA Diagram Review ResNet-50 PyCharm BTC CAM Conda 递归学习法 公式 GIT Sklearn Plotly FlashAttention JSON Streamlit 图标 Qwen2.5 Breakpoint LaTeX
站点统计

本站现有博文322篇,共被浏览785571

本站已经建立2479天!

热门文章
文章归档
回到顶部