Building llama.cpp | 东毅居士

Author: XD / Published: February 19, 2025 05:18 / Updated: February 19, 2025 05:21 / Programming Notes / Views: 819

Building for CPU

The CPU build is straightforward and works on any system with CMake and a modern C++ compiler. Run the following from the root of the llama.cpp repository:

cmake -B build
cmake --build build --config Release
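Before running the two commands above, it can help to confirm that the required tools are on your PATH. The following is a minimal sketch, not part of llama.cpp: the helper names have and check_tools are hypothetical, and it assumes the compiler is reachable under the name c++, which may differ on your system (for example g++ or clang++).

```shell
#!/bin/sh
# Hypothetical helper: succeeds if the named tool is on PATH.
have() { command -v "$1" >/dev/null 2>&1; }

# Hypothetical helper: report each tool as found or missing.
# Returns 0 only if every tool passed as an argument was found.
check_tools() {
    missing=0
    for tool in "$@"; do
        if have "$tool"; then
            echo "found: $tool"
        else
            echo "missing: $tool"
            missing=1
        fi
    done
    return "$missing"
}
```

For example, check_tools cmake c++ && cmake -B build && cmake --build build --config Release starts the build only if both tools were found.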

Building with CUDA

If you have an NVIDIA GPU and the CUDA toolkit installed, you can build llama.cpp with CUDA support for significantly faster inference:

cmake -B build -DGGML_CUDA=ON
cmake --build build --config Release -j 32

-DGGML_CUDA=ON enables CUDA support

-j 32 runs up to 32 compile jobs in parallel to speed up the build; pick a value close to the number of CPU cores on your machine
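Rather than hard-coding 32, the job count can be derived from the machine's core count. This is a rough sketch under a few assumptions: detect_jobs is a hypothetical helper, it relies on nproc (Linux), sysctl (macOS), or getconf being available, and the fallback of 4 is an arbitrary choice. It only prints the resulting build command so you can inspect it before running it.

```shell
#!/bin/sh
# Hypothetical helper: print the number of online CPU cores.
# Tries nproc (Linux), then sysctl (macOS), then getconf; falls back to 4.
detect_jobs() {
    nproc 2>/dev/null \
        || sysctl -n hw.ncpu 2>/dev/null \
        || getconf _NPROCESSORS_ONLN 2>/dev/null \
        || echo 4
}

JOBS="$(detect_jobs)"

# Print the build command instead of executing it.
echo "cmake --build build --config Release -j ${JOBS}"
```

On a machine shared with other workloads, a value somewhat below the core count may keep the system more responsive during the build.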

Reference

Build llama.cpp locally

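Author: XD. Please credit the source when reposting: http://www.eadst.com/blog/276

This site is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.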
