EADST

Building llama.cpp

Building for CPU

The CPU build is straightforward and works on any system with a modern C++ compiler. Here's how to do it:

cmake -B build
cmake --build build --config Release

Building with CUDA

If you have an NVIDIA GPU, you can build llama.cpp with CUDA support for significantly faster inference:

cmake -B build -DGGML_CUDA=ON
cmake --build build --config Release -j 32

-DGGML_CUDA=ON enables CUDA support

-j 32 enables parallel compilation with 32 threads to speed up the build process

Reference

Build llama.cpp locally

相关标签
About Me
XD
Goals determine what you are going to be.
Category
标签云
Proxy 净利润 Qwen2.5 uWSGI Hilton scipy CLAP Review Disk 财报 PDB FP16 Website Logo Miniforge SPIE SAM PDF Safetensors Plate Algorithm WebCrawler printf GIT LoRA logger Template DeepSeek PyCharm Excel VPN Food Bin CC SQLite 强化学习 Windows BeautifulSoup Claude InvalidArgumentError SVR 签证 RGB ResNet-50 GGML Conda CSV 证件照 OpenCV LLM Vmess Input CAM RAR GPT4 Sklearn mmap JSON PyTorch OpenAI 多线程 UI YOLO transformers Tracking git PIP Paper v2ray torchinfo BF16 顶会 TensorFlow Bitcoin Card 阿里云 Crawler Pandas 图形思考法 Breakpoint Pytorch WAN Translation Heatmap Statistics Color Augmentation Tensor Ptyhon Random Pillow Shortcut 飞书 Bert COCO NLTK Agent OCR Hungarian GoogLeNet Domain Datetime Michelin diffusers Markdown Plotly News Dataset Github ChatGPT Video Password LeetCode icon VSCode 关于博主 Mixtral 第一性原理 HuggingFace Attention Search ModelScope ONNX Base64 tar 公式 Pickle 音频 Firewall IndexTTS2 CEIR Bipartite Quantize FP8 Web Animate uwsgi CV Google Linux CTC Gemma Diagram NameSilo MD5 UNIX Knowledge Llama 搞笑 tqdm DeepStream Quantization Distillation Cloudreve VGG-16 TensorRT TSV 腾讯云 云服务器 Hotel Python FP64 版权 HaggingFace v0.dev Magnet 域名 LLAMA Land Zip GPTQ Freesound Numpy Transformers XML C++ Tiktoken FlashAttention Vim Jupyter TTS Streamlit Qwen2 XGBoost 继承 Git EXCEL Use Django Clash Interview FastAPI Permission Ubuntu Nginx Jetson 图标 CUDA 报税 Paddle QWEN hf Anaconda 算法题 Baidu 多进程 LaTeX Image2Text Qwen SQL FP32 llama.cpp Data 递归学习法 API git-lfs BTC Docker Math NLP AI
站点统计

本站现有博文322篇,共被浏览790329

本站已经建立2486天!

热门文章
文章归档
回到顶部