EADST

Building llama.cpp

Building for CPU

The CPU build is straightforward and works on any system with a modern C++ compiler. Here's how to do it:

cmake -B build
cmake --build build --config Release

Building with CUDA

If you have an NVIDIA GPU, you can build llama.cpp with CUDA support for significantly faster inference:

cmake -B build -DGGML_CUDA=ON
cmake --build build --config Release -j 32

-DGGML_CUDA=ON enables CUDA support

-j 32 enables parallel compilation with 32 threads to speed up the build process

Reference

Build llama.cpp locally

相关标签
About Me
XD
Goals determine what you are going to be.
Category
标签云
Password CUDA GGML 搞笑 VPN OCR v2ray Quantize Image2Text VSCode GPT4 ChatGPT v0.dev ONNX GPTQ Conda TSV Dataset Attention FP16 TTS Vmess ModelScope Base64 TensorFlow Animate Paddle CEIR CC LLAMA Land Pandas ms-swift FastAPI transformers InvalidArgumentError HaggingFace 关于博主 腾讯云 GoogLeNet Safetensors Git FP64 News Magnet AI 云服务器 第一性原理 NLP Crawler Hilton 签证 tar Diagram Jupyter Zip Transformers Rebuttal 递归学习法 Python IndexTTS2 图形思考法 Google Website QWEN Pillow Domain RAR PyTorch Interview CTC Algorithm Heatmap Windows Review BF16 Gemma Bin git Mixtral COCO CV BeautifulSoup 净利润 Firewall 域名 MD5 Pickle Hotel Paper Ptyhon Color Sklearn PDF git-lfs Plate LLM uWSGI Bipartite LaTeX Distillation Math Excel Vim EXCEL Freesound Translation Streamlit 多进程 Baidu Statistics DeepStream Llama BTC Jetson Clash UI 顶会 Input Tracking NameSilo printf Breakpoint 财报 论文 uwsgi hf icon LoRA RGB 音频 Markdown DeepSeek 公式 PIP mmap Bert PyCharm Logo Linux GIT TensorRT Permission XGBoost Claude Proxy 阿里云 证件照 Qwen2.5 NLTK Bitcoin Numpy FP8 JSON API SAM Agent CSV Knowledge Tensor logger Tiktoken 继承 llama.cpp WAN SQL OpenAI XML Qwen2 HuggingFace OpenCV Augmentation C++ 图标 scipy Cloudreve CAM Food torchinfo Web VGG-16 SVR WebCrawler 版权 飞书 Data FlashAttention Hungarian Django Github 报税 PDB Random 算法题 Qwen FP32 强化学习 SPIE Search UNIX Pytorch Nginx Michelin Card Use Template tqdm Datetime Disk Plotly Quantization Anaconda YOLO Shortcut Docker diffusers CLAP ResNet-50 论文速读 SQLite Miniforge Ubuntu Video 多线程 LeetCode
站点统计

本站现有博文330篇,共被浏览860843

本站已经建立2569天!

热门文章
文章归档
回到顶部