EADST

Building llama.cpp

Building for CPU

The CPU build is straightforward and works on any system with a modern C++ compiler. Here's how to do it:

cmake -B build
cmake --build build --config Release

Building with CUDA

If you have an NVIDIA GPU, you can build llama.cpp with CUDA support for significantly faster inference:

cmake -B build -DGGML_CUDA=ON
cmake --build build --config Release -j 32

-DGGML_CUDA=ON enables CUDA support

-j 32 enables parallel compilation with 32 threads to speed up the build process

Reference

Build llama.cpp locally

相关标签
About Me
XD
Goals determine what you are going to be.
Category
标签云
Magnet 财报 WebCrawler Quantization v0.dev BF16 CSV OpenCV HaggingFace Web SQL Statistics Paddle ModelScope ChatGPT Bin API SPIE CTC Google Datetime Translation Heatmap 签证 GIT PDB Ubuntu Transformers Plotly CEIR Cloudreve Qwen2.5 TSV tar Paper 递归学习法 VSCode News transformers CUDA uWSGI Password Domain AI v2ray LLAMA Vmess CAM PIP Animate Plate Excel 净利润 Attention Color TTS RGB TensorRT FP8 Diagram Dataset Quantize FP64 Markdown scipy 强化学习 Miniforge EXCEL git 继承 DeepStream Claude llama.cpp NLTK 版权 Qwen Baidu Input FlashAttention Crawler LaTeX CC Breakpoint 第一性原理 FP16 GGML Agent GPTQ Bipartite Bitcoin 搞笑 SVR Streamlit Tiktoken printf Image2Text Interview Base64 Hungarian PyTorch Vim Data Template Github Pillow OpenAI Algorithm Distillation 证件照 Land Tracking Hilton PDF 算法题 Nginx Safetensors CV Tensor Pytorch Shortcut SAM Card RAR COCO 多线程 公式 tqdm Pandas Michelin Python XGBoost Gemma LeetCode Clash NLP LoRA logger UI TensorFlow JSON Jupyter WAN uwsgi 域名 Ptyhon Augmentation UNIX InvalidArgumentError 顶会 飞书 BeautifulSoup OCR Llama LLM Use 腾讯云 DeepSeek NameSilo mmap Random PyCharm Sklearn Conda FP32 Hotel 阿里云 Website Freesound FastAPI Firewall ResNet-50 报税 Search CLAP Permission Anaconda Zip 音频 图形思考法 C++ Food ONNX Git 关于博主 GoogLeNet Knowledge Bert Numpy diffusers Proxy git-lfs HuggingFace VPN BTC Disk hf Logo torchinfo Video Review Jetson VGG-16 多进程 GPT4 Docker YOLO Windows QWEN Django Mixtral MD5 IndexTTS2 Linux Math SQLite Qwen2 XML Pickle
站点统计

本站现有博文320篇,共被浏览756814

本站已经建立2421天!

热门文章
文章归档
回到顶部