EADST

Building llama.cpp

Building for CPU

The CPU build is straightforward and works on any system with a modern C++ compiler. Here's how to do it:

cmake -B build
cmake --build build --config Release

Building with CUDA

If you have an NVIDIA GPU, you can build llama.cpp with CUDA support for significantly faster inference:

cmake -B build -DGGML_CUDA=ON
cmake --build build --config Release -j 32

-DGGML_CUDA=ON enables CUDA support

-j 32 enables parallel compilation with 32 threads to speed up the build process

Reference

Build llama.cpp locally

相关标签
About Me
XD
Goals determine what you are going to be.
Category
标签云
ONNX 图标 IndexTTS2 SVR Land hf Agent Food CTC Augmentation Pickle GGML VSCode Qwen2 Card Jupyter Docker Random NameSilo icon FP32 Zip Password 版权 Streamlit 财报 PyTorch 公式 XGBoost Tensor OCR FP64 算法题 SAM UI GoogLeNet WebCrawler Excel SQL Baidu Ubuntu FlashAttention tqdm ChatGPT Statistics WAN Anaconda Animate TTS GPTQ SPIE OpenCV CSV Crawler Github VPN Bert CV llama.cpp PDB v2ray Input PDF Vim LLM QWEN 顶会 GIT Bitcoin TensorFlow Quantize Firewall Qwen2.5 TensorRT 多线程 diffusers Gemma 搞笑 ModelScope Shortcut v0.dev Logo LeetCode Review XML Template Markdown InvalidArgumentError printf Heatmap 第一性原理 Django HuggingFace Knowledge Permission 音频 Hungarian transformers Nginx Website Paddle Pytorch GPT4 Search 图形思考法 Miniforge 阿里云 News Color git-lfs 继承 Pandas FastAPI 净利润 YOLO MD5 腾讯云 飞书 DeepSeek torchinfo 强化学习 scipy LLAMA NLTK VGG-16 Claude Web uwsgi 签证 uWSGI RGB Clash Data Diagram Quantization Python Algorithm tar 递归学习法 域名 Qwen UNIX ResNet-50 Git COCO Magnet Windows CC Bin TSV Breakpoint Freesound Translation RAR EXCEL FP16 OpenAI Transformers 报税 AI JSON Plotly CAM PyCharm Disk logger Michelin CUDA Tracking LoRA Safetensors API Video mmap Tiktoken Plate BeautifulSoup BTC 证件照 Sklearn 云服务器 Image2Text git CEIR Domain Distillation Bipartite Ptyhon Paper HaggingFace Vmess FP8 Dataset Google Llama Pillow LaTeX 多进程 Base64 Hilton PIP C++ Jetson Proxy CLAP Datetime DeepStream Hotel Numpy Math Use SQLite Mixtral Interview BF16 Cloudreve 关于博主 NLP Conda Linux Attention
站点统计

本站现有博文322篇,共被浏览790298

本站已经建立2486天!

热门文章
文章归档
回到顶部