EADST

Building llama.cpp

Building for CPU

The CPU build is straightforward and works on any system with a modern C++ compiler. Here's how to do it:

cmake -B build
cmake --build build --config Release

Building with CUDA

If you have an NVIDIA GPU, you can build llama.cpp with CUDA support for significantly faster inference:

cmake -B build -DGGML_CUDA=ON
cmake --build build --config Release -j 32

-DGGML_CUDA=ON enables CUDA support

-j 32 enables parallel compilation with 32 threads to speed up the build process

Reference

Build llama.cpp locally

相关标签
About Me
XD
Goals determine what you are going to be.
Category
标签云
Michelin TensorFlow Search Markdown NLP LLAMA Qwen 音频 CV Statistics Attention 版权 报税 Claude Quantize Windows 关于博主 LLM 多进程 uwsgi Plotly PyTorch CC 域名 Transformers GPT4 Knowledge YOLO Shortcut Algorithm Ptyhon CAM News API diffusers Numpy Jupyter Conda Miniforge Pandas Python Land BTC Pytorch Domain ModelScope Website Math WAN TTS Llama Vim 顶会 TSV scipy VGG-16 transformers Tensor Datetime ONNX logger 图形思考法 OpenCV 公式 OCR Git Random Template torchinfo Permission 云服务器 NLTK 阿里云 Input Docker LeetCode Qwen2.5 HuggingFace Anaconda Disk CSV Translation CTC Baidu Image2Text AI 净利润 Food BeautifulSoup Tiktoken Linux WebCrawler NameSilo Mixtral FlashAttention VPN Cloudreve MD5 Github 多线程 Crawler 算法题 UI Distillation UNIX Django Excel PyCharm Dataset v2ray Bipartite Web Pillow v0.dev 签证 Paddle Logo BF16 GIT XGBoost Hilton Google Animate CLAP Bert 强化学习 LaTeX FastAPI tqdm ResNet-50 VSCode CEIR FP16 Color Use Proxy Interview Qwen2 Paper DeepStream Nginx RAR Hotel SPIE DeepSeek PDB PIP llama.cpp Plate Zip Firewall Tracking CUDA QWEN Card git PDF 飞书 SVR printf SQLite Augmentation Ubuntu Sklearn SAM 搞笑 Jetson 第一性原理 GPTQ SQL JSON Base64 Agent Diagram HaggingFace Gemma EXCEL Pickle LoRA Heatmap Clash Data RGB Freesound GoogLeNet FP64 git-lfs InvalidArgumentError Quantization uWSGI Safetensors GGML 证件照 腾讯云 Bitcoin Magnet Hungarian COCO 递归学习法 TensorRT mmap 财报 hf Breakpoint OpenAI Review Bin XML Vmess FP32 Streamlit FP8 tar 继承 ChatGPT IndexTTS2 Video C++ Password
站点统计

本站现有博文321篇,共被浏览774235

本站已经建立2464天!

热门文章
文章归档
回到顶部