EADST

Building llama.cpp

Building for CPU

The CPU build is straightforward and works on any system with CMake and a modern C++ compiler. From the root of the llama.cpp source tree, run:

cmake -B build
cmake --build build --config Release

Building with CUDA

If you have an NVIDIA GPU and the CUDA Toolkit installed, you can build llama.cpp with CUDA support for significantly faster inference:

cmake -B build -DGGML_CUDA=ON
cmake --build build --config Release -j 32

-DGGML_CUDA=ON enables the CUDA backend.

-j 32 runs up to 32 compile jobs in parallel to speed up the build. Pick a value close to the number of CPU cores on your machine.
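Rather than hard-coding 32, the job count can be derived from the machine. The script below is a minimal sketch, not part of the official build instructions: the helper names detect_jobs and cuda_flag are made up for illustration, and treating "nvcc is on PATH" as a sign that the CUDA Toolkit is installed is an assumption that may not hold on every setup. The build only runs when the script is started from a directory containing CMakeLists.txt.

```shell
#!/bin/sh
# Hypothetical helper: print the number of logical CPU cores, or 1 if unknown.
detect_jobs() {
    if command -v nproc >/dev/null 2>&1; then
        nproc                          # Linux (GNU coreutils)
    elif sysctl -n hw.ncpu >/dev/null 2>&1; then
        sysctl -n hw.ncpu              # macOS / BSD
    else
        echo 1                         # conservative fallback
    fi
}

# Hypothetical helper: print the CUDA flag only if nvcc is on PATH.
# Assumption: nvcc on PATH means the CUDA Toolkit is usable.
cuda_flag() {
    if command -v nvcc >/dev/null 2>&1; then
        echo "-DGGML_CUDA=ON"
    fi
}

JOBS=$(detect_jobs)
CUDA_FLAG=$(cuda_flag)
echo "Parallel jobs: $JOBS, extra CMake flag: ${CUDA_FLAG:-none (CPU build)}"

# Only build when run from the llama.cpp source root.
if [ -f CMakeLists.txt ]; then
    # ${VAR:+...} expands to nothing when CUDA_FLAG is empty.
    cmake -B build ${CUDA_FLAG:+"$CUDA_FLAG"}
    cmake --build build --config Release -j "$JOBS"
fi
```

Using every core gives the fastest build but can exhaust memory on small machines, since CUDA compilation is memory-hungry; lowering JOBS by hand is a reasonable adjustment in that case.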

Reference

Build llama.cpp locally
