EADST

Transformers Demo for DeepSeek-R1-Distill-Qwen-7B

Transformers Demo for DeepSeek-R1-Distill-Qwen-7B

from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "/your_deepseek-ai_DeepSeek-R1-Distill-Qwen-7B_path"

model = AutoModelForCausalLM.from_pretrained(
    model_name,
    torch_dtype="auto",
    device_map="auto"
)
tokenizer = AutoTokenizer.from_pretrained(model_name)

prompt = "Give me a short introduction to large language model."
messages = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": prompt}
]
text = tokenizer.apply_chat_template(
    messages,
    tokenize=False,
    add_generation_prompt=True
)
model_inputs = tokenizer([text], return_tensors="pt").to(model.device)

generated_ids = model.generate(
    **model_inputs,
    max_new_tokens=2048
)
generated_ids = [
    output_ids[len(input_ids):] for input_ids, output_ids in zip(model_inputs.input_ids, generated_ids)
]

response = tokenizer.batch_decode(generated_ids, skip_special_tokens=False)[0] # show special tokens

print("Question: \n", text)
print("Answer: \n", response)
相关标签
About Me
XD
Goals determine what you are going to be.
Category
标签云
XML 顶会 GPTQ Github Dataset Crawler git 报税 Claude SAM Docker Llama git-lfs Bipartite 域名 ChatGPT Git Freesound TSV CAM ModelScope Land Pytorch XGBoost GoogLeNet llama.cpp hf Distillation Password OpenCV BeautifulSoup 飞书 Color 第一性原理 HuggingFace SPIE DeepSeek HaggingFace Datetime NLTK GIT FP32 SVR CUDA uWSGI NameSilo Cloudreve Heatmap OCR mmap Random Animate Food InvalidArgumentError tar BTC 版权 Math ONNX Safetensors TTS Pillow Hotel torchinfo FlashAttention logger Data Search VGG-16 递归学习法 财报 Logo Use UI transformers Domain Conda Windows Knowledge v2ray Baidu PDF Pandas CSV 多进程 Image2Text 证件照 Web API PDB Plotly Firewall scipy JSON Jupyter 继承 VPN 净利润 LaTeX Nginx VSCode FastAPI v0.dev PIP Bitcoin Mixtral Algorithm 强化学习 Input CC Base64 Magnet Tiktoken Hungarian GPT4 News CLAP Pickle Ubuntu YOLO 签证 LoRA LeetCode Permission LLM Proxy COCO 音频 Excel 腾讯云 Michelin Card ResNet-50 Attention Linux Template Python Clash Quantization NLP WebCrawler Statistics Qwen2 RGB AI Interview C++ Tensor Plate Qwen2.5 GGML Google 图形思考法 Disk Bin 多线程 Numpy PyTorch Miniforge FP8 Anaconda diffusers IndexTTS2 UNIX Breakpoint printf Bert Diagram Tracking CV Gemma Shortcut LLAMA Streamlit RAR Agent Transformers FP64 Ptyhon tqdm 阿里云 PyCharm CTC Vmess BF16 Qwen Paddle TensorFlow WAN MD5 Vim SQLite Sklearn 关于博主 Hilton SQL OpenAI uwsgi TensorRT Zip CEIR QWEN Markdown Translation FP16 Django Augmentation Video 公式 EXCEL 搞笑 算法题 Website Quantize Paper DeepStream Review Jetson
站点统计

本站现有博文320篇,共被浏览760997

本站已经建立2433天!

热门文章
文章归档
回到顶部