EADST

Transformers Demo for DeepSeek-R1-Distill-Qwen-7B

from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "/your_deepseek-ai_DeepSeek-R1-Distill-Qwen-7B_path"

model = AutoModelForCausalLM.from_pretrained(
    model_name,
    torch_dtype="auto",
    device_map="auto"
)
tokenizer = AutoTokenizer.from_pretrained(model_name)

prompt = "Give me a short introduction to large language models."
messages = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": prompt}
]
text = tokenizer.apply_chat_template(
    messages,
    tokenize=False,
    add_generation_prompt=True
)
model_inputs = tokenizer([text], return_tensors="pt").to(model.device)

generated_ids = model.generate(
    **model_inputs,
    max_new_tokens=2048
)
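# Drop the prompt tokens from each sequence so only the newly generated tokens remain
generated_ids = [
    output_ids[len(input_ids):] for input_ids, output_ids in zip(model_inputs.input_ids, generated_ids)
]

# skip_special_tokens=False keeps special tokens visible in the decoded text
response = tokenizer.batch_decode(generated_ids, skip_special_tokens=False)[0]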

print("Question: \n", text)
print("Answer: \n", response)
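
The printed answer from an R1-style model usually contains the reasoning trace followed by the final reply. Below is a minimal sketch for separating the two, assuming the reasoning ends with a `</think>` tag. The helper name `split_reasoning` and the sample string are hypothetical and only for illustration. Depending on the chat template, the opening `<think>` tag may be part of the prompt rather than the generated text, so the sketch treats it as optional.

```python
def split_reasoning(response, close_tag="</think>", open_tag="<think>"):
    """Split a decoded response into (reasoning, answer).

    Assumption: the reasoning trace ends with close_tag. If the tag is
    missing, the whole response is returned as the answer.
    """
    head, sep, tail = response.partition(close_tag)
    if not sep:
        # No closing tag found: nothing to separate
        return "", response.strip()
    # The opening tag is optional, so remove it only if it is present
    reasoning = head.replace(open_tag, "", 1).strip()
    return reasoning, tail.strip()


# Hypothetical sample text standing in for the decoded `response` above
sample = "<think>\nThe user wants a short overview.\n</think>\n\nA large language model is a neural network trained on text."
reasoning, answer = split_reasoning(sample)
print("Reasoning: \n", reasoning)
print("Final answer: \n", answer)
```

In the demo above, `split_reasoning(response)` could be called right after decoding. Since special tokens are kept in `response`, any end-of-sequence marker would remain at the end of the answer part.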