EADST

Transformers Demo for DeepSeek-R1-Distill-Qwen-7B

from transformers import AutoModelForCausalLM, AutoTokenizer

# Placeholder: replace with the local directory that holds the downloaded model weights.
model_name = "/your_deepseek-ai_DeepSeek-R1-Distill-Qwen-7B_path"

model = AutoModelForCausalLM.from_pretrained(
    model_name,
    torch_dtype="auto",
    device_map="auto"
)
tokenizer = AutoTokenizer.from_pretrained(model_name)

prompt = "Give me a short introduction to large language models."
# The DeepSeek-R1 usage recommendations advise against a system prompt,
# so all instructions go in the user message.
messages = [
    {"role": "user", "content": prompt}
]
text = tokenizer.apply_chat_template(
    messages,
    tokenize=False,
    add_generation_prompt=True
)
model_inputs = tokenizer([text], return_tensors="pt").to(model.device)

generated_ids = model.generate(
    **model_inputs,
    max_new_tokens=2048
)
# Drop the prompt tokens so that only the newly generated tokens remain.
generated_ids = [
    output_ids[len(input_ids):] for input_ids, output_ids in zip(model_inputs.input_ids, generated_ids)
]

# skip_special_tokens=False keeps special tokens (such as the end-of-sequence marker) in the decoded text.
response = tokenizer.batch_decode(generated_ids, skip_special_tokens=False)[0]

print("Question: \n", text)
print("Answer: \n", response)
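
The R1-style models usually emit their chain of thought before the final answer and close it with a `</think>` tag. The sketch below shows one way to separate the two parts of `response`. The helper name `split_reasoning` and its parameters are hypothetical and not part of Transformers. It also assumes that the opening `<think>` tag may be absent from the decoded text, because some versions of the chat template already append it to the prompt.

```python
def split_reasoning(response: str,
                    open_tag: str = "<think>",
                    close_tag: str = "</think>") -> tuple[str, str]:
    """Split decoded model output into (reasoning, answer).

    Hypothetical helper: assumes the reasoning ends at the first close_tag.
    If the tag is missing, the whole text is treated as the answer.
    """
    head, sep, tail = response.partition(close_tag)
    if not sep:
        # No closing tag found, so there is no separable reasoning part.
        return "", response.strip()
    # Remove the opening tag if the model (or the template) left it in the text.
    reasoning = head.replace(open_tag, "", 1).strip()
    return reasoning, tail.strip()


# Usage with a hand-written string standing in for real model output.
reasoning, answer = split_reasoning("<think>\nstep one\n</think>\n\nFinal answer.")
print("Reasoning:", reasoning)
print("Answer:", answer)
```

With the demo above, `split_reasoning(response)` could be called directly on the decoded output. Because the demo decodes with `skip_special_tokens=False`, the answer part may still end with the end-of-sequence marker.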