EADST

Transformers Demo for DeepSeek-R1-Distill-Qwen-7B

from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "/your_deepseek-ai_DeepSeek-R1-Distill-Qwen-7B_path"

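# Load the weights; "auto" lets transformers choose the dtype and device placement.
# Note: device_map="auto" requires the accelerate package to be installed.
model = AutoModelForCausalLM.from_pretrained(
    model_name,
    torch_dtype="auto",
    device_map="auto"
)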
tokenizer = AutoTokenizer.from_pretrained(model_name)

prompt = "Give me a short introduction to large language models."
messages = [
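    # Note: the DeepSeek-R1 model card recommends omitting the system prompt
    # and putting all instructions in the user message; remove this entry to follow it.
    {"role": "system", "content": "You are a helpful assistant."},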
    {"role": "user", "content": prompt}
]
text = tokenizer.apply_chat_template(
    messages,
    tokenize=False,
    add_generation_prompt=True
)
model_inputs = tokenizer([text], return_tensors="pt").to(model.device)

generated_ids = model.generate(
    **model_inputs,
    max_new_tokens=2048
)
# Drop the prompt tokens so that only the newly generated tokens remain.
generated_ids = [
    output_ids[len(input_ids):] for input_ids, output_ids in zip(model_inputs.input_ids, generated_ids)
]

# skip_special_tokens=False keeps special tokens visible in the decoded text.
response = tokenizer.batch_decode(generated_ids, skip_special_tokens=False)[0]

print("Question: \n", text)
print("Answer: \n", response)
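
R1-style models usually emit their reasoning before the final answer, closed by a `</think>` tag, so the decoded `response` above tends to mix the two. Below is a minimal sketch for separating them. The helper name `split_reasoning` is hypothetical (it is not part of transformers), and it assumes the reasoning ends at the first `</think>`; depending on the chat template, the opening `<think>` may or may not appear in the decoded text, so both cases are handled.

```python
from typing import Tuple


def split_reasoning(response: str, close_tag: str = "</think>") -> Tuple[str, str]:
    """Split decoded text into (reasoning, answer) at the first closing tag.

    Hypothetical helper for illustration; the tag names are assumptions
    about the model's output format, not something transformers provides.
    """
    reasoning, sep, answer = response.partition(close_tag)
    if not sep:
        # No closing tag found: treat the whole text as the answer.
        return "", response.strip()
    # Remove the opening tag if the model (or template) produced one.
    reasoning = reasoning.replace("<think>", "", 1).strip()
    return reasoning, answer.strip()


if __name__ == "__main__":
    demo = "<think>\nThe user wants a short intro.\n</think>\n\nLLMs are neural networks trained on text."
    thoughts, final = split_reasoning(demo)
    print("Reasoning:", thoughts)
    print("Answer:", final)
```

With the script above, this could be called as `split_reasoning(response)`. Since the script decodes with `skip_special_tokens=False`, the answer part may still end with the model's end-of-sequence token; decoding with `skip_special_tokens=True` is one way to avoid that.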