EADST

Transformers Demo for DeepSeek-R1-Distill-Qwen-7B

Transformers Demo for DeepSeek-R1-Distill-Qwen-7B

from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "/your_deepseek-ai_DeepSeek-R1-Distill-Qwen-7B_path"

model = AutoModelForCausalLM.from_pretrained(
    model_name,
    torch_dtype="auto",
    device_map="auto"
)
tokenizer = AutoTokenizer.from_pretrained(model_name)

prompt = "Give me a short introduction to large language model."
messages = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": prompt}
]
text = tokenizer.apply_chat_template(
    messages,
    tokenize=False,
    add_generation_prompt=True
)
model_inputs = tokenizer([text], return_tensors="pt").to(model.device)

generated_ids = model.generate(
    **model_inputs,
    max_new_tokens=2048
)
generated_ids = [
    output_ids[len(input_ids):] for input_ids, output_ids in zip(model_inputs.input_ids, generated_ids)
]

response = tokenizer.batch_decode(generated_ids, skip_special_tokens=False)[0] # show special tokens

print("Question: \n", text)
print("Answer: \n", response)
相关标签
About Me
XD
Goals determine what you are going to be.
Category
标签云
GIT NLTK Crawler ONNX Review SQL Excel PyCharm uWSGI Color Vim Domain Paddle TensorRT diffusers Input 搞笑 CUDA Distillation DeepStream Docker 域名 Github UI CSV Vmess 强化学习 多进程 LLAMA Food LLM FlashAttention Conda Nginx Quantize 图形思考法 Interview DeepSeek Baidu Jetson Magnet 公式 Animate Django Website Knowledge Transformers GPT4 Web Proxy 算法题 Pytorch git 音频 v0.dev 证件照 VPN UNIX Python 递归学习法 Claude torchinfo XML Statistics Augmentation CAM Hotel 净利润 Bin LaTeX Card CTC Qwen2 RGB WebCrawler SAM Google API SVR Data Random FP64 Agent 关于博主 VSCode Safetensors FP8 Shortcut ModelScope BeautifulSoup v2ray Markdown logger Tensor 版权 Translation Bipartite NameSilo tar 多线程 EXCEL Qwen2.5 WAN Land Firewall 云服务器 JSON Permission Llama Password Datetime RAR llama.cpp 继承 Clash MD5 XGBoost Bitcoin Tiktoken Pandas Image2Text Sklearn scipy Bert Logo ChatGPT 顶会 飞书 Freesound Math YOLO 签证 Zip Hungarian InvalidArgumentError News CEIR PyTorch Linux Paper IndexTTS2 PDF BF16 LoRA Windows Ptyhon ResNet-50 Search Base64 Michelin Disk Cloudreve Use Miniforge HuggingFace Breakpoint FP32 TSV CV TensorFlow OpenCV C++ VGG-16 Heatmap Jupyter TTS Ubuntu Plotly 财报 Numpy Template Dataset SQLite uwsgi SPIE tqdm 阿里云 NLP mmap Video hf Algorithm CLAP transformers FP16 GPTQ Attention GoogLeNet git-lfs 腾讯云 CC PDB Anaconda Diagram GGML Pillow 报税 BTC Pickle 第一性原理 Gemma LeetCode FastAPI OpenAI HaggingFace PIP Tracking Streamlit Mixtral Plate Hilton Qwen Quantization QWEN printf COCO AI OCR Git
站点统计

本站现有博文321篇,共被浏览770834

本站已经建立2458天!

热门文章
文章归档
回到顶部