EADST

Transformers Demo for DeepSeek-R1-Distill-Qwen-7B

Transformers Demo for DeepSeek-R1-Distill-Qwen-7B

from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "/your_deepseek-ai_DeepSeek-R1-Distill-Qwen-7B_path"

model = AutoModelForCausalLM.from_pretrained(
    model_name,
    torch_dtype="auto",
    device_map="auto"
)
tokenizer = AutoTokenizer.from_pretrained(model_name)

prompt = "Give me a short introduction to large language model."
messages = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": prompt}
]
text = tokenizer.apply_chat_template(
    messages,
    tokenize=False,
    add_generation_prompt=True
)
model_inputs = tokenizer([text], return_tensors="pt").to(model.device)

generated_ids = model.generate(
    **model_inputs,
    max_new_tokens=2048
)
generated_ids = [
    output_ids[len(input_ids):] for input_ids, output_ids in zip(model_inputs.input_ids, generated_ids)
]

response = tokenizer.batch_decode(generated_ids, skip_special_tokens=False)[0] # show special tokens

print("Question: \n", text)
print("Answer: \n", response)
相关标签
About Me
XD
Goals determine what you are going to be.
Category
标签云
Algorithm Use UI Paper Nginx Windows XGBoost Data Hotel Translation TTS InvalidArgumentError SQL Base64 FP64 VSCode Markdown Shortcut scipy hf Hungarian GPT4 ChatGPT 报税 Jupyter Clash Pytorch 多线程 WAN 净利润 Michelin HuggingFace Sklearn FP32 云服务器 Linux News CV printf VPN C++ IndexTTS2 torchinfo Search YOLO Llama Diagram GPTQ SPIE LaTeX uWSGI CLAP QWEN MD5 Permission Land 版权 Magnet Django VGG-16 CTC Crawler WebCrawler uwsgi GGML FP8 SQLite Food CSV 音频 递归学习法 v2ray Python OpenCV EXCEL Video Zip Bin Tensor Github Ubuntu NLP Random Datetime Agent diffusers Streamlit LLM JSON Hilton Dataset 顶会 Color SVR Freesound 图标 Tracking Safetensors 算法题 关于博主 CEIR 签证 OCR ONNX Excel Card Breakpoint Disk BTC Baidu DeepSeek 腾讯云 tqdm FP16 icon Rebuttal Numpy Mixtral CAM Paddle BF16 Vim Heatmap llama.cpp Math TensorFlow Interview Vmess Conda Knowledge 多进程 域名 Transformers 公式 v0.dev ModelScope BeautifulSoup Google Quantization Bipartite AI Password RGB Git Pandas PyTorch Ptyhon 第一性原理 Cloudreve Tiktoken ResNet-50 LoRA 飞书 Gemma Anaconda Jetson 证件照 Review LeetCode OpenAI Plate XML tar Template CC 搞笑 CUDA Domain DeepStream SAM Augmentation HaggingFace Qwen2 Web 财报 Animate COCO FlashAttention PDB PIP Qwen2.5 Docker 阿里云 PyCharm PDF TSV Input Quantize Firewall 继承 Bitcoin NLTK Bert Statistics Attention TensorRT UNIX Pillow mmap Website git transformers Logo Proxy Distillation GIT 强化学习 Miniforge LLAMA logger FastAPI RAR Plotly git-lfs API GoogLeNet Claude 图形思考法 Pickle NameSilo Image2Text Qwen
站点统计

本站现有博文323篇,共被浏览803495

本站已经建立2503天!

热门文章
文章归档
回到顶部