EADST

Transformers Demo for DeepSeek-R1-Distill-Qwen-7B

Transformers Demo for DeepSeek-R1-Distill-Qwen-7B

from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "/your_deepseek-ai_DeepSeek-R1-Distill-Qwen-7B_path"

model = AutoModelForCausalLM.from_pretrained(
    model_name,
    torch_dtype="auto",
    device_map="auto"
)
tokenizer = AutoTokenizer.from_pretrained(model_name)

prompt = "Give me a short introduction to large language model."
messages = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": prompt}
]
text = tokenizer.apply_chat_template(
    messages,
    tokenize=False,
    add_generation_prompt=True
)
model_inputs = tokenizer([text], return_tensors="pt").to(model.device)

generated_ids = model.generate(
    **model_inputs,
    max_new_tokens=2048
)
generated_ids = [
    output_ids[len(input_ids):] for input_ids, output_ids in zip(model_inputs.input_ids, generated_ids)
]

response = tokenizer.batch_decode(generated_ids, skip_special_tokens=False)[0] # show special tokens

print("Question: \n", text)
print("Answer: \n", response)
相关标签
About Me
XD
Goals determine what you are going to be.
Category
标签云
递归学习法 Data Food Pandas Vim Google Jupyter Bert Michelin TensorFlow uwsgi Distillation Freesound Gemma 音频 Ptyhon diffusers Miniforge GPTQ CV Tiktoken Website YOLO FastAPI NameSilo OpenCV Augmentation Shortcut printf MD5 SVR FP64 CEIR 证件照 Conda Excel Math PDB 财报 Template QWEN Password 公式 Logo WAN Knowledge Windows Video HaggingFace LaTeX PIP Nginx Use ModelScope FP32 Rebuttal icon Translation Animate NLP Land 版权 Tracking 关于博主 IndexTTS2 RGB git-lfs CC scipy TensorRT uWSGI logger Transformers LLAMA tar v2ray v0.dev Zip Bin Interview transformers Pytorch 继承 Quantize GoogLeNet Pillow Hungarian API mmap Input FP8 Claude Cloudreve Attention GGML Random Django OpenAI LLM FlashAttention Linux BTC VPN DeepSeek Vmess Permission LoRA ChatGPT Tensor Diagram Qwen2.5 Magnet SQL 第一性原理 Domain Crawler NLTK DeepStream tqdm VGG-16 CTC Github Pickle Baidu XGBoost Review Card UNIX 强化学习 多线程 算法题 CLAP 报税 TSV CUDA Git Statistics BeautifulSoup SPIE Mixtral XML Ubuntu Jetson Hotel Web Qwen 阿里云 LeetCode Paddle GPT4 Algorithm FP16 域名 飞书 TTS ONNX llama.cpp Heatmap hf ResNet-50 Breakpoint PDF OCR BF16 HuggingFace UI Sklearn Datetime SQLite 图形思考法 Numpy Docker CAM WebCrawler Agent 多进程 Streamlit Clash Disk SAM PyTorch Quantization 腾讯云 torchinfo InvalidArgumentError Hilton Paper COCO Bipartite Proxy Anaconda Bitcoin Plate Search Firewall C++ AI Image2Text Python 图标 git 搞笑 EXCEL 顶会 VSCode News RAR 净利润 Base64 GIT Plotly 签证 Qwen2 Color Dataset Llama JSON 云服务器 CSV Markdown Safetensors PyCharm
站点统计

本站现有博文324篇,共被浏览821687

本站已经建立2527天!

热门文章
文章归档
回到顶部