EADST

Pytorch: Freeze layers to Finetune the Model

Pytorch: Freeze layers to Finetune the Model.

for k, v in model.named_parameters():
    print(k) # check the layer name
for k, v in model.named_parameters():
    if k in ["last.weight", "last.bias"]: # freeze the layer with the given name list
        v.requires_grad = True
    else:
        v.requires_grad = False
       
      
相关标签
About Me
XD
Goals determine what you are going to be.
Category
标签云
Attention 强化学习 RGB Markdown Knowledge Card Land Input NameSilo GoogLeNet ChatGPT Color Tracking TSV logger HaggingFace Safetensors XML FlashAttention SAM 公式 Qwen diffusers Llama GPT4 NLTK Jetson Jupyter Hungarian InvalidArgumentError Shortcut Vim Augmentation Quantization NLP UNIX Linux Excel Docker Permission C++ Breakpoint Ptyhon Search Math DeepStream SVR Transformers Disk 域名 LaTeX 飞书 Conda Proxy Rebuttal UI QWEN Paper FP16 Cloudreve WAN Firewall Anaconda AI Bipartite 版权 Website 音频 PDB 多进程 MD5 Tensor 关于博主 Heatmap Ubuntu Zip Miniforge Use TensorRT 腾讯云 API Video Bitcoin Bin Gemma 云服务器 签证 Magnet VGG-16 git Statistics Algorithm Review Logo 图标 FP32 Random FP8 BF16 CC tqdm Numpy Hotel LLAMA LLM COCO Pytorch DeepSeek Freesound OpenCV Pillow CLAP uWSGI Github SQLite Datetime LoRA icon Template News RAR GIT Qwen2.5 Agent Data ONNX Claude Crawler Bert CAM OpenAI IndexTTS2 XGBoost v2ray llama.cpp YOLO 净利润 HuggingFace 财报 GGML PyTorch Password Michelin Plotly Base64 BeautifulSoup 算法题 SPIE hf Translation FP64 Nginx Paddle JSON Django PyCharm transformers Quantize Plate VSCode Interview Mixtral PIP printf Web GPTQ Animate EXCEL WebCrawler OCR Clash Windows Vmess CV mmap Baidu TensorFlow CTC Qwen2 ModelScope BTC Pandas PDF 论文速读 多线程 Domain Pickle TTS scipy Hilton CEIR Streamlit LeetCode Sklearn 证件照 递归学习法 CUDA SQL 第一性原理 继承 报税 VPN tar Python git-lfs CSV v0.dev 顶会 uwsgi Food torchinfo ResNet-50 搞笑 Distillation Diagram Dataset 图形思考法 Image2Text FastAPI Tiktoken 阿里云 Google Git
站点统计

本站现有博文326篇,共被浏览825009

本站已经建立2530天!

热门文章
文章归档
回到顶部