EADST

Obtain Links and Download Images from Webpages

Obtain Links and Download Images from Webpages

import requests
from bs4 import BeautifulSoup

def getHTMLText(url):
    try:
        res = requests.get(url, timeout = 6)
        res.raise_for_status()
        res.encoding = res.apparent_encoding
        return res.text
    except:
        return 'Error'

def main(url):
    demo = getHTMLText(url)
    soup = BeautifulSoup(demo, 'html.parser')
    a_labels = soup.find_all('a', attrs={'href': True})

    for idx, a in enumerate(a_labels):
        link = a.get('href')
        if "res" not in link and ".jpg" in link and idx % 50 == 1:
            urls = url + link
            save_path = "./save/" + link
            with open(save_path, 'wb') as f:
                f.write(requests.get(urls).content)


url = "http://eadst.com/"
main(url)
相关标签
About Me
XD
Goals determine what you are going to be.
Category
标签云
CLAP Safetensors Michelin VSCode GIT printf v0.dev Gemma RGB HaggingFace Docker LeetCode FlashAttention PIP Hilton COCO CV Qwen 飞书 财报 AI Hungarian uwsgi Jetson Datetime Disk WebCrawler FP8 ChatGPT Animate OpenCV NLP LoRA v2ray 图形思考法 Mixtral FP16 DeepSeek Clash 强化学习 Permission transformers 关于博主 HuggingFace MD5 版权 XML FastAPI VGG-16 BF16 WAN Breakpoint Git Algorithm Anaconda FP32 Data 继承 Use XGBoost FP64 LaTeX Plotly TensorRT 签证 公式 Web tar CEIR API BeautifulSoup Django 搞笑 Conda hf ResNet-50 Linux GPT4 Knowledge Windows Interview Tracking Magnet Dataset Bipartite CTC Pillow Miniforge Nginx Image2Text Translation 递归学习法 IndexTTS2 Streamlit Freesound 算法题 SVR NLTK git-lfs Paddle GPTQ CAM Pandas Zip 顶会 SPIE Shortcut ModelScope PDF SAM Password Card Vmess Ptyhon Math git Hotel 多进程 tqdm Base64 Search TSV Tensor ONNX Diagram torchinfo Pickle 净利润 Food 阿里云 EXCEL Excel 域名 Google 证件照 NameSilo QWEN Video YOLO mmap Augmentation UI CC BTC OCR SQLite Website Distillation PyTorch Sklearn logger GGML Transformers TensorFlow Baidu JSON Statistics Bitcoin Crawler llama.cpp LLAMA Llama Markdown Land Review Vim Proxy Bin 腾讯云 Domain Qwen2.5 Claude SQL DeepStream Random Quantize Quantization Jupyter OpenAI GoogLeNet InvalidArgumentError Github Ubuntu Tiktoken 第一性原理 RAR C++ UNIX LLM Pytorch PDB Logo Paper CUDA Cloudreve uWSGI Firewall Qwen2 Python Agent 多线程 VPN Numpy Attention Bert Color 报税 Template TTS diffusers CSV 音频 Input Plate Heatmap PyCharm scipy
站点统计

本站现有博文319篇,共被浏览750597

本站已经建立2404天!

热门文章
文章归档
回到顶部