Check the Index and Token from Tiktoken

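import base64

# Each line of a .tiktoken file holds a base64-encoded token and its index.
path = "/home/your_dict_path.tiktoken"

with open(path, "rb") as f:
    lines = f.read().splitlines()

# Show only the first 21 lines, as a quick sanity check of the vocabulary.
for line in lines[:21]:
    token_b64, index = line.split()
    print("index: ", int(index))
    print("encode: ", token_b64)
    print("decode: ", base64.b64decode(token_b64))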
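
The loop above reads each line as a base64-encoded token followed by its index. To look tokens up in either direction instead of only printing them, the same parsing can be collected into dictionaries. The following is a minimal sketch under that assumption about the file layout; the helper name `parse_tiktoken_lines` and the three sample tokens are hypothetical and built in memory, so no real vocabulary file is needed to run it.

```python
import base64


def parse_tiktoken_lines(data: bytes) -> dict:
    """Map each decoded token (bytes) to its integer index.

    Assumes every non-blank line is "<base64 token> <index>".
    """
    ranks = {}
    for line in data.splitlines():
        if not line.strip():
            continue  # skip blank lines
        token_b64, index = line.split()
        ranks[base64.b64decode(token_b64)] = int(index)
    return ranks


# Hypothetical three-token vocabulary in the same two-column layout.
sample_tokens = [b"!", b"hello", b" world"]
sample = b"\n".join(
    base64.b64encode(tok) + b" " + str(i).encode()
    for i, tok in enumerate(sample_tokens)
)

ranks = parse_tiktoken_lines(sample)

# Reverse mapping: index -> token bytes.
index_to_token = {index: tok for tok, index in ranks.items()}

print(ranks[b"hello"])
print(index_to_token[2])
```

For a real file, pass the bytes returned by `open(path, "rb").read()` to the helper in place of `sample`.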

Reference Code
