EADST

Check the Index and Token from Tiktoken

Check the Index and Token from Tiktoken

import base64
path = "/home/your_dict_path.tiktoken"
f = open(path, "rb").read()
index = 0
for line in f.splitlines():
    l = line.split()
    print("index: ", l[1])
    print("encode: ", l[0])
    print("decode: ", base64.b64decode(l[0]))
    index += 1
    if index > 20:
        break

Reference Code

相关标签
About Me
XD
Goals determine what you are going to be.
Category
标签云
GPTQ CAM JSON 证件照 Pytorch CEIR Zip Linux BTC Distillation TensorFlow API Anaconda VSCode 关于博主 LaTeX RGB UI Dataset uwsgi Shortcut Image2Text 签证 Vmess Jetson GPT4 VGG-16 搞笑 域名 mmap Conda Windows WAN git Vim 继承 净利润 Video diffusers Nginx OpenCV Math UNIX Base64 Google Hotel v2ray git-lfs PIP XML SAM Firewall PyCharm Freesound FlashAttention Attention Excel Statistics Card 报税 Random LLM LeetCode TSV Website Pandas PyTorch GIT Knowledge Tensor Streamlit uWSGI WebCrawler PDB logger Sklearn Augmentation Password v0.dev Paddle GoogLeNet Quantization Baidu Gemma Breakpoint EXCEL SVR Bitcoin Permission Cloudreve Qwen2 Logo Paper ModelScope OCR 阿里云 NLTK Translation Clash LoRA AI Interview TensorRT Michelin Use RAR BeautifulSoup tar llama.cpp DeepSeek CC InvalidArgumentError torchinfo Qwen2.5 Ptyhon Disk Crawler Magnet 公式 多线程 QWEN Pillow SQL Bin tqdm 算法题 Hungarian Numpy PDF Markdown BF16 CV Plate Miniforge Tiktoken YOLO Llama Proxy Tracking Review SQLite FP16 Django IndexTTS2 Docker Template Claude NLP Datetime Land Python FP8 MD5 OpenAI printf FP64 Bert Mixtral Git FP32 COCO Web DeepStream HaggingFace transformers Plotly Safetensors 飞书 Animate Pickle Diagram Transformers Quantize Data Bipartite Github Qwen 版权 Ubuntu 腾讯云 ResNet-50 LLAMA 多进程 Jupyter Algorithm CLAP XGBoost VPN Food Heatmap HuggingFace CTC SPIE TTS scipy hf CSV NameSilo Input 音频 Color Domain Hilton ChatGPT FastAPI CUDA C++ 财报 ONNX GGML
站点统计

本站现有博文309篇,共被浏览735970

本站已经建立2373天!

热门文章
文章归档
回到顶部