EADST

Check the Index and Token from Tiktoken

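import base64

# Path to a .tiktoken vocabulary file; each line is "<base64 token> <index>"
path = "/home/your_dict_path.tiktoken"

with open(path, "rb") as f:
    for count, line in enumerate(f):
        # Only inspect the first 21 entries, as in the original loop
        if count > 20:
            break
        parts = line.split()
        if len(parts) != 2:
            continue  # skip blank or malformed lines
        token_b64, index = parts
        print("index: ", int(index))
        print("encode: ", token_b64.decode("ascii"))
        print("decode: ", base64.b64decode(token_b64))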
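
The loop above only prints entries. To look up a token by its index, or an index by its token, the same parsing can be wrapped in a small helper. This is a minimal sketch, assuming every non-empty line has the form `<base64 token> <index>`; the name `parse_tiktoken_lines` and the three-line `sample` are hypothetical and stand in for a real vocabulary file.

```python
import base64


def parse_tiktoken_lines(data: bytes) -> dict:
    """Map each decoded token (bytes) to its integer index."""
    ranks = {}
    for line in data.splitlines():
        if not line.strip():
            continue  # ignore blank lines
        token_b64, index = line.split()
        ranks[base64.b64decode(token_b64)] = int(index)
    return ranks


# Hypothetical sample in the same layout as a .tiktoken file
sample = b"IQ== 0\nIg== 1\naGVsbG8= 2\n"

ranks = parse_tiktoken_lines(sample)
# Reverse mapping: index -> token bytes
index_to_token = {index: token for token, index in ranks.items()}

print(ranks[b"hello"])
print(index_to_token[0])
```

For a real file, the bytes can be read with `open(path, "rb").read()` and passed to the helper in place of `sample`.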

Reference Code
