EADST

Check the Index and Token from Tiktoken

Check the Index and Token from Tiktoken

import base64
path = "/home/your_dict_path.tiktoken"
f = open(path, "rb").read()
index = 0
for line in f.splitlines():
    l = line.split()
    print("index: ", l[1])
    print("encode: ", l[0])
    print("decode: ", base64.b64decode(l[0]))
    index += 1
    if index > 20:
        break

Reference Code

相关标签
About Me
XD
Goals determine what you are going to be.
Category
标签云
PyTorch tqdm Input Breakpoint RGB GoogLeNet Jetson Qwen Vim Sklearn git 域名 Shortcut DeepSeek llama.cpp Streamlit Claude Plate GGML BeautifulSoup SAM Google LLAMA XML Web Interview v2ray PDF InvalidArgumentError 图形思考法 SPIE Random Gemma Template LLM API GPTQ PyCharm Permission Linux C++ Website Mixtral 签证 Video Transformers Baidu Llama Ubuntu IndexTTS2 JSON FlashAttention 腾讯云 TTS BTC SQLite Github Algorithm UNIX PDB Conda OCR Vmess Pickle Base64 CC CSV 第一性原理 报税 财报 uwsgi Quantize Domain NameSilo Django 版权 Hotel Git Pytorch 顶会 Bert Math News diffusers Augmentation ChatGPT CEIR Tensor 强化学习 ResNet-50 SVR Paddle NLTK FP64 Password LaTeX 公式 FP32 NLP Jupyter WAN CLAP Michelin Knowledge Tiktoken Pillow EXCEL Datetime Firewall GPT4 净利润 递归学习法 PIP Heatmap Bipartite Cloudreve Quantization XGBoost uWSGI 音频 CUDA 算法题 Bin 证件照 多进程 Docker FP16 Nginx Statistics Search transformers Color Excel HaggingFace Anaconda DeepStream BF16 关于博主 Qwen2 Crawler Zip git-lfs Distillation torchinfo Qwen2.5 WebCrawler Numpy logger Diagram 继承 SQL 阿里云 Freesound CAM CTC Magnet Use VGG-16 MD5 LoRA ONNX scipy RAR Safetensors Review Bitcoin Ptyhon Plotly Miniforge Disk mmap UI Land OpenAI ModelScope COCO hf printf AI 飞书 Attention v0.dev GIT Card FP8 Python YOLO Food Tracking Hungarian 多线程 Hilton QWEN VSCode Windows OpenCV Clash Dataset Image2Text TensorFlow LeetCode FastAPI Logo Data Pandas Paper VPN Markdown tar 搞笑 Animate Proxy TSV TensorRT HuggingFace Agent CV Translation
站点统计

本站现有博文320篇,共被浏览756547

本站已经建立2420天!

热门文章
文章归档
回到顶部