EADST

Check the Index and Token from Tiktoken

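import base64

path = "/home/your_dict_path.tiktoken"

# Each line of a .tiktoken file is "<base64-encoded token> <index>".
with open(path, "rb") as f:
    lines = f.read().splitlines()

# Print the first 21 entries (indices 0 through 20).
for line in lines[:21]:
    token_b64, token_index = line.split()
    print("index: ", int(token_index))
    print("encode: ", token_b64)
    print("decode: ", base64.b64decode(token_b64))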
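
The same line format can also be loaded into a lookup table rather than printed. The following is a minimal sketch, assuming every non-empty line holds a base64-encoded token followed by its index, as in the listing above. The helper name `parse_tiktoken_ranks` and the three-token sample are hypothetical and only serve as illustration; they are not part of the tiktoken library or of any real vocabulary.

```python
import base64


def parse_tiktoken_ranks(data: bytes) -> dict[bytes, int]:
    """Parse .tiktoken-style content: one '<base64 token> <index>' pair per line."""
    ranks = {}
    for line in data.splitlines():
        if not line.strip():
            continue  # skip blank lines
        token_b64, rank = line.split()
        ranks[base64.b64decode(token_b64)] = int(rank)
    return ranks


# Tiny in-memory sample (made up for illustration, not a real vocabulary).
sample = b"\n".join(
    base64.b64encode(tok) + b" " + str(i).encode()
    for i, tok in enumerate([b"!", b"hello", b" world"])
)

ranks = parse_tiktoken_ranks(sample)                           # token bytes -> index
index_to_token = {rank: tok for tok, rank in ranks.items()}    # index -> token bytes

print(ranks[b"hello"])
print(index_to_token[2])
```

To inspect a real file, pass its contents instead of the sample, for example `parse_tiktoken_ranks(open(path, "rb").read())`. Keeping both mappings makes it possible to look up a token by its index as well as an index by its token.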

Reference Code
