EADST

Check the Index and Token from Tiktoken

Check the Index and Token from Tiktoken

import base64
path = "/home/your_dict_path.tiktoken"
f = open(path, "rb").read()
index = 0
for line in f.splitlines():
    l = line.split()
    print("index: ", l[1])
    print("encode: ", l[0])
    print("decode: ", base64.b64decode(l[0]))
    index += 1
    if index > 20:
        break

Reference Code

相关标签
About Me
XD
Goals determine what you are going to be.
Category
标签云
transformers Excel hf BTC Cloudreve Crawler 多进程 SVR Hotel Attention Llama Plotly tar mmap Template Qwen2 Interview 证件照 公式 Paddle PyTorch Safetensors InvalidArgumentError Linux Shortcut WebCrawler llama.cpp PDF RGB Data Gemma Git Hungarian Random Website Food Windows COCO NLP GPT4 CTC ModelScope Disk torchinfo Numpy VSCode git diffusers Heatmap Land XML Conda Miniforge OpenCV Image2Text 搞笑 Claude Freesound 域名 YOLO Jetson Pandas 报税 ChatGPT Clash printf BF16 Anaconda Password 多线程 SQL 签证 Datetime Google CC Vim v2ray Domain Bert Review Qwen uWSGI Nginx LoRA Mixtral Bitcoin 继承 Knowledge EXCEL 阿里云 Tensor FP16 Algorithm Translation ONNX JSON Magnet TSV Permission IndexTTS2 DeepSeek FastAPI Paper 算法题 Pillow BeautifulSoup 递归学习法 TTS CSV OpenAI TensorRT Zip CLAP Jupyter PyCharm Augmentation Bin Quantize Input CEIR FP8 Plate 财报 Ubuntu GoogLeNet Transformers AI MD5 FP32 Vmess QWEN Ptyhon 关于博主 净利润 Breakpoint API Markdown Hilton Tiktoken scipy Python Animate Web Use Michelin Firewall Bipartite Dataset LeetCode FlashAttention 腾讯云 音频 OCR Github Proxy Card LLAMA git-lfs XGBoost PIP UI DeepStream HuggingFace 第一性原理 LaTeX GGML Baidu Sklearn SAM Agent 飞书 Django uwsgi Video Diagram Logo GPTQ Qwen2.5 Statistics logger Pickle Color HaggingFace Tracking SQLite VGG-16 版权 CV 图形思考法 WAN ResNet-50 RAR UNIX CAM Math NameSilo v0.dev CUDA GIT SPIE Distillation FP64 tqdm NLTK C++ Streamlit VPN Base64 Docker TensorFlow PDB Quantization Pytorch LLM
站点统计

本站现有博文316篇,共被浏览746719

本站已经建立2394天!

热门文章
文章归档
回到顶部