EADST

Check the Index and Token from Tiktoken

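import base64

# Path to a .tiktoken vocabulary file. Each non-empty line holds a
# base64-encoded token followed by its integer index, separated by a space.
path = "/home/your_dict_path.tiktoken"

# Print the first 21 entries (indices 0 through 20 in file order).
with open(path, "rb") as f:
    shown = 0
    for line in f:
        parts = line.split()
        if len(parts) != 2:
            continue  # skip blank or malformed lines
        token_b64, index = parts
        print("index: ", int(index))
        print("encode: ", token_b64)
        print("decode: ", base64.b64decode(token_b64))
        shown += 1
        if shown > 20:
            break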
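
To look up tokens by index rather than just print them, the same parsing can be wrapped in a small helper. This is a minimal sketch, not part of tiktoken itself: the function name parse_tiktoken_lines and the sample lines are hypothetical, and it assumes every valid line has exactly two whitespace-separated fields (base64 token, then index).

```python
import base64


def parse_tiktoken_lines(lines):
    """Map each decoded token (bytes) to its integer index.

    `lines` is any iterable of bytes lines in the
    "<base64 token> <index>" layout; other lines are skipped.
    """
    ranks = {}
    for line in lines:
        parts = line.split()
        if len(parts) != 2:
            continue  # blank or malformed line
        token_b64, index = parts
        ranks[base64.b64decode(token_b64)] = int(index)
    return ranks


# Hypothetical sample lines standing in for a real vocabulary file.
sample = [b"IQ== 0", b"Ig== 1", b"", b"aGVsbG8= 2"]
ranks = parse_tiktoken_lines(sample)
print(ranks)
```

With a real file, the helper can be fed the open file object directly, for example parse_tiktoken_lines(open(path, "rb")), since iterating a binary file yields bytes lines.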
