EADST

Check the Index and Token from Tiktoken

Check the Index and Token from Tiktoken

import base64
path = "/home/your_dict_path.tiktoken"
f = open(path, "rb").read()
index = 0
for line in f.splitlines():
    l = line.split()
    print("index: ", l[1])
    print("encode: ", l[0])
    print("decode: ", base64.b64decode(l[0]))
    index += 1
    if index > 20:
        break

Reference Code

相关标签
About Me
XD
Goals determine what you are going to be.
Category
标签云
CV Markdown Website VPN 云服务器 CTC Hotel AI FP64 Video Llama CEIR logger 公式 GoogLeNet Michelin Heatmap Docker Paper 报税 Safetensors 域名 Git QWEN News VGG-16 Clash 顶会 Disk FastAPI v2ray Algorithm Transformers GPTQ Random Dataset Sklearn DeepSeek Streamlit Logo Shortcut SAM SPIE Hungarian ResNet-50 WebCrawler Vmess Claude PDB Diagram Tiktoken Math CAM Qwen2.5 FP32 Linux 签证 COCO tar printf CC Python Password Attention Magnet Plotly JSON Food CLAP Use RAR NLP Zip Plate diffusers tqdm Bipartite Color YOLO FP8 BF16 GIT Agent git DeepStream BTC Mixtral NLTK Breakpoint 音频 腾讯云 ONNX Google LLM UNIX 图形思考法 TSV RGB LeetCode mmap Translation XML GPT4 Crawler C++ Qwen2 Conda Domain Animate NameSilo 第一性原理 LLAMA 阿里云 InvalidArgumentError uwsgi Baidu 飞书 LaTeX Firewall Augmentation 算法题 Gemma OCR OpenAI PDF LoRA WAN Tracking Nginx ChatGPT 强化学习 IndexTTS2 GGML TensorRT Ubuntu Ptyhon Cloudreve Web PIP Bitcoin torchinfo Quantize transformers Django Proxy Bin VSCode Jetson Knowledge BeautifulSoup hf Vim SVR Pillow Anaconda TTS git-lfs Qwen PyCharm 搞笑 Interview XGBoost scipy Windows CUDA Pickle CSV Numpy Input v0.dev Card HuggingFace UI API FP16 Freesound 递归学习法 多进程 Template Base64 OpenCV Datetime Github EXCEL 多线程 HaggingFace Pytorch 继承 Permission Tensor llama.cpp Miniforge Review Search Statistics Image2Text MD5 Pandas 净利润 uWSGI Quantization 证件照 版权 Jupyter FlashAttention Bert ModelScope SQLite SQL Distillation PyTorch 财报 TensorFlow Excel 关于博主 Hilton Data Paddle Land
站点统计

本站现有博文321篇,共被浏览764573

本站已经建立2442天!

热门文章
文章归档
回到顶部