Code for SPIE paper - CEIR

CEIR

This project contains the code for the SPIE paper "Novel Receipt Recognition with Deep Learning Algorithms". In this paper, we propose a novel end-to-end receipt recognition system for capturing effective information from receipts (CEIR).

CEIR code and results have been made available at: CEIR code

CEIR system demo is available at: CEIR Demo

CEIR has three stages: preprocessing, text detection, and text recognition.

Introduction

In preprocessing, the image is converted to grayscale and its gradient is computed with the Sobel operator. The outline of the receipt area is then determined by morphological transformations with an elliptical kernel.

In text detection, a modified connectionist text proposal network (CTPN) is used to locate the text. The PyTorch implementation of detection is based on CTPN.

In text recognition, a convolutional recurrent neural network (CRNN) extracts the characters from the receipt. Its weights are updated using connectionist temporal classification (CTC) with maximum entropy regularization as the loss function. The PyTorch implementation of recognition is based on CRNN and ENESCTC.
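
To illustrate how characters are read out of a CTC-trained network, here is a minimal sketch of greedy CTC decoding in plain Python. It is not taken from the CEIR code: the function name ctc_greedy_decode, the use of index 0 as the blank label, and the mapping of index i to alphabet[i - 1] are assumptions for the example.

```python
def ctc_greedy_decode(frame_scores, alphabet, blank=0):
    """Decode per-frame class scores into a string with greedy CTC rules.

    frame_scores: one list of class scores per time step.
    alphabet: characters for class indices 1..len(alphabet) (assumed layout).
    blank: index of the CTC blank label (assumed to be 0 here).
    """
    # Pick the highest-scoring class at every time step.
    best_path = [
        max(range(len(scores)), key=scores.__getitem__)
        for scores in frame_scores
    ]

    chars = []
    previous = blank
    for index in best_path:
        # Keep a label only if it is not blank and not a repeat of the
        # previous frame; a blank between two equal labels keeps both.
        if index != blank and index != previous:
            chars.append(alphabet[index - 1])
        previous = index
    return "".join(chars)
```

Under these assumptions, a best path such as "1, 1, blank, ., blank, 5, 5, blank, 0" collapses to the string "1.50": repeated labels merge, and blanks are dropped.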

We validate our system on the scanned receipts optical character recognition and information extraction (SROIE) database.

Dependency

Python 3.6.3

  1. torch==1.4
  2. torchvision
  3. opencv-python
  4. lmdb

Prediction

  1. Download the pre-trained model from Google Drive and put the file in the ./detection/output/ folder.

  2. Rename your input image in the CEIR folder to demo.jpg.

  3. Run python ceir_crop.py for stage 1.
  4. Run python ceir_detect.py for stage 2.
  5. Run python ceir_recognize.py for stage 3.

  6. The result will be saved in ./result/.
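
The three stage scripts above can also be chained from one small driver. The following is a sketch only, assuming it is run from the CEIR folder with demo.jpg already in place; the names STAGE_SCRIPTS and run_pipeline and the runner parameter are hypothetical and not part of the repository.

```python
import subprocess
import sys

# Stage scripts in the order given in the steps above.
STAGE_SCRIPTS = ["ceir_crop.py", "ceir_detect.py", "ceir_recognize.py"]


def run_pipeline(workdir=".", runner=subprocess.run):
    """Run the three CEIR stages in order, stopping at the first failure.

    runner defaults to subprocess.run; it is a parameter only so that the
    function can be exercised without launching the real scripts.
    """
    for script in STAGE_SCRIPTS:
        # check=True makes subprocess.run raise CalledProcessError if a
        # stage exits with a non-zero status, so later stages are skipped.
        runner([sys.executable, script], cwd=workdir, check=True)


if __name__ == "__main__":
    run_pipeline()
```

Using sys.executable keeps all three stages on the same Python interpreter (and therefore the same installed packages) as the driver itself.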

Training

  1. Put the dataset in ./dataset/train/image and ./dataset/train/label.

  2. Preprocess parameters can be changed in ./preprocess/crop.py.

  3. For detection, set the training configuration in ./detection/config.py, then run python train.py in the detection folder.

  4. For recognition, change trainroot and other parameters in train.sh, then run sh train.sh to train.
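
Before training, it can help to confirm that every image has a matching label file. The sketch below assumes that an image and its label share the same file name stem and that labels use a .txt extension, as in the SROIE data; that naming rule and the helper name find_unpaired are assumptions, not something the CEIR code specifies.

```python
from pathlib import Path


def find_unpaired(image_dir, label_dir, label_suffix=".txt"):
    """Return (images_without_label, labels_without_image) as sorted stems.

    Assumes a label file has the same stem as its image, e.g. X.jpg / X.txt.
    """
    image_stems = {p.stem for p in Path(image_dir).iterdir() if p.is_file()}
    label_stems = {
        p.stem for p in Path(label_dir).iterdir() if p.suffix == label_suffix
    }
    return sorted(image_stems - label_stems), sorted(label_stems - image_stems)


if __name__ == "__main__":
    missing_labels, missing_images = find_unpaired(
        "./dataset/train/image", "./dataset/train/label"
    )
    print("images without a label:", missing_labels)
    print("labels without an image:", missing_images)
```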
