EADST

Code for SPIE paper - CEIR

CEIR

This project is for the SPIE paper - Novel Receipt Recognition with Deep Learning Algorithms. In this paper, we propose an end-to-end novel receipt recognition system for capturing effective information from receipts (CEIR).

CEIR code and results have been made available at: CEIR code

CEIR system demo is available at: CEIR Demo

The CEIR has three parts: preprocess, detection, recognition.

Introduction

In the preprocessing method, by converting the image to gray scale and obtaining the gradient with the Sobel operator, the outline of the receipt area is decided by morphological transformations with the elliptic kernel.

In text detection, the modified connectionist text proposal network to execute text detection. The pytorch implementation of detection is based on CTPN.

In text recognition, the convolutional recurrent neural network with the connectionist temporal classification with maximum entropy regularization as a loss function to update the weights in networks and extract the characters from receipt. The pytorch implementation of recognition is based on CRNN and ENESCTC.

We validate our system with the scanned receipts optical character recognition and information extraction (SROIE) database.

Dependency

Python 3.6.3 1. torch==1.4 2. torchvision 3. opencv-python 4. lmdb

Prediction

  1. Download pre-trained model from Google Drive and put the file under ./detection/output/ folder.

  2. Change the image name to demo.jpg in the CEIR folder.

  3. Run python ceir_crop.py for stage 1.
  4. Run python ceir_detect.py for stage 2.
  5. Run python ceir_recognize.py for stage 3.

  6. The result will be saved in ./result/.

Training

  1. Put dataset in ./dataset/train/image and ./dataset/train/label.

  2. Preprocess parameters can be changed in ./preprocess/crop.py.

  3. In the detection part, the ./detection/config.py is used for configuring. After that, run python train.py in the detection folder.

  4. In recognition, you need to change trainroot and other parameters in train.sh, then run sh train.sh to train.

相关标签
About Me
XD
Goals determine what you are going to be.
Category
标签云
Food Logo PDB Statistics Attention GGML YOLO Use PIP Data 第一性原理 Pillow icon Numpy Input Zip Github Crawler Django Baidu LoRA Land Python diffusers Gemma 继承 Bin GIT Cloudreve SVR IndexTTS2 UI Ubuntu Bitcoin Search Distillation 证件照 CAM Tracking Base64 Sklearn CUDA Transformers BTC Proxy Video TTS Qwen WAN Jupyter Pandas Michelin 云服务器 Tiktoken VPN API HaggingFace 签证 Animate LLM CEIR Claude git VSCode v0.dev 阿里云 Diagram Dataset LeetCode AI CV 递归学习法 Paddle 公式 关于博主 ONNX Translation 图标 Ptyhon BF16 Color Rebuttal printf Augmentation Pickle Paper 音频 PyTorch OCR hf tqdm Card FP32 Math 强化学习 Plotly scipy llama.cpp QWEN Mixtral Markdown FastAPI FP16 Heatmap SQLite TSV torchinfo News Qwen2.5 Datetime SQL Streamlit 腾讯云 Plate v2ray Shortcut Vmess COCO ChatGPT Hungarian Template PyCharm FP64 mmap Quantization Knowledge logger VGG-16 PDF 多进程 Random uWSGI Nginx HuggingFace EXCEL MD5 FlashAttention transformers Review DeepSeek 图形思考法 Interview DeepStream TensorFlow CLAP Safetensors 飞书 Git 财报 RGB Miniforge Docker 多线程 Permission ModelScope UNIX 净利润 Freesound TensorRT Pytorch Tensor RAR git-lfs Clash FP8 GPT4 XGBoost C++ SAM uwsgi Web GoogLeNet GPTQ Website Firewall Quantize NameSilo CTC Windows LLAMA tar WebCrawler 顶会 OpenCV Qwen2 Llama Agent NLTK InvalidArgumentError Hilton Anaconda OpenAI Magnet Excel Password Vim BeautifulSoup Breakpoint Hotel Bert 域名 CSV Image2Text 报税 ResNet-50 Domain Algorithm 算法题 Jetson Disk 版权 NLP Conda XML CC Bipartite Google LaTeX JSON Linux SPIE 搞笑
站点统计

本站现有博文324篇,共被浏览823410

本站已经建立2529天!

热门文章
文章归档
回到顶部