EADST

Code for SPIE paper - CEIR

CEIR

This project is for the SPIE paper - Novel Receipt Recognition with Deep Learning Algorithms. In this paper, we propose an end-to-end novel receipt recognition system for capturing effective information from receipts (CEIR).

CEIR code and results have been made available at: CEIR code

CEIR system demo is available at: CEIR Demo

The CEIR has three parts: preprocess, detection, recognition.

Introduction

In the preprocessing method, by converting the image to gray scale and obtaining the gradient with the Sobel operator, the outline of the receipt area is decided by morphological transformations with the elliptic kernel.

In text detection, the modified connectionist text proposal network to execute text detection. The pytorch implementation of detection is based on CTPN.

In text recognition, the convolutional recurrent neural network with the connectionist temporal classification with maximum entropy regularization as a loss function to update the weights in networks and extract the characters from receipt. The pytorch implementation of recognition is based on CRNN and ENESCTC.

We validate our system with the scanned receipts optical character recognition and information extraction (SROIE) database.

Dependency

Python 3.6.3 1. torch==1.4 2. torchvision 3. opencv-python 4. lmdb

Prediction

  1. Download pre-trained model from Google Drive and put the file under ./detection/output/ folder.

  2. Change the image name to demo.jpg in the CEIR folder.

  3. Run python ceir_crop.py for stage 1.
  4. Run python ceir_detect.py for stage 2.
  5. Run python ceir_recognize.py for stage 3.

  6. The result will be saved in ./result/.

Training

  1. Put dataset in ./dataset/train/image and ./dataset/train/label.

  2. Preprocess parameters can be changed in ./preprocess/crop.py.

  3. In the detection part, the ./detection/config.py is used for configuring. After that, run python train.py in the detection folder.

  4. In recognition, you need to change trainroot and other parameters in train.sh, then run sh train.sh to train.

相关标签
About Me
XD
Goals determine what you are going to be.
Category
标签云
GIT llama.cpp 算法题 GGML Domain LLAMA C++ WAN Gemma Github News 财报 Translation uwsgi Tiktoken 论文速读 PIP Tensor Bipartite Bert FP32 Diagram OpenCV Miniforge Image2Text TTS PDB Algorithm Math Agent icon CC ModelScope Pillow FP16 顶会 Logo TensorRT Permission Input Clash Conda FP8 Quantize Qwen2 CUDA Hilton diffusers Transformers Rebuttal Statistics VPN 签证 飞书 NLTK GPTQ XML Bitcoin Safetensors Django SAM 图形思考法 tqdm 腾讯云 Hungarian Sklearn Bin Nginx Datetime 域名 PyCharm 递归学习法 Claude 第一性原理 Docker COCO LaTeX Pickle UNIX ChatGPT Jetson Paddle Dataset 证件照 TensorFlow SQL Baidu SVR CSV Website Cloudreve MD5 强化学习 Windows Password Food 论文 Web Augmentation Jupyter Data Vmess Vim OCR Shortcut Freesound scipy Card AI SQLite Quantization SPIE 报税 Paper 云服务器 Color 继承 transformers BF16 IndexTTS2 CLAP Random Attention PyTorch Plotly InvalidArgumentError 净利润 TSV DeepStream EXCEL VSCode 阿里云 Video Git JSON 公式 Qwen HaggingFace HuggingFace Firewall DeepSeek Use git CAM OpenAI logger ResNet-50 Distillation Knowledge Template NameSilo 关于博主 QWEN 音频 Breakpoint FP64 Streamlit GoogLeNet LeetCode GPT4 Ptyhon Linux printf ONNX Crawler Review uWSGI Qwen2.5 Tracking VGG-16 PDF torchinfo Hotel BeautifulSoup CV Numpy CEIR Base64 Interview API FastAPI WebCrawler Disk git-lfs Python Search CTC Google Proxy Anaconda FlashAttention Magnet RAR Michelin 搞笑 hf Land UI v0.dev 图标 Excel NLP Zip mmap RGB Plate Pytorch Ubuntu v2ray YOLO BTC XGBoost tar Pandas LoRA Mixtral 多线程 Markdown LLM Llama Heatmap Animate 多进程 版权
站点统计

本站现有博文328篇,共被浏览851164

本站已经建立2558天!

热门文章
文章归档
回到顶部