EADST

Code for SPIE paper - CEIR

CEIR

This project is for the SPIE paper - Novel Receipt Recognition with Deep Learning Algorithms. In this paper, we propose an end-to-end novel receipt recognition system for capturing effective information from receipts (CEIR).

CEIR code and results have been made available at: CEIR code

CEIR system demo is available at: CEIR Demo

The CEIR has three parts: preprocess, detection, recognition.

Introduction

In the preprocessing method, by converting the image to gray scale and obtaining the gradient with the Sobel operator, the outline of the receipt area is decided by morphological transformations with the elliptic kernel.

In text detection, the modified connectionist text proposal network to execute text detection. The pytorch implementation of detection is based on CTPN.

In text recognition, the convolutional recurrent neural network with the connectionist temporal classification with maximum entropy regularization as a loss function to update the weights in networks and extract the characters from receipt. The pytorch implementation of recognition is based on CRNN and ENESCTC.

We validate our system with the scanned receipts optical character recognition and information extraction (SROIE) database.

Dependency

Python 3.6.3 1. torch==1.4 2. torchvision 3. opencv-python 4. lmdb

Prediction

  1. Download pre-trained model from Google Drive and put the file under ./detection/output/ folder.

  2. Change the image name to demo.jpg in the CEIR folder.

  3. Run python ceir_crop.py for stage 1.
  4. Run python ceir_detect.py for stage 2.
  5. Run python ceir_recognize.py for stage 3.

  6. The result will be saved in ./result/.

Training

  1. Put dataset in ./dataset/train/image and ./dataset/train/label.

  2. Preprocess parameters can be changed in ./preprocess/crop.py.

  3. In the detection part, the ./detection/config.py is used for configuring. After that, run python train.py in the detection folder.

  4. In recognition, you need to change trainroot and other parameters in train.sh, then run sh train.sh to train.

相关标签
About Me
XD
Goals determine what you are going to be.
Category
标签云
Python WAN ResNet-50 CSV TensorRT 云服务器 tar XML HuggingFace SQL Michelin 签证 Card Ptyhon ChatGPT BTC Qwen2.5 Cloudreve Vim Gemma FP16 Breakpoint ModelScope YOLO Safetensors Distillation Base64 GPTQ Github Paddle 证件照 Anaconda 腾讯云 CTC Hungarian Markdown Agent Animate 算法题 Color ONNX UI v2ray PyTorch DeepSeek Random FP8 Windows FlashAttention C++ transformers BeautifulSoup GPT4 CUDA 第一性原理 Translation QWEN FastAPI SAM BF16 Freesound GGML SPIE Disk git-lfs Hilton Food Web SQLite NLP Logo VGG-16 LeetCode UNIX Ubuntu Review 域名 Website Data 图标 Git Claude 关于博主 Docker llama.cpp v0.dev OCR AI Crawler FP64 公式 NLTK torchinfo Quantization 多线程 tqdm LLAMA Domain Search Sklearn mmap LoRA Template printf LLM HaggingFace Tiktoken Django Rebuttal Pickle CAM Nginx Statistics Qwen2 Google Numpy FP32 Zip Diagram Heatmap 论文 logger Pandas CC Miniforge WebCrawler VSCode 财报 Bipartite Image2Text 论文速读 Pillow Transformers PIP Bitcoin RGB 强化学习 IndexTTS2 COCO VPN SVR Interview scipy Permission News icon Input 版权 Quantize Baidu EXCEL JSON 飞书 API LaTeX Firewall Dataset 继承 Augmentation Streamlit Attention 音频 Conda Qwen 净利润 Bert Linux Tensor CLAP Magnet CEIR hf Pytorch 递归学习法 PyCharm 多进程 uwsgi Plotly Jupyter Clash Plate git RAR Datetime Mixtral DeepStream Bin Knowledge Land 搞笑 CV OpenAI Proxy 顶会 uWSGI 阿里云 Algorithm NameSilo TensorFlow InvalidArgumentError Math PDF Shortcut MD5 diffusers 图形思考法 TSV OpenCV XGBoost Paper PDB TTS GoogLeNet GIT Tracking Use Jetson Hotel Vmess Video Excel Password 报税 Llama
站点统计

本站现有博文328篇,共被浏览843682

本站已经建立2549天!

热门文章
文章归档
回到顶部