EADST

Code for SPIE paper - CEIR

CEIR

This project accompanies the SPIE paper "Novel Receipt Recognition with Deep Learning Algorithms". In the paper, we propose a novel end-to-end receipt recognition system for capturing effective information from receipts (CEIR).

CEIR code and results have been made available at: CEIR code

CEIR system demo is available at: CEIR Demo

CEIR has three stages: preprocessing, text detection, and text recognition.

Introduction

In preprocessing, the image is converted to grayscale and its gradient is computed with the Sobel operator. The outline of the receipt area is then determined by morphological transformations with an elliptic kernel.

In text detection, a modified connectionist text proposal network (CTPN) locates the text in the receipt. The PyTorch implementation of detection is based on CTPN.

In text recognition, a convolutional recurrent neural network (CRNN) extracts the characters from the receipt. Its weights are updated using connectionist temporal classification (CTC) with maximum entropy regularization as the loss function. The PyTorch implementation of recognition is based on CRNN and ENESCTC.
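To illustrate how a CTC-trained recognizer turns per-frame predictions into text, here is a minimal greedy decoding sketch. It is not taken from the CEIR code: the function name ctc_greedy_decode, the blank index 0, and the convention that label k maps to alphabet[k - 1] are assumptions, and the actual repository may decode differently.

```python
def ctc_greedy_decode(frame_labels, alphabet, blank=0):
    """Collapse repeated labels, then drop blanks, to get the final string.

    frame_labels: the most likely label index for each time step.
    alphabet: string of characters; label k (k >= 1) maps to alphabet[k - 1].
    """
    chars = []
    previous = blank
    for label in frame_labels:
        # Emit a character only when the label is not blank and differs from
        # the previous frame. A blank between two equal labels separates them,
        # which is how CTC represents doubled characters.
        if label != blank and label != previous:
            chars.append(alphabet[label - 1])
        previous = label
    return "".join(chars)
```

For example, with alphabet "0123456789" the frame sequence [0, 2, 2, 0, 2, 3, 3, 0] decodes to "112": the two runs of label 2 are kept apart by the blank between them.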

We validate our system on the scanned receipts optical character recognition and information extraction (SROIE) dataset.

Dependency

Python 3.6.3

  1. torch==1.4
  2. torchvision
  3. opencv-python
  4. lmdb
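Before running the stages, it can help to confirm that the packages above are importable. The following is a small sketch using only the standard library; the helper name missing_modules is hypothetical. Note that the opencv-python package is imported under the module name cv2.

```python
import importlib.util

# Import names for the packages listed above (opencv-python imports as cv2).
REQUIRED_MODULES = ["torch", "torchvision", "cv2", "lmdb"]


def missing_modules(names=REQUIRED_MODULES):
    """Return the top-level module names that cannot be found."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_modules()
    if missing:
        print("Missing modules:", ", ".join(missing))
    else:
        print("All required modules were found.")
```

This only checks that the modules can be located; it does not verify versions such as torch==1.4.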

Prediction

  1. Download the pre-trained model from Google Drive and put the file in the ./detection/output/ folder.

  2. Rename your input image to demo.jpg and place it in the CEIR folder.

  3. Run python ceir_crop.py for stage 1.
  4. Run python ceir_detect.py for stage 2.
  5. Run python ceir_recognize.py for stage 3.

  6. The result will be saved in ./result/.
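The three stage scripts from the steps above can also be run in sequence from one small driver. This is a convenience sketch, not part of the repository: the function name run_stages and its runner parameter are hypothetical, and it assumes the script is started from the CEIR folder with the model and demo.jpg already in place.

```python
import subprocess
import sys

# Stage scripts named in the steps above, in execution order.
STAGE_SCRIPTS = ["ceir_crop.py", "ceir_detect.py", "ceir_recognize.py"]


def run_stages(scripts=STAGE_SCRIPTS, runner=subprocess.run):
    """Run each stage script in order with the current Python interpreter."""
    for script in scripts:
        # check=True makes subprocess.run raise CalledProcessError when a
        # stage exits with a non-zero status, so later stages are skipped.
        runner([sys.executable, script], check=True)


if __name__ == "__main__":
    run_stages()
```

Stopping at the first failing stage avoids running detection or recognition on missing intermediate output.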

Training

  1. Put the dataset in ./dataset/train/image and ./dataset/train/label.

  2. Preprocessing parameters can be changed in ./preprocess/crop.py.

  3. For detection, set the options in ./detection/config.py, then run python train.py in the detection folder.

  4. For recognition, change trainroot and the other parameters in train.sh, then run sh train.sh to train.
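Before training, a quick check that every image has a label can save a failed run. The sketch below assumes that an image and its label share the same file name apart from the extension (for example X.jpg and X.txt); that naming rule is an assumption, not something stated here, and the helper name unpaired_samples is hypothetical.

```python
from pathlib import Path


def unpaired_samples(image_dir, label_dir):
    """Return (images without a label, labels without an image) by file stem."""
    image_stems = {p.stem for p in Path(image_dir).iterdir() if p.is_file()}
    label_stems = {p.stem for p in Path(label_dir).iterdir() if p.is_file()}
    return sorted(image_stems - label_stems), sorted(label_stems - image_stems)


if __name__ == "__main__":
    no_label, no_image = unpaired_samples(
        "./dataset/train/image", "./dataset/train/label"
    )
    print("Images without a label:", no_label)
    print("Labels without an image:", no_image)
```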
