
Code for SPIE paper - CEIR

CEIR

This project accompanies the SPIE paper "Novel Receipt Recognition with Deep Learning Algorithms". In the paper, we propose a novel end-to-end receipt recognition system for capturing effective information from receipts (CEIR).

CEIR code and results have been made available at: CEIR code

CEIR system demo is available at: CEIR Demo

CEIR has three parts: preprocessing, detection, and recognition.

Introduction

In preprocessing, the image is converted to grayscale and its gradient is computed with the Sobel operator. The outline of the receipt area is then determined by morphological transformations with an elliptical kernel.

In text detection, a modified connectionist text proposal network (CTPN) locates the text regions. The PyTorch implementation of detection is based on CTPN.

In text recognition, a convolutional recurrent neural network (CRNN) extracts the characters from the receipt. Its weights are updated using a connectionist temporal classification (CTC) loss with maximum entropy regularization. The PyTorch implementation of recognition is based on CRNN and ENESCTC.
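To illustrate how a CTC-trained recognizer turns per-frame predictions into text, here is a minimal sketch of greedy CTC decoding: repeated labels are collapsed and blanks are dropped. It is not taken from the CEIR code; the function name `ctc_greedy_decode`, the blank index 0, and the alphabet layout (label `i` maps to `alphabet[i - 1]`) are assumptions for the example.

```python
def ctc_greedy_decode(frame_labels, alphabet, blank=0):
    """Collapse repeated labels and remove blanks (greedy CTC decoding).

    frame_labels: best class index per time step (e.g. argmax of the output).
    alphabet: string where label i (i >= 1) maps to alphabet[i - 1] (assumed).
    blank: index reserved for the CTC blank symbol (assumed to be 0).
    """
    chars = []
    previous = blank
    for label in frame_labels:
        # Emit a character only when the label is not blank and differs
        # from the previous frame; a blank between repeats keeps both.
        if label != blank and label != previous:
            chars.append(alphabet[label - 1])
        previous = label
    return "".join(chars)
```

For example, with the alphabet "ab", the frame sequence `[0, 1, 1, 0, 1, 2, 2, 0]` decodes to "aab": the two adjacent 1s collapse into one "a", while the blank between them and the next 1 preserves the second "a".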

We validate our system on the Scanned Receipts OCR and Information Extraction (SROIE) database.

Dependency

Python 3.6.3

  1. torch==1.4
  2. torchvision
  3. opencv-python
  4. lmdb

Prediction

  1. Download the pre-trained model from Google Drive and put the file in the ./detection/output/ folder.

  2. Rename your input image to demo.jpg in the CEIR folder.

  3. Run python ceir_crop.py for stage 1.
  4. Run python ceir_detect.py for stage 2.
  5. Run python ceir_recognize.py for stage 3.

  6. The result will be saved in ./result/.
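If you prefer to run the three stages with one command, a small wrapper along these lines may help. It is only a sketch: the helper names `build_commands` and `run_pipeline` are hypothetical, and it assumes the three scripts sit in the working directory and need no arguments, as the steps above suggest.

```python
import subprocess
import sys

# Stage scripts in the order given in the steps above.
STAGES = ["ceir_crop.py", "ceir_detect.py", "ceir_recognize.py"]


def build_commands(python=sys.executable):
    """Return one command (as an argument list) per stage, in order."""
    return [[python, script] for script in STAGES]


def run_pipeline(cwd="."):
    """Run each stage in turn, stopping at the first failure."""
    for command in build_commands():
        # check=True raises CalledProcessError if a stage exits non-zero,
        # so later stages never run on missing intermediate results.
        subprocess.run(command, cwd=cwd, check=True)
```

Calling `run_pipeline()` from the CEIR folder would run the stages in sequence; the output location is still whatever the scripts themselves write to.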

Training

  1. Put the dataset in ./dataset/train/image and ./dataset/train/label.

  2. Preprocess parameters can be changed in ./preprocess/crop.py.

  3. For detection, set the configuration in ./detection/config.py, then run python train.py in the detection folder.

  4. For recognition, change trainroot and other parameters in train.sh, then run sh train.sh to train.
