EADST

Code for SPIE paper - CEIR

CEIR

This project is for the SPIE paper - Novel Receipt Recognition with Deep Learning Algorithms. In this paper, we propose an end-to-end novel receipt recognition system for capturing effective information from receipts (CEIR).

CEIR code and results have been made available at: CEIR code

CEIR system demo is available at: CEIR Demo

The CEIR has three parts: preprocess, detection, recognition.

Introduction

In the preprocessing method, by converting the image to gray scale and obtaining the gradient with the Sobel operator, the outline of the receipt area is decided by morphological transformations with the elliptic kernel.

In text detection, the modified connectionist text proposal network to execute text detection. The pytorch implementation of detection is based on CTPN.

In text recognition, the convolutional recurrent neural network with the connectionist temporal classification with maximum entropy regularization as a loss function to update the weights in networks and extract the characters from receipt. The pytorch implementation of recognition is based on CRNN and ENESCTC.

We validate our system with the scanned receipts optical character recognition and information extraction (SROIE) database.

Dependency

Python 3.6.3 1. torch==1.4 2. torchvision 3. opencv-python 4. lmdb

Prediction

  1. Download pre-trained model from Google Drive and put the file under ./detection/output/ folder.

  2. Change the image name to demo.jpg in the CEIR folder.

  3. Run python ceir_crop.py for stage 1.
  4. Run python ceir_detect.py for stage 2.
  5. Run python ceir_recognize.py for stage 3.

  6. The result will be saved in ./result/.

Training

  1. Put dataset in ./dataset/train/image and ./dataset/train/label.

  2. Preprocess parameters can be changed in ./preprocess/crop.py.

  3. In the detection part, the ./detection/config.py is used for configuring. After that, run python train.py in the detection folder.

  4. In recognition, you need to change trainroot and other parameters in train.sh, then run sh train.sh to train.

相关标签
About Me
XD
Goals determine what you are going to be.
Category
标签云
Qwen2 WebCrawler BTC Google FP16 v0.dev Sklearn Baidu 继承 Hotel 图形思考法 CEIR Pickle NLP News AI XML GPTQ 顶会 Github FlashAttention DeepStream Bitcoin SQL torchinfo InvalidArgumentError GoogLeNet OpenCV FP32 LLM Claude Bin 域名 算法题 OpenAI Paper 腾讯云 净利润 强化学习 uwsgi XGBoost Diagram Tiktoken 报税 CAM SQLite CUDA Disk Python Bipartite Search C++ 公式 多线程 Attention Base64 音频 Linux CLAP PDB PDF printf API 飞书 SAM Heatmap COCO TSV Food Git Video Animate Distillation Random BeautifulSoup Color Plotly LoRA EXCEL CSV Gemma PyTorch Algorithm Land WAN 搞笑 hf Tracking QWEN Mixtral Website Knowledge Password Llama Django Agent Streamlit NLTK Plate YOLO 签证 Miniforge FastAPI 阿里云 Freesound Pandas Card OCR 多进程 VSCode Template git transformers Dataset Datetime Firewall Data GPT4 Numpy Docker Qwen NameSilo Crawler Math Hungarian Jetson SVR 关于博主 Conda Paddle JSON Domain git-lfs Windows FP8 TTS VGG-16 IndexTTS2 Statistics Quantize Tensor Markdown 版权 Transformers Cloudreve Vmess LeetCode mmap Ptyhon scipy HaggingFace ONNX 递归学习法 SPIE Magnet TensorFlow ChatGPT FP64 tqdm ResNet-50 RAR UNIX LLAMA Qwen2.5 LaTeX Jupyter 财报 GGML Input llama.cpp Vim Review uWSGI 云服务器 Hilton Use Logo UI Anaconda v2ray Pytorch 第一性原理 Nginx Augmentation PyCharm Safetensors Translation Image2Text HuggingFace Pillow logger CV BF16 ModelScope CTC MD5 Bert PIP Excel Interview Web Permission GIT tar Proxy Shortcut Breakpoint diffusers CC 证件照 Quantization TensorRT Clash VPN RGB DeepSeek Michelin Zip Ubuntu
站点统计

本站现有博文321篇,共被浏览772961

本站已经建立2461天!

热门文章
文章归档
回到顶部