EADST

Code for SPIE paper - CEIR

CEIR

This project is for the SPIE paper - Novel Receipt Recognition with Deep Learning Algorithms. In this paper, we propose an end-to-end novel receipt recognition system for capturing effective information from receipts (CEIR).

CEIR code and results have been made available at: CEIR code

CEIR system demo is available at: CEIR Demo

The CEIR has three parts: preprocess, detection, recognition.

Introduction

In the preprocessing method, by converting the image to gray scale and obtaining the gradient with the Sobel operator, the outline of the receipt area is decided by morphological transformations with the elliptic kernel.

In text detection, the modified connectionist text proposal network to execute text detection. The pytorch implementation of detection is based on CTPN.

In text recognition, the convolutional recurrent neural network with the connectionist temporal classification with maximum entropy regularization as a loss function to update the weights in networks and extract the characters from receipt. The pytorch implementation of recognition is based on CRNN and ENESCTC.

We validate our system with the scanned receipts optical character recognition and information extraction (SROIE) database.

Dependency

Python 3.6.3 1. torch==1.4 2. torchvision 3. opencv-python 4. lmdb

Prediction

  1. Download pre-trained model from Google Drive and put the file under ./detection/output/ folder.

  2. Change the image name to demo.jpg in the CEIR folder.

  3. Run python ceir_crop.py for stage 1.
  4. Run python ceir_detect.py for stage 2.
  5. Run python ceir_recognize.py for stage 3.

  6. The result will be saved in ./result/.

Training

  1. Put dataset in ./dataset/train/image and ./dataset/train/label.

  2. Preprocess parameters can be changed in ./preprocess/crop.py.

  3. In the detection part, the ./detection/config.py is used for configuring. After that, run python train.py in the detection folder.

  4. In recognition, you need to change trainroot and other parameters in train.sh, then run sh train.sh to train.

相关标签
About Me
XD
Goals determine what you are going to be.
Category
标签云
Search Pickle Password Use 算法题 Baidu Conda Color 飞书 CEIR printf VGG-16 Logo Random TensorFlow tqdm Pillow OpenAI Heatmap Safetensors GoogLeNet Animate logger HuggingFace Docker 关于博主 Image2Text CV Tiktoken Food Card Freesound Zip Git 强化学习 图形思考法 SPIE API Template 域名 继承 Land Paddle ModelScope LLM 云服务器 VPN 递归学习法 FP32 SAM Domain 第一性原理 TTS 阿里云 ChatGPT Crawler UI Qwen2 Knowledge GGML Gemma CSV Qwen2.5 CLAP Pandas Distillation 多进程 v0.dev Statistics uwsgi Magnet Agent Breakpoint Dataset GIT SQL Datetime tar Base64 CC MD5 DeepSeek NLTK FP16 Disk Qwen Bert Website 顶会 uWSGI Miniforge GPTQ ResNet-50 torchinfo diffusers Plate SQLite Jetson PDF Windows Proxy Nginx PIP Python HaggingFace Github 证件照 RGB Cloudreve TensorRT Tensor InvalidArgumentError CTC FP8 Anaconda transformers Vmess GPT4 Quantize Quantization hf CAM RAR COCO git-lfs BF16 Jupyter News YOLO 报税 Attention TSV EXCEL 签证 Pytorch C++ Sklearn Review Hilton Hungarian NLP Ptyhon Bipartite WAN llama.cpp Interview Claude FP64 Google Clash Markdown PyCharm BTC Mixtral 净利润 搞笑 FastAPI Paper Plotly Algorithm Llama VSCode NameSilo Transformers LaTeX DeepStream Michelin 音频 Django Translation Math scipy Firewall XGBoost v2ray QWEN WebCrawler mmap BeautifulSoup Numpy Bin Hotel Augmentation 腾讯云 Excel Streamlit 财报 Shortcut ONNX 版权 Video JSON 公式 Bitcoin LoRA 多线程 Web git LeetCode FlashAttention PDB PyTorch OpenCV Permission AI XML Vim IndexTTS2 CUDA Data UNIX SVR Diagram Tracking Ubuntu LLAMA OCR Linux Input
站点统计

本站现有博文321篇,共被浏览766914

本站已经建立2448天!

热门文章
文章归档
回到顶部