EADST

Code for SPIE paper - CEIR

CEIR

This project is for the SPIE paper - Novel Receipt Recognition with Deep Learning Algorithms. In this paper, we propose an end-to-end novel receipt recognition system for capturing effective information from receipts (CEIR).

CEIR code and results have been made available at: CEIR code

CEIR system demo is available at: CEIR Demo

The CEIR has three parts: preprocess, detection, recognition.

Introduction

In the preprocessing method, by converting the image to gray scale and obtaining the gradient with the Sobel operator, the outline of the receipt area is decided by morphological transformations with the elliptic kernel.

In text detection, the modified connectionist text proposal network to execute text detection. The pytorch implementation of detection is based on CTPN.

In text recognition, the convolutional recurrent neural network with the connectionist temporal classification with maximum entropy regularization as a loss function to update the weights in networks and extract the characters from receipt. The pytorch implementation of recognition is based on CRNN and ENESCTC.

We validate our system with the scanned receipts optical character recognition and information extraction (SROIE) database.

Dependency

Python 3.6.3 1. torch==1.4 2. torchvision 3. opencv-python 4. lmdb

Prediction

  1. Download pre-trained model from Google Drive and put the file under ./detection/output/ folder.

  2. Change the image name to demo.jpg in the CEIR folder.

  3. Run python ceir_crop.py for stage 1.
  4. Run python ceir_detect.py for stage 2.
  5. Run python ceir_recognize.py for stage 3.

  6. The result will be saved in ./result/.

Training

  1. Put dataset in ./dataset/train/image and ./dataset/train/label.

  2. Preprocess parameters can be changed in ./preprocess/crop.py.

  3. In the detection part, the ./detection/config.py is used for configuring. After that, run python train.py in the detection folder.

  4. In recognition, you need to change trainroot and other parameters in train.sh, then run sh train.sh to train.

相关标签
About Me
XD
Goals determine what you are going to be.
Category
标签云
BTC GPT4 CUDA Permission torchinfo 证件照 Vmess hf Hotel Heatmap 云服务器 Image2Text CAM SAM FP64 Land TTS 阿里云 Color Agent FP16 ChatGPT Django PIP TensorFlow ResNet-50 Anaconda Food Datetime FastAPI 签证 Cloudreve OpenCV Template JSON NLTK BF16 tqdm Card WAN CLAP Gemma Markdown icon Pickle v0.dev PDF Plotly Linux News IndexTTS2 音频 Search Breakpoint Qwen2.5 GIT LLAMA Magnet Github ModelScope Web git-lfs Attention printf 图形思考法 VPN CV Sklearn 强化学习 Hilton Interview 图标 Michelin Random Freesound uwsgi Google LeetCode Statistics Claude WebCrawler NameSilo SPIE Domain AI InvalidArgumentError Use BeautifulSoup 腾讯云 Proxy EXCEL 公式 Git Pillow Paper 财报 Bert CC 继承 VGG-16 飞书 Knowledge Bin YOLO NLP FlashAttention Numpy Qwen Transformers 搞笑 关于博主 PyCharm GPTQ FP32 Ubuntu XML tar Video Dataset Plate Streamlit Quantize Crawler 算法题 QWEN FP8 UNIX v2ray Input HuggingFace diffusers DeepSeek transformers Vim Miniforge LLM Python LoRA uWSGI Translation llama.cpp Quantization Pandas VSCode Logo RGB Website COCO Bitcoin Conda 报税 LaTeX 多线程 scipy Diagram 递归学习法 Animate Distillation DeepStream GGML TSV Tracking SVR Base64 PyTorch Bipartite Password Disk API Nginx 域名 SQLite CEIR OpenAI Math Pytorch TensorRT Ptyhon Windows Algorithm OCR git Clash Mixtral Hungarian CSV C++ CTC Firewall Tiktoken 多进程 版权 Shortcut Jupyter Review Excel ONNX mmap HaggingFace Augmentation GoogLeNet Jetson Zip RAR XGBoost PDB Tensor Baidu Paddle Qwen2 净利润 Safetensors Docker MD5 Data SQL logger 第一性原理 UI Llama 顶会
站点统计

本站现有博文322篇,共被浏览792259

本站已经建立2488天!

热门文章
文章归档
回到顶部