EADST

Code for SPIE paper - CEIR

CEIR

This project contains the code for the SPIE paper "Novel Receipt Recognition with Deep Learning Algorithms". In the paper, we propose a novel end-to-end receipt recognition system for capturing effective information from receipts (CEIR).

CEIR code and results have been made available at: CEIR code

CEIR system demo is available at: CEIR Demo

CEIR has three stages: preprocessing, text detection, and text recognition.

Introduction

In the preprocessing stage, the image is converted to grayscale and its gradient is computed with the Sobel operator. The outline of the receipt area is then determined by morphological transformations with an elliptic kernel.

In text detection, a modified connectionist text proposal network (CTPN) locates the text regions. The PyTorch implementation of detection is based on CTPN.
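CTPN-style detectors emit many narrow proposals that are afterwards linked into text lines. The sketch below illustrates only that linking step in a simplified, greedy form. The names group_proposals, max_gap, and min_v_overlap are hypothetical, the default thresholds are illustrative assumptions, and the code is not taken from this repository's detection module.

```python
def group_proposals(boxes, max_gap=50, min_v_overlap=0.7):
    """Greedily chain (x1, y1, x2, y2) proposals into text-line boxes."""
    lines = []  # each entry is a mutable [x1, y1, x2, y2] text-line box
    for x1, y1, x2, y2 in sorted(boxes):  # process proposals left to right
        for line in lines:
            gap = x1 - line[2]  # horizontal distance to the line's right edge
            overlap = min(y2, line[3]) - max(y1, line[1])
            shorter = min(y2 - y1, line[3] - line[1])
            if gap <= max_gap and shorter > 0 and overlap / shorter >= min_v_overlap:
                # Extend the line so that it also covers the new proposal.
                line[1] = min(line[1], y1)
                line[2] = max(line[2], x2)
                line[3] = max(line[3], y2)
                break
        else:
            # No existing line is close enough, so start a new one.
            lines.append([x1, y1, x2, y2])
    return [tuple(line) for line in lines]
```

Each returned box is one candidate text line that could be cropped and passed to the recognition stage.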

In text recognition, a convolutional recurrent neural network (CRNN) extracts the characters from the receipt. Its weights are updated using connectionist temporal classification (CTC) with maximum entropy regularization as the loss function. The PyTorch implementation of recognition is based on CRNN and ENESCTC.
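At inference time, a CTC-trained network outputs one label per time step, and these labels have to be collapsed into text. The sketch below shows the standard greedy CTC decoding rule (merge repeats, then drop blanks). The function name ctc_greedy_decode and the convention that index 0 is the blank and index i maps to alphabet[i - 1] are assumptions for illustration, not details confirmed by this project.

```python
def ctc_greedy_decode(frame_labels, alphabet):
    """Collapse per-frame label indices into a string using the greedy CTC rule."""
    blank = 0  # assumption: index 0 is the CTC blank symbol
    chars = []
    previous = blank
    for label in frame_labels:
        # Keep a label only if it is not blank and differs from the previous frame.
        if label != blank and label != previous:
            chars.append(alphabet[label - 1])
        previous = label
    return "".join(chars)


# Frames "-11-155--0" (with "-" as blank) decode to "1150".
print(ctc_greedy_decode([0, 2, 2, 0, 2, 6, 6, 0, 0, 1], "0123456789"))
```

The blank between the two runs of label 2 is what allows a genuinely doubled character to survive the merge step.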

We validate our system on the Scanned Receipts OCR and Information Extraction (SROIE) dataset.

Dependency

Python 3.6.3

  1. torch==1.4
  2. torchvision
  3. opencv-python
  4. lmdb

Prediction

  1. Download the pre-trained model from Google Drive and put the file in the ./detection/output/ folder.

  2. Rename the input image in the CEIR folder to demo.jpg.

  3. Run python ceir_crop.py for stage 1.
  4. Run python ceir_detect.py for stage 2.
  5. Run python ceir_recognize.py for stage 3.

  6. The result will be saved in ./result/.
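The three stage scripts can also be chained from one small Python helper. This is a convenience sketch only: the helper names build_commands and run_pipeline are hypothetical, and it assumes the scripts take no arguments and are run from the CEIR folder, as in the steps above.

```python
import subprocess
import sys

# Stage scripts in the order given in the prediction steps.
STAGES = ["ceir_crop.py", "ceir_detect.py", "ceir_recognize.py"]


def build_commands(python=sys.executable):
    """Return one command (as an argument list) per stage."""
    return [[python, script] for script in STAGES]


def run_pipeline():
    """Run the stages in order, stopping at the first failure."""
    for command in build_commands():
        # check=True raises CalledProcessError if a stage exits with a non-zero status.
        subprocess.run(command, check=True)
```

Calling run_pipeline() from the CEIR folder runs stage 1 to stage 3. Because a failed stage raises an exception, later stages never run on missing intermediate output.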

Training

  1. Put the dataset in ./dataset/train/image and ./dataset/train/label.

  2. Preprocessing parameters can be changed in ./preprocess/crop.py.

  3. For detection, set the training configuration in ./detection/config.py, then run python train.py in the detection folder.

  4. For recognition, change trainroot and the other parameters in train.sh, then run sh train.sh to train.
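Before training, it can help to confirm that every image has a matching label file. The check below is a sketch under one assumption that the source does not state: an image and its label share the same file stem (for example a hypothetical X001.jpg and X001.txt). The function name unpaired_samples is also hypothetical.

```python
from pathlib import Path


def unpaired_samples(train_dir="./dataset/train"):
    """Return (images without labels, labels without images) as sorted lists of file stems."""
    root = Path(train_dir)
    # Assumption: an image and its label file share the same stem.
    images = {p.stem for p in (root / "image").iterdir() if p.is_file()}
    labels = {p.stem for p in (root / "label").iterdir() if p.is_file()}
    return sorted(images - labels), sorted(labels - images)
```

Two empty lists mean that every sample is paired. Any stem that is returned points to a file to fix or remove before training starts.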
