EADST

Code for SPIE paper - CEIR

CEIR

This project is for the SPIE paper - Novel Receipt Recognition with Deep Learning Algorithms. In this paper, we propose an end-to-end novel receipt recognition system for capturing effective information from receipts (CEIR).

CEIR code and results have been made available at: CEIR code

CEIR system demo is available at: CEIR Demo

The CEIR has three parts: preprocess, detection, recognition.

Introduction

In the preprocessing method, by converting the image to gray scale and obtaining the gradient with the Sobel operator, the outline of the receipt area is decided by morphological transformations with the elliptic kernel.

In text detection, the modified connectionist text proposal network to execute text detection. The pytorch implementation of detection is based on CTPN.

In text recognition, the convolutional recurrent neural network with the connectionist temporal classification with maximum entropy regularization as a loss function to update the weights in networks and extract the characters from receipt. The pytorch implementation of recognition is based on CRNN and ENESCTC.

We validate our system with the scanned receipts optical character recognition and information extraction (SROIE) database.

Dependency

Python 3.6.3 1. torch==1.4 2. torchvision 3. opencv-python 4. lmdb

Prediction

  1. Download pre-trained model from Google Drive and put the file under ./detection/output/ folder.

  2. Change the image name to demo.jpg in the CEIR folder.

  3. Run python ceir_crop.py for stage 1.
  4. Run python ceir_detect.py for stage 2.
  5. Run python ceir_recognize.py for stage 3.

  6. The result will be saved in ./result/.

Training

  1. Put dataset in ./dataset/train/image and ./dataset/train/label.

  2. Preprocess parameters can be changed in ./preprocess/crop.py.

  3. In the detection part, the ./detection/config.py is used for configuring. After that, run python train.py in the detection folder.

  4. In recognition, you need to change trainroot and other parameters in train.sh, then run sh train.sh to train.

相关标签
About Me
XD
Goals determine what you are going to be.
Category
标签云
HuggingFace OCR 第一性原理 Logo Tensor Docker 图形思考法 uWSGI CUDA COCO Miniforge Bert Paper Augmentation 递归学习法 财报 Translation 版权 UNIX ms-swift Google Ptyhon scipy WebCrawler Gemma BeautifulSoup BTC UI LeetCode 云服务器 PyCharm VSCode BF16 腾讯云 Hotel CEIR 飞书 Quantize GoogLeNet TTS XML Search 强化学习 Bitcoin Zip Review 签证 证件照 FP32 Python Jetson icon Bin Template Input Paddle 音频 GPT4 Vmess CV SPIE 阿里云 FP64 Crawler ONNX Web YOLO LoRA Pickle Animate tar Use TSV 域名 Llama InvalidArgumentError JSON PIP 论文 Mixtral CC Random Rebuttal 论文速读 ResNet-50 Hilton CSV Jupyter printf uwsgi News 顶会 Freesound Qwen Claude tqdm 关于博主 Shortcut Attention Password Land Pillow Excel transformers Git Dataset FlashAttention NLTK Tracking logger VGG-16 Sklearn 多进程 PDB 图标 torchinfo SQL TensorRT Baidu LLM 多线程 Firewall DeepSeek Pytorch Nginx Qwen2.5 GPTQ Bipartite 公式 CTC Clash Windows 继承 Interview Quantization 净利润 Pandas Tiktoken git v2ray Anaconda LaTeX FP16 OpenAI Permission llama.cpp NLP Domain Linux OpenCV Color 搞笑 diffusers Plotly Vim RAR Magnet FastAPI Safetensors Datetime XGBoost GGML Breakpoint PyTorch Statistics Proxy Disk 报税 Streamlit TensorFlow ChatGPT Diagram Website Numpy Plate Cloudreve Video mmap GIT VPN Github Hungarian Distillation AI Knowledge Algorithm API QWEN PDF DeepStream Markdown FP8 算法题 Agent ModelScope Michelin IndexTTS2 git-lfs WAN C++ Qwen2 Ubuntu Base64 NameSilo Data Food Conda CAM Card v0.dev Transformers SVR Heatmap Math MD5 SQLite Image2Text RGB HaggingFace Django hf LLAMA CLAP EXCEL SAM
站点统计

本站现有博文330篇,共被浏览860840

本站已经建立2569天!

热门文章
文章归档
回到顶部