EADST

SPIE 2020 Papers

Dong Xie and Colleen P. Bailey "Novel receipt recognition with deep learning algorithms", Proc. SPIE 11400, Pattern Recognition and Tracking XXXI, 114000B (22 April 2020); https://doi.org/10.1117/12.2558206

Abstract

We propose a new recognition method to extract effective information from receipts by integrating deep learning algorithms from computer vision and natural language processing. Our method consists of three parts. The first part finds the effective area of the receipt: by removing noise and extracting the gradient of the receipt image, we determine a threshold to crop and reshape the useful receipt area. The second part detects text in the receipt image: we modify and deploy the connectionist text proposal network (CTPN) text detection algorithm to locate the text regions in the receipt. In the third part, we use connectionist temporal classification (CTC) with maximum entropy regularization as the loss function for training a convolutional recurrent neural network (CRNN) that recognizes the detected text regions, converting the receipt from an image into text. With this method, the effective information of a receipt can be integrated and utilized. We train and test our system on the data set published by scanned receipts optical character recognition and information extraction (SROIE). The results show that our recognition system identifies receipt information quickly and accurately.
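The first part (denoise, take the gradient, threshold, crop) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function name `crop_receipt`, the box-filter denoising, and the `blur` and `frac` parameters are all assumptions chosen for illustration, since the abstract does not specify the filter or how the threshold is picked.

```python
import numpy as np

def crop_receipt(gray, blur=3, frac=0.2):
    """Crop a 2-D grayscale image to the region with strong gradients.

    `blur` (odd box-filter size) and `frac` (fraction of the peak
    projection used as threshold) are hypothetical parameters,
    not values taken from the paper.
    """
    img = gray.astype(np.float64)
    h, w = img.shape

    # Step 1: remove noise with a simple box (mean) filter,
    # built by summing shifted copies of an edge-padded image.
    pad = blur // 2
    padded = np.pad(img, pad, mode="edge")
    smooth = np.zeros_like(img)
    for dy in range(blur):
        for dx in range(blur):
            smooth += padded[dy:dy + h, dx:dx + w]
    smooth /= blur * blur

    # Step 2: gradient magnitude of the smoothed image.
    gy, gx = np.gradient(smooth)
    mag = np.hypot(gx, gy)

    # Step 3: project the gradient onto rows and columns and keep
    # the span where the projection exceeds the threshold.
    rows = mag.sum(axis=1)
    cols = mag.sum(axis=0)
    row_idx = np.flatnonzero(rows > frac * rows.max())
    col_idx = np.flatnonzero(cols > frac * cols.max())
    if row_idx.size == 0 or col_idx.size == 0:
        return gray  # no gradient at all: nothing to crop

    top, bottom = row_idx[0], row_idx[-1] + 1
    left, right = col_idx[0], col_idx[-1] + 1
    return gray[top:bottom, left:right]


# Usage: a synthetic "receipt" (bright rectangle on a dark background).
image = np.zeros((100, 80), dtype=np.uint8)
image[20:70, 10:60] = 255
cropped = crop_receipt(image)
```

On the synthetic image, the crop keeps the whole bright rectangle plus a small margin introduced by the smoothing. A real pipeline would likely use a proper denoising filter and a perspective correction for the "reshape" step, which this sketch omits.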


Arthur C. Depoian, Lorenzo E Jaques, Dong Xie, Colleen P. Bailey, and Parthasarathy Guturu "Computer vision learning techniques for sports video analytics: removing overlays", Proc. SPIE 11395, Big Data II: Learning, Analytics, and Applications, 113950M (24 April 2020); https://doi.org/10.1117/12.2560888

Abstract

Big data has been driving professional sports over the last decade. In our data-driven world, it becomes important to find additional methods for the analysis of both games and athletes. There is an abundance of videos taken in professional and amateur sports. Player datasets can be created utilizing computer vision techniques. We propose a novel approach by creating an autonomous masking algorithm that can receive live or previously recorded video footage of sporting events. This procedure can identify graphical overlays to optimize further processing by tracking and text recognition algorithms for real-time analysis.

