EADST

SPIE 2020 Papers

Dong Xie and Colleen P. Bailey "Novel receipt recognition with deep learning algorithms", Proc. SPIE 11400, Pattern Recognition and Tracking XXXI, 114000B (22 April 2020); https://doi.org/10.1117/12.2558206

Abstract

We propose a new recognition method to extract effective information from receipts by integrating deep learning algorithms from computer vision and natural language processing. Our method consists of three parts. The first part provides effective areas for receipt detection. By removing noise and extracting the gradient of the receipt image, we determine the threshold to crop and reshape the useful receipt area. Detecting text from a receipt image is the second part, we modify and deploy the text detection algorithm connectionist text proposal network (CTPN) to locate the text region in the receipt. In the third part, we import the connectionist temporal classification with maximum entropy regularization as the loss function for updating the convolutional recurrent neural networks (CRNN) to recognize the text detection area, which converts the receipt from an image into the text. Based on our method, the effective information of a receipt can be integrated and utilized. We train and test our system using the data set published by scanned receipts optical character recognition and information extraction (SROIE). The results illustrate that our recognition system is able to identify receipt information quickly and accurately.

Paper Download

Arthur C. Depoian, Lorenzo E Jaques, Dong Xie, Colleen P. Bailey, and Parthasarathy Guturu "Computer vision learning techniques for sports video analytics: removing overlays", Proc. SPIE 11395, Big Data II: Learning, Analytics, and Applications, 113950M (24 April 2020); https://doi.org/10.1117/12.2560888

Abstract

Big data has been driving professional sports over the last decade. In our data-driven world, it becomes important to find additional methods for the analysis of both games and athletes. There is an abundance of videos taken in professional and amateur sports. Player datasets can be created utilizing computer vision techniques. We propose a novel approach by creating an autonomous masking algorithm that can receive live or previously recorded video footage of sporting events. This procedure can identify graphical overlays to optimize further processing by tracking and text recognition algorithms for real-time analysis.

Paper Download

相关标签
About Me
XD
Goals determine what you are going to be.
Category
标签云
Claude Google PDF Plate VPN PIP Tiktoken Python InvalidArgumentError Sklearn IndexTTS2 Magnet Firewall Base64 v2ray SAM ChatGPT TSV Crawler BeautifulSoup GoogLeNet BTC GIT Zip LLM 搞笑 Proxy Numpy Safetensors Vim Hungarian NameSilo git SPIE 图标 LaTeX Hilton 签证 证件照 XML Shortcut CEIR Permission Heatmap Image2Text Linux GPT4 tqdm CC Password CLAP LeetCode diffusers 多线程 递归学习法 TTS UNIX Distillation scipy CSV WAN EXCEL git-lfs Docker Diagram FP64 Input Anaconda Miniforge Template ONNX BF16 HaggingFace Llama CUDA Jetson hf PDB FlashAttention Card Tensor DeepStream Augmentation Statistics v0.dev VGG-16 图形思考法 C++ SQLite Bin Quantize Knowledge Dataset Bitcoin logger 第一性原理 Use Plotly FP32 OpenCV Data 强化学习 TensorFlow Datetime Baidu tar mmap QWEN Attention Land TensorRT llama.cpp Pillow Git Jupyter JSON 财报 FP8 ResNet-50 Review Gemma Pickle 关于博主 FastAPI Search Transformers Github VSCode 飞书 Interview Quantization SQL YOLO Freesound Website Pandas Animate 腾讯云 算法题 ModelScope Ptyhon Web 多进程 AI Food uwsgi News Math 报税 Breakpoint 云服务器 Markdown Django Cloudreve Agent transformers RGB DeepSeek Paddle CTC Translation Algorithm 继承 HuggingFace OpenAI Bert Disk Qwen OCR Windows Paper 顶会 Logo Tracking 域名 Conda Pytorch Clash Domain Mixtral Vmess printf MD5 RAR Video FP16 SVR Excel 音频 阿里云 LoRA COCO Hotel LLAMA icon NLTK Nginx GGML 版权 torchinfo NLP Michelin Qwen2.5 Ubuntu Qwen2 PyTorch 公式 Random CV UI uWSGI Streamlit WebCrawler CAM Color API 净利润 XGBoost Bipartite PyCharm GPTQ
站点统计

本站现有博文322篇,共被浏览786050

本站已经建立2480天!

热门文章
文章归档
回到顶部