
SPIE 2020 Papers

Dong Xie and Colleen P. Bailey "Novel receipt recognition with deep learning algorithms", Proc. SPIE 11400, Pattern Recognition and Tracking XXXI, 114000B (22 April 2020); https://doi.org/10.1117/12.2558206

Abstract

We propose a new recognition method that extracts useful information from receipts by integrating deep learning algorithms from computer vision and natural language processing. Our method consists of three parts. The first part locates the effective receipt area: after removing noise and extracting the gradient of the receipt image, we determine a threshold to crop and reshape the useful receipt area. The second part detects text in the receipt image: we modify and deploy the connectionist text proposal network (CTPN) text detection algorithm to locate the text regions in the receipt. In the third part, we adopt connectionist temporal classification (CTC) with maximum entropy regularization as the loss function for training the convolutional recurrent neural network (CRNN) that recognizes the detected text regions, converting the receipt from an image into text. With this method, the useful information in a receipt can be integrated and utilized. We train and test our system on the data set published for scanned receipts optical character recognition and information extraction (SROIE). The results show that our recognition system identifies receipt information quickly and accurately.
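
The recognition step in the third part can be illustrated with a small sketch. A network trained with a connectionist temporal classification loss emits one score vector per time step over the character set plus a "blank" symbol, and a common way to turn those scores into text is best-path (greedy) decoding: take the highest-scoring label at each step, merge consecutive repeats, and drop blanks. The abstract does not say which decoder the authors use, so the function below, its name `ctc_greedy_decode`, and the choice of index 0 for the blank are assumptions made for illustration only.

```python
from typing import List, Sequence


def ctc_greedy_decode(
    scores: Sequence[Sequence[float]],
    labels: Sequence[str],
    blank: int = 0,  # assumed position of the blank symbol in `labels`
) -> str:
    """Best-path CTC decoding: argmax per step, merge repeats, drop blanks."""
    chars: List[str] = []
    previous = blank
    for step in scores:
        # Index of the highest-scoring label at this time step.
        best = max(range(len(step)), key=step.__getitem__)
        # Emit a character only when it is not blank and not a repeat
        # of the label chosen at the previous time step.
        if best != blank and best != previous:
            chars.append(labels[best])
        previous = best
    return "".join(chars)


def one_hot(path: Sequence[int], num_labels: int) -> List[List[float]]:
    """Hypothetical helper: build toy per-step scores from a label path."""
    return [[1.0 if i == k else 0.0 for i in range(num_labels)] for k in path]


if __name__ == "__main__":
    toy_labels = ["-", "A", "B"]  # "-" stands in for the blank symbol
    toy_scores = one_hot([1, 1, 0, 1, 2, 2], len(toy_labels))
    print(ctc_greedy_decode(toy_scores, toy_labels))
```

In the toy path, the blank between the two runs of `A` is what allows a doubled letter to survive the merge step; without it, the repeats would collapse into a single character.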


Arthur C. Depoian, Lorenzo E Jaques, Dong Xie, Colleen P. Bailey, and Parthasarathy Guturu "Computer vision learning techniques for sports video analytics: removing overlays", Proc. SPIE 11395, Big Data II: Learning, Analytics, and Applications, 113950M (24 April 2020); https://doi.org/10.1117/12.2560888

Abstract

Big data has been driving professional sports over the last decade. In our data-driven world, it becomes important to find additional methods for the analysis of both games and athletes. There is an abundance of videos taken in professional and amateur sports. Player datasets can be created utilizing computer vision techniques. We propose a novel approach by creating an autonomous masking algorithm that can receive live or previously recorded video footage of sporting events. This procedure can identify graphical overlays to optimize further processing by tracking and text recognition algorithms for real-time analysis.

