EADST

SPIE 2020 Papers

Dong Xie and Colleen P. Bailey "Novel receipt recognition with deep learning algorithms", Proc. SPIE 11400, Pattern Recognition and Tracking XXXI, 114000B (22 April 2020); https://doi.org/10.1117/12.2558206

Abstract

We propose a new recognition method to extract effective information from receipts by integrating deep learning algorithms from computer vision and natural language processing. Our method consists of three parts. The first part provides effective areas for receipt detection. By removing noise and extracting the gradient of the receipt image, we determine the threshold to crop and reshape the useful receipt area. Detecting text from a receipt image is the second part, we modify and deploy the text detection algorithm connectionist text proposal network (CTPN) to locate the text region in the receipt. In the third part, we import the connectionist temporal classification with maximum entropy regularization as the loss function for updating the convolutional recurrent neural networks (CRNN) to recognize the text detection area, which converts the receipt from an image into the text. Based on our method, the effective information of a receipt can be integrated and utilized. We train and test our system using the data set published by scanned receipts optical character recognition and information extraction (SROIE). The results illustrate that our recognition system is able to identify receipt information quickly and accurately.

Paper Download

Arthur C. Depoian, Lorenzo E Jaques, Dong Xie, Colleen P. Bailey, and Parthasarathy Guturu "Computer vision learning techniques for sports video analytics: removing overlays", Proc. SPIE 11395, Big Data II: Learning, Analytics, and Applications, 113950M (24 April 2020); https://doi.org/10.1117/12.2560888

Abstract

Big data has been driving professional sports over the last decade. In our data-driven world, it becomes important to find additional methods for the analysis of both games and athletes. There is an abundance of videos taken in professional and amateur sports. Player datasets can be created utilizing computer vision techniques. We propose a novel approach by creating an autonomous masking algorithm that can receive live or previously recorded video footage of sporting events. This procedure can identify graphical overlays to optimize further processing by tracking and text recognition algorithms for real-time analysis.

Paper Download

相关标签
About Me
XD
Goals determine what you are going to be.
Category
标签云
CUDA CAM 音频 Sklearn ModelScope Llama UI Mixtral Algorithm FastAPI Rebuttal JSON uWSGI printf 签证 TTS Zip MD5 第一性原理 scipy Review Password PDB Pickle Card git-lfs CSV 关于博主 llama.cpp 强化学习 Food 净利润 Michelin Translation Qwen2.5 GGML BTC WebCrawler OpenCV hf LLAMA Base64 Anaconda VSCode Hilton Github 飞书 DeepStream 腾讯云 Land PDF LLM WAN PIP DeepSeek Miniforge Django torchinfo Statistics Plate Vim Conda v2ray XML Transformers Jetson ResNet-50 多线程 Image2Text Python CEIR Crawler Safetensors icon AI Nginx InvalidArgumentError SQLite 证件照 搞笑 Streamlit ChatGPT TensorRT Dataset Google Markdown LeetCode UNIX 云服务器 Search Breakpoint v0.dev Bert YOLO logger NLP NameSilo Magnet Shortcut RAR Data Bin PyTorch SPIE Proxy FP8 CC GoogLeNet PyCharm CV HuggingFace COCO Bitcoin FP16 报税 阿里云 SQL Claude Quantization Jupyter Freesound 域名 Pytorch LoRA Template FlashAttention QWEN CTC transformers Hungarian BF16 Color Gemma 算法题 Heatmap VPN Tensor Tiktoken OCR Cloudreve IndexTTS2 Knowledge Bipartite 继承 Paper Clash Website Paddle Attention 版权 tqdm Use TensorFlow 多进程 Firewall LaTeX Ptyhon Numpy GIT API Input Domain tar diffusers Git Plotly 公式 财报 Logo Random GPT4 XGBoost VGG-16 mmap Hotel Excel Math Permission Ubuntu FP64 Pandas Qwen2 git Disk Animate Datetime 递归学习法 图标 Distillation TSV Agent News uwsgi Diagram Quantize Web CLAP OpenAI Baidu FP32 Qwen C++ Augmentation SVR Vmess HaggingFace Windows ONNX 顶会 Interview RGB BeautifulSoup Docker Video Pillow Linux EXCEL NLTK GPTQ Tracking 图形思考法 SAM
站点统计

本站现有博文324篇,共被浏览823413

本站已经建立2529天!

热门文章
文章归档
回到顶部