EADST

SPIE 2020 Papers

Dong Xie and Colleen P. Bailey "Novel receipt recognition with deep learning algorithms", Proc. SPIE 11400, Pattern Recognition and Tracking XXXI, 114000B (22 April 2020); https://doi.org/10.1117/12.2558206

Abstract

We propose a new recognition method to extract effective information from receipts by integrating deep learning algorithms from computer vision and natural language processing. Our method consists of three parts. The first part provides effective areas for receipt detection. By removing noise and extracting the gradient of the receipt image, we determine the threshold to crop and reshape the useful receipt area. Detecting text from a receipt image is the second part, we modify and deploy the text detection algorithm connectionist text proposal network (CTPN) to locate the text region in the receipt. In the third part, we import the connectionist temporal classification with maximum entropy regularization as the loss function for updating the convolutional recurrent neural networks (CRNN) to recognize the text detection area, which converts the receipt from an image into the text. Based on our method, the effective information of a receipt can be integrated and utilized. We train and test our system using the data set published by scanned receipts optical character recognition and information extraction (SROIE). The results illustrate that our recognition system is able to identify receipt information quickly and accurately.

Paper Download

Arthur C. Depoian, Lorenzo E Jaques, Dong Xie, Colleen P. Bailey, and Parthasarathy Guturu "Computer vision learning techniques for sports video analytics: removing overlays", Proc. SPIE 11395, Big Data II: Learning, Analytics, and Applications, 113950M (24 April 2020); https://doi.org/10.1117/12.2560888

Abstract

Big data has been driving professional sports over the last decade. In our data-driven world, it becomes important to find additional methods for the analysis of both games and athletes. There is an abundance of videos taken in professional and amateur sports. Player datasets can be created utilizing computer vision techniques. We propose a novel approach by creating an autonomous masking algorithm that can receive live or previously recorded video footage of sporting events. This procedure can identify graphical overlays to optimize further processing by tracking and text recognition algorithms for real-time analysis.

Paper Download

相关标签
About Me
XD
Goals determine what you are going to be.
Category
标签云
LLAMA Tiktoken Random 证件照 CEIR Input Numpy 图标 Safetensors FlashAttention Distillation SPIE Pandas 多线程 BeautifulSoup 域名 GPT4 Linux scipy llama.cpp icon printf Cloudreve TTS Hilton Bert Conda git-lfs Google 算法题 TensorRT PDB Template Python ResNet-50 Llama Streamlit GIT Transformers Zip PyTorch Review Clash GoogLeNet InvalidArgumentError API Magnet Video Firewall Michelin IndexTTS2 Ubuntu Card logger 强化学习 JSON transformers Windows torchinfo Color 报税 BF16 Math 音频 CTC Quantization 论文速读 Search OpenAI C++ Pillow Docker FP16 Agent Ptyhon Rebuttal Permission Django PIP WebCrawler Freesound DeepSeek Github Data Hungarian 第一性原理 DeepStream tqdm CUDA Jupyter uWSGI Knowledge 论文 Markdown UI Land VGG-16 Nginx diffusers TSV Diagram Attention GGML Git HaggingFace 公式 Shortcut FP64 顶会 Translation Qwen2 Interview Use CV Password 腾讯云 Bitcoin Base64 XML Paddle ModelScope Website YOLO PDF LLM NLP FastAPI 云服务器 SVR Proxy COCO 飞书 HuggingFace Web RAR mmap 财报 WAN ONNX CC hf Gemma Pickle LaTeX MD5 Datetime v0.dev NLTK Jetson Paper TensorFlow Claude 签证 uwsgi LoRA 关于博主 ChatGPT CSV Miniforge 递归学习法 Animate FP32 OCR Heatmap Statistics Vmess LeetCode 多进程 CAM SQLite Qwen Crawler OpenCV EXCEL Anaconda XGBoost VSCode Disk Dataset 阿里云 Algorithm tar Quantize git AI QWEN RGB PyCharm Augmentation Vim SAM 搞笑 版权 FP8 Bipartite Pytorch Image2Text UNIX Plate News 图形思考法 Bin 继承 Food Mixtral NameSilo 净利润 Baidu SQL Domain Logo v2ray Plotly Excel BTC Sklearn CLAP Hotel Tracking Tensor VPN GPTQ Breakpoint Qwen2.5
站点统计

本站现有博文328篇,共被浏览858417

本站已经建立2566天!

热门文章
文章归档
回到顶部