EADST

SPIE 2020 Papers

Dong Xie and Colleen P. Bailey "Novel receipt recognition with deep learning algorithms", Proc. SPIE 11400, Pattern Recognition and Tracking XXXI, 114000B (22 April 2020); https://doi.org/10.1117/12.2558206

Abstract

We propose a new recognition method to extract effective information from receipts by integrating deep learning algorithms from computer vision and natural language processing. Our method consists of three parts. The first part provides effective areas for receipt detection. By removing noise and extracting the gradient of the receipt image, we determine the threshold to crop and reshape the useful receipt area. Detecting text from a receipt image is the second part, we modify and deploy the text detection algorithm connectionist text proposal network (CTPN) to locate the text region in the receipt. In the third part, we import the connectionist temporal classification with maximum entropy regularization as the loss function for updating the convolutional recurrent neural networks (CRNN) to recognize the text detection area, which converts the receipt from an image into the text. Based on our method, the effective information of a receipt can be integrated and utilized. We train and test our system using the data set published by scanned receipts optical character recognition and information extraction (SROIE). The results illustrate that our recognition system is able to identify receipt information quickly and accurately.

Paper Download

Arthur C. Depoian, Lorenzo E Jaques, Dong Xie, Colleen P. Bailey, and Parthasarathy Guturu "Computer vision learning techniques for sports video analytics: removing overlays", Proc. SPIE 11395, Big Data II: Learning, Analytics, and Applications, 113950M (24 April 2020); https://doi.org/10.1117/12.2560888

Abstract

Big data has been driving professional sports over the last decade. In our data-driven world, it becomes important to find additional methods for the analysis of both games and athletes. There is an abundance of videos taken in professional and amateur sports. Player datasets can be created utilizing computer vision techniques. We propose a novel approach by creating an autonomous masking algorithm that can receive live or previously recorded video footage of sporting events. This procedure can identify graphical overlays to optimize further processing by tracking and text recognition algorithms for real-time analysis.

Paper Download

相关标签
About Me
XD
Goals determine what you are going to be.
Category
标签云
Bert Claude 报税 UNIX SQLite LeetCode 继承 Data GoogLeNet v2ray Algorithm Land FP8 Proxy Augmentation InvalidArgumentError Tracking UI diffusers Qwen2.5 VGG-16 阿里云 SPIE Password TSV Qwen2 Hotel 顶会 Vim Agent tqdm FastAPI Pandas transformers 第一性原理 Linux uWSGI TensorFlow IndexTTS2 Shortcut PyCharm LaTeX hf BF16 Vmess Pickle Paddle Color Firewall HuggingFace CEIR JSON Food Translation Input CUDA Google API Clash Base64 Domain FlashAttention COCO Sklearn Template Excel QWEN Quantize Image2Text 多进程 LLM C++ scipy llama.cpp Freesound git-lfs uwsgi OpenAI WAN Plate CC AI LLAMA DeepSeek Docker SVR 签证 云服务器 DeepStream 多线程 音频 PDF XML OpenCV CAM printf Streamlit Ubuntu Pillow Dataset git FP64 Statistics Python OCR Quantization Llama 公式 净利润 Hungarian SAM Pytorch logger Tiktoken Jetson CLAP 腾讯云 Paper TensorRT Diagram tar ONNX TTS Permission BeautifulSoup MD5 Card Magnet NameSilo 图形思考法 SQL RGB Ptyhon Bitcoin NLP ModelScope Knowledge 递归学习法 Bipartite Django FP16 Attention GGML Math LoRA CV PIP Markdown Zip Cloudreve Breakpoint Git Random Gemma ChatGPT WebCrawler EXCEL Video Windows torchinfo ResNet-50 Anaconda Baidu CTC Github HaggingFace Bin BTC Web Qwen 飞书 GIT FP32 Safetensors 证件照 Tensor 关于博主 v0.dev Logo PyTorch GPTQ 版权 XGBoost Mixtral Miniforge Numpy VSCode Transformers Plotly Interview 算法题 Animate Distillation Hilton Nginx News mmap Review 强化学习 域名 VPN RAR PDB Datetime Crawler CSV Conda Website Disk NLTK Search Heatmap Jupyter Michelin 财报 搞笑 Use YOLO GPT4
站点统计

本站现有博文321篇,共被浏览771602

本站已经建立2459天!

热门文章
文章归档
回到顶部