EADST

SPIE 2020 Papers

Dong Xie and Colleen P. Bailey "Novel receipt recognition with deep learning algorithms", Proc. SPIE 11400, Pattern Recognition and Tracking XXXI, 114000B (22 April 2020); https://doi.org/10.1117/12.2558206

Abstract

We propose a new recognition method to extract effective information from receipts by integrating deep learning algorithms from computer vision and natural language processing. Our method consists of three parts. The first part provides effective areas for receipt detection. By removing noise and extracting the gradient of the receipt image, we determine the threshold to crop and reshape the useful receipt area. Detecting text from a receipt image is the second part, we modify and deploy the text detection algorithm connectionist text proposal network (CTPN) to locate the text region in the receipt. In the third part, we import the connectionist temporal classification with maximum entropy regularization as the loss function for updating the convolutional recurrent neural networks (CRNN) to recognize the text detection area, which converts the receipt from an image into the text. Based on our method, the effective information of a receipt can be integrated and utilized. We train and test our system using the data set published by scanned receipts optical character recognition and information extraction (SROIE). The results illustrate that our recognition system is able to identify receipt information quickly and accurately.

Paper Download

Arthur C. Depoian, Lorenzo E Jaques, Dong Xie, Colleen P. Bailey, and Parthasarathy Guturu "Computer vision learning techniques for sports video analytics: removing overlays", Proc. SPIE 11395, Big Data II: Learning, Analytics, and Applications, 113950M (24 April 2020); https://doi.org/10.1117/12.2560888

Abstract

Big data has been driving professional sports over the last decade. In our data-driven world, it becomes important to find additional methods for the analysis of both games and athletes. There is an abundance of videos taken in professional and amateur sports. Player datasets can be created utilizing computer vision techniques. We propose a novel approach by creating an autonomous masking algorithm that can receive live or previously recorded video footage of sporting events. This procedure can identify graphical overlays to optimize further processing by tracking and text recognition algorithms for real-time analysis.

Paper Download

相关标签
About Me
XD
Goals determine what you are going to be.
Category
标签云
VGG-16 Input llama.cpp PDB XGBoost Bitcoin FP32 算法题 OCR GGML Statistics Review Distillation ModelScope Michelin SVR Color LaTeX OpenCV WAN Hilton Vmess 报税 GPTQ NLP Bipartite 图形思考法 LLM torchinfo Augmentation Excel 飞书 JSON CC DeepStream Translation Base64 HuggingFace CUDA logger Data Vim 证件照 第一性原理 继承 VSCode PyCharm PIP SAM BF16 Attention git-lfs Password Baidu Datetime Anaconda Breakpoint Tracking Nginx Paddle Numpy Qwen2 C++ TensorRT DeepSeek Gemma Shortcut UI 公式 VPN CSV ChatGPT Pytorch Agent GPT4 WebCrawler Zip Hungarian 阿里云 YOLO Search Pillow CAM SQLite Claude CEIR Permission 顶会 Jetson Qwen2.5 HaggingFace Ptyhon RAR Domain TSV mmap AI FP8 Animate Image2Text hf CLAP Video Mixtral 净利润 News Template NameSilo Rebuttal Freesound 搞笑 Tiktoken 签证 Plotly Heatmap Linux v0.dev 强化学习 LLAMA Crawler FastAPI Random Google 关于博主 tqdm uwsgi v2ray tar printf Knowledge LeetCode Paper Transformers UNIX transformers CV ResNet-50 MD5 Llama Docker Website Interview Github Diagram Land 云服务器 Web Hotel Pickle Dataset Django GoogLeNet 递归学习法 Markdown 多线程 QWEN 域名 Cloudreve Pandas scipy Miniforge Jupyter SQL TTS 财报 BTC 版权 OpenAI Conda XML 音频 Magnet Sklearn Use Proxy git NLTK uWSGI Ubuntu Quantize GIT Algorithm diffusers API EXCEL Bin Food FP16 PyTorch Safetensors FP64 Streamlit CTC Quantization LoRA Tensor icon IndexTTS2 Python TensorFlow Firewall Disk Math Bert Plate Logo SPIE 多进程 ONNX BeautifulSoup Clash FlashAttention RGB 图标 Windows Git Qwen InvalidArgumentError 腾讯云 Card COCO PDF
站点统计

本站现有博文324篇,共被浏览807390

本站已经建立2508天!

热门文章
文章归档
回到顶部