EADST

SPIE 2020 Papers

Dong Xie and Colleen P. Bailey "Novel receipt recognition with deep learning algorithms", Proc. SPIE 11400, Pattern Recognition and Tracking XXXI, 114000B (22 April 2020); https://doi.org/10.1117/12.2558206

Abstract

We propose a new recognition method to extract effective information from receipts by integrating deep learning algorithms from computer vision and natural language processing. Our method consists of three parts. The first part provides effective areas for receipt detection. By removing noise and extracting the gradient of the receipt image, we determine the threshold to crop and reshape the useful receipt area. Detecting text from a receipt image is the second part, we modify and deploy the text detection algorithm connectionist text proposal network (CTPN) to locate the text region in the receipt. In the third part, we import the connectionist temporal classification with maximum entropy regularization as the loss function for updating the convolutional recurrent neural networks (CRNN) to recognize the text detection area, which converts the receipt from an image into the text. Based on our method, the effective information of a receipt can be integrated and utilized. We train and test our system using the data set published by scanned receipts optical character recognition and information extraction (SROIE). The results illustrate that our recognition system is able to identify receipt information quickly and accurately.

Paper Download

Arthur C. Depoian, Lorenzo E Jaques, Dong Xie, Colleen P. Bailey, and Parthasarathy Guturu "Computer vision learning techniques for sports video analytics: removing overlays", Proc. SPIE 11395, Big Data II: Learning, Analytics, and Applications, 113950M (24 April 2020); https://doi.org/10.1117/12.2560888

Abstract

Big data has been driving professional sports over the last decade. In our data-driven world, it becomes important to find additional methods for the analysis of both games and athletes. There is an abundance of videos taken in professional and amateur sports. Player datasets can be created utilizing computer vision techniques. We propose a novel approach by creating an autonomous masking algorithm that can receive live or previously recorded video footage of sporting events. This procedure can identify graphical overlays to optimize further processing by tracking and text recognition algorithms for real-time analysis.

Paper Download

相关标签
About Me
XD
Goals determine what you are going to be.
Category
标签云
Diagram Pandas VGG-16 API TSV 签证 Jupyter EXCEL CV Heatmap Web Interview OpenAI CUDA Permission Google FP32 GGML CEIR 净利润 Dataset Distillation Cloudreve uWSGI Pillow Ptyhon NameSilo LaTeX DeepSeek transformers Crawler 强化学习 FastAPI Llama Paper SPIE uwsgi HuggingFace Zip VPN Knowledge BTC MD5 Excel Food Attention 算法题 QWEN VSCode Hotel Clash SQLite RAR Docker YOLO DeepStream 飞书 CC SVR Miniforge Numpy ModelScope Claude IndexTTS2 CLAP scipy Transformers Sklearn RGB Streamlit tar Review Markdown LoRA Augmentation ResNet-50 JSON Quantize GPT4 hf Gemma Baidu 多进程 AI 顶会 CSV 图形思考法 Plotly CTC 财报 OCR printf Math Search NLTK 关于博主 Password ChatGPT Vmess Nginx mmap GoogLeNet Qwen2 v0.dev BeautifulSoup Quantization Magnet Bert News 阿里云 Breakpoint Vim Datetime UI Proxy tqdm Jetson FP8 音频 Michelin git-lfs logger C++ Data 第一性原理 Hungarian PDF Input Logo Card FlashAttention diffusers OpenCV Paddle 报税 Django Github Pytorch XGBoost SAM CAM Image2Text Mixtral 递归学习法 GPTQ LLAMA XML git GIT 多线程 Algorithm Base64 Qwen2.5 Random FP16 Shortcut 版权 torchinfo Hilton Bitcoin TTS v2ray Python llama.cpp Safetensors 证件照 LLM Tiktoken COCO Pickle Linux PDB Tensor HaggingFace TensorFlow UNIX InvalidArgumentError Tracking FP64 继承 PyCharm ONNX Windows Land Domain PIP Ubuntu Plate Animate 搞笑 Conda Freesound LeetCode PyTorch Firewall Anaconda 域名 Use TensorRT WAN 公式 Translation BF16 Template WebCrawler Color Disk Git Video Bipartite SQL Website Agent Qwen 腾讯云 Bin Statistics 云服务器 NLP
站点统计

本站现有博文321篇,共被浏览763811

本站已经建立2439天!

热门文章
文章归档
回到顶部