EADST

SPIE 2020 Papers

Dong Xie and Colleen P. Bailey "Novel receipt recognition with deep learning algorithms", Proc. SPIE 11400, Pattern Recognition and Tracking XXXI, 114000B (22 April 2020); https://doi.org/10.1117/12.2558206

Abstract

We propose a new recognition method that extracts useful information from receipts by integrating deep learning algorithms from computer vision and natural language processing. The method consists of three parts. The first part isolates the receipt area: after removing noise and extracting the gradient of the receipt image, we determine a threshold and use it to crop and reshape the useful receipt area. The second part detects text in the receipt image: we modify and deploy the connectionist text proposal network (CTPN), a text detection algorithm, to locate the text regions in the receipt. In the third part, we adopt connectionist temporal classification (CTC) with maximum entropy regularization as the loss function for training the convolutional recurrent neural network (CRNN) that recognizes the detected text regions, converting the receipt from an image into text. With this method, the useful information in a receipt can be integrated and utilized. We train and test our system on the data set published by the scanned receipts optical character recognition and information extraction (SROIE) competition. The results show that our recognition system identifies receipt information quickly and accurately.
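The first stage (denoise, take the gradient, threshold, crop) can be illustrated with a small NumPy sketch. This is not the authors' implementation: the 3x3 mean filter, the relative threshold of 0.25, and the helper names `box_blur` and `crop_receipt` are all assumptions chosen for illustration, and the abstract does not say how the threshold is actually determined.

```python
import numpy as np


def box_blur(img, k=3):
    """Suppress noise with a k x k mean filter (edge-padded). Assumed denoiser."""
    pad = k // 2
    padded = np.pad(img.astype(np.float64), pad, mode="edge")
    out = np.zeros(img.shape, dtype=np.float64)
    for dy in range(k):
        for dx in range(k):
            out += padded[dy:dy + img.shape[0], dx:dx + img.shape[1]]
    return out / (k * k)


def crop_receipt(gray, rel_threshold=0.25):
    """Crop a grayscale image to the bounding box of its strong-gradient pixels.

    rel_threshold is a hypothetical parameter: pixels whose gradient magnitude
    is at least this fraction of the maximum are treated as receipt edges.
    """
    smooth = box_blur(gray)
    gy, gx = np.gradient(smooth)          # derivatives along rows, then columns
    magnitude = np.hypot(gx, gy)
    if magnitude.max() == 0:
        return gray                       # flat image: nothing to crop
    mask = magnitude >= rel_threshold * magnitude.max()
    rows = np.flatnonzero(mask.any(axis=1))
    cols = np.flatnonzero(mask.any(axis=0))
    return gray[rows[0]:rows[-1] + 1, cols[0]:cols[-1] + 1]


# Synthetic example: a bright 20 x 20 "receipt" on a dark 40 x 30 background.
image = np.zeros((40, 30))
image[10:30, 5:25] = 1.0
cropped = crop_receipt(image)
```

On a real photograph, the fixed relative threshold would likely need tuning, and a perspective correction step would be required for the "reshape" part of the stage, which this sketch leaves out.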


Arthur C. Depoian, Lorenzo E Jaques, Dong Xie, Colleen P. Bailey, and Parthasarathy Guturu "Computer vision learning techniques for sports video analytics: removing overlays", Proc. SPIE 11395, Big Data II: Learning, Analytics, and Applications, 113950M (24 April 2020); https://doi.org/10.1117/12.2560888

Abstract

Big data has been driving professional sports over the last decade. In our data-driven world, it becomes important to find additional methods for the analysis of both games and athletes. There is an abundance of videos taken in professional and amateur sports. Player datasets can be created utilizing computer vision techniques. We propose a novel approach by creating an autonomous masking algorithm that can receive live or previously recorded video footage of sporting events. This procedure can identify graphical overlays to optimize further processing by tracking and text recognition algorithms for real-time analysis.
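The abstract does not describe how the masking algorithm identifies overlays. As a purely hypothetical illustration of the general idea, the sketch below assumes that graphical overlays (scoreboards, logos) stay nearly constant across frames while the game footage changes, and flags pixels with low temporal standard deviation. The function names `static_overlay_mask` and `apply_mask` and the threshold value are assumptions, not taken from the paper.

```python
import numpy as np


def static_overlay_mask(frames, std_threshold=2.0):
    """Return a boolean (H, W) mask of pixels that barely change over time.

    frames: array-like of shape (T, H, W) holding grayscale frames.
    std_threshold: assumed cutoff on the per-pixel temporal standard deviation.
    """
    stack = np.asarray(frames, dtype=np.float64)
    return stack.std(axis=0) < std_threshold


def apply_mask(frame, mask, fill=0):
    """Blank out masked pixels so later tracking or OCR stages can ignore them."""
    out = np.array(frame, copy=True)
    out[mask] = fill
    return out


# Synthetic clip: random "game footage" with a constant band in the top two rows.
rng = np.random.default_rng(0)
frames = rng.integers(0, 256, size=(20, 8, 8)).astype(np.float64)
frames[:, :2, :] = 200                    # static overlay band
mask = static_overlay_mask(frames)
cleaned = apply_mask(frames[0], mask)
```

A static-pixel heuristic like this would miss animated overlays and could wrongly flag a motionless background, so it should be read only as a starting point for thinking about the problem.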

