EADST

SPIE 2020 Papers

Dong Xie and Colleen P. Bailey "Novel receipt recognition with deep learning algorithms", Proc. SPIE 11400, Pattern Recognition and Tracking XXXI, 114000B (22 April 2020); https://doi.org/10.1117/12.2558206

Abstract

We propose a new recognition method to extract effective information from receipts by integrating deep learning algorithms from computer vision and natural language processing. Our method consists of three parts. The first part isolates the effective receipt area: by removing noise and extracting the gradient of the receipt image, we determine a threshold used to crop and reshape the useful receipt area. The second part detects text in the receipt image: we modify and deploy the connectionist text proposal network (CTPN), a text detection algorithm, to locate the text regions in the receipt. In the third part, we use connectionist temporal classification (CTC) with maximum entropy regularization as the loss function for training a convolutional recurrent neural network (CRNN) that recognizes the detected text regions, converting the receipt from an image into text. With this method, the effective information of a receipt can be integrated and utilized. We train and test our system on the Scanned Receipts Optical Character Recognition and Information Extraction (SROIE) data set. The results illustrate that our recognition system is able to identify receipt information quickly and accurately.
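
The first part of the pipeline (denoise, take the gradient, threshold, crop) can be illustrated with a minimal NumPy sketch. The abstract does not give the filter, the threshold rule, or the resizing step, so everything here is an assumption made for illustration: the mean filter `box_blur`, the function name `crop_receipt`, and the rule "keep rows and columns whose summed gradient exceeds a fraction `frac` of the maximum" are hypothetical and are not taken from the paper.

```python
import numpy as np


def box_blur(img: np.ndarray, k: int = 3) -> np.ndarray:
    """k x k mean filter; a stand-in for the paper's unspecified denoising step."""
    pad = k // 2
    padded = np.pad(img.astype(float), pad, mode="edge")
    out = np.zeros(img.shape, dtype=float)
    # Sum the k*k shifted copies of the image, then divide to get the mean.
    for dy in range(k):
        for dx in range(k):
            out += padded[dy:dy + img.shape[0], dx:dx + img.shape[1]]
    return out / (k * k)


def crop_receipt(img: np.ndarray, frac: float = 0.2) -> np.ndarray:
    """Crop a grayscale image to the region with strong gradients.

    Assumed rule (not from the paper): a row or column belongs to the
    receipt area if its summed gradient magnitude exceeds `frac` times
    the largest row or column sum.
    """
    smooth = box_blur(img)
    gy, gx = np.gradient(smooth)        # derivatives along rows and columns
    magnitude = np.hypot(gx, gy)

    row_energy = magnitude.sum(axis=1)
    col_energy = magnitude.sum(axis=0)
    rows = np.flatnonzero(row_energy > frac * row_energy.max())
    cols = np.flatnonzero(col_energy > frac * col_energy.max())
    if rows.size == 0 or cols.size == 0:
        return img                      # flat image: nothing to crop

    # Bounding box from the first to the last strong row and column.
    return img[rows[0]:rows[-1] + 1, cols[0]:cols[-1] + 1]
```

Calling `crop_receipt(gray)` on a 2-D grayscale array returns the sub-array bounded by the outermost strong-gradient rows and columns. The reshaping step mentioned in the abstract is left out, since the target size is not stated.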

Arthur C. Depoian, Lorenzo E Jaques, Dong Xie, Colleen P. Bailey, and Parthasarathy Guturu "Computer vision learning techniques for sports video analytics: removing overlays", Proc. SPIE 11395, Big Data II: Learning, Analytics, and Applications, 113950M (24 April 2020); https://doi.org/10.1117/12.2560888

Abstract

Big data has been driving professional sports over the last decade. In our data-driven world, it becomes important to find additional methods for the analysis of both games and athletes. There is an abundance of videos taken in professional and amateur sports. Player datasets can be created utilizing computer vision techniques. We propose a novel approach by creating an autonomous masking algorithm that can receive live or previously recorded video footage of sporting events. This procedure can identify graphical overlays to optimize further processing by tracking and text recognition algorithms for real-time analysis.
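
The abstract does not say how the overlay mask is computed. As a rough illustration only, the sketch below uses one simple heuristic, assumed here and not taken from the paper: graphical overlays such as scoreboards tend to stay fixed while the game footage changes, so pixels with very low variation over time are marked as overlay. The names `static_overlay_mask` and `apply_mask` and the threshold `max_std` are hypothetical.

```python
import numpy as np


def static_overlay_mask(frames: np.ndarray, max_std: float = 1.0) -> np.ndarray:
    """Return a boolean (H, W) mask, True where a pixel barely changes over time.

    `frames` is a stack of grayscale frames with shape (T, H, W).
    Assumption: overlays are static, moving game content is not.
    """
    temporal_std = frames.astype(float).std(axis=0)
    return temporal_std < max_std


def apply_mask(frame: np.ndarray, mask: np.ndarray, fill: int = 0) -> np.ndarray:
    """Blank out masked pixels so later tracking steps ignore the overlay."""
    out = frame.copy()
    out[mask] = fill
    return out
```

A mask built this way from a short window of frames could be applied to each incoming frame before tracking, or inverted to isolate the overlay region for text recognition. Overlays that animate or fade would need a different criterion.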

