![](/rp/kFAqShRrnkQMbH6NYLBYoJ3lq9s.png)
xinke-wang/OCRDatasets: A collection of OCR-related datasets - GitHub
This repo collects OCR-related datasets. In general, the datasets are classified by 6 types, i.e., Natural Scene Text, Document Text, Handwritten Text, Historical Document Text, Video Text, and Synthetic Text.
standard OCR dataset - Kaggle
Optical Character Recognition Dataset containing Various Fonts and Style Kaggle uses cookies from Google to deliver and enhance the quality of its services and to analyze traffic. Learn more
OCR Image Datasets - FutureBeeAI
Perfect for machine learning and AI projects, our OCR image datasets are essential for refining text recognition algorithms, improving data extraction accuracy, and advancing document digitization initiatives. Access high-quality, diverse data sets …
A Comprehensive List of OCR Datasets for Machine Learning
2023年8月11日 · To build accurate and robust OCR models, access to high-quality training data is crucial. In this blog, we present a comprehensive list of OCR datasets that are invaluable resources for training...
ZumingHuang/awesome-ocr-resources - GitHub
2025年1月5日 · This repository contains a comprehensive collection of resources related to OCR (Optical Character Recognition) and Document AI, such as papers, datasets, and APIs.
Machine Learning Datasets - Papers With Code
50 dataset results for Optical Character Recognition (OCR) IAM (IAM Handwriting) The IAM database contains 13,353 images of handwritten lines of text created by 657 writers.
PDF Document / OCR Datasets - a pixparse Collection - Hugging …
2024年3月29日 · Document datasets with .pdf files that are usable with pixparse libraries and tools.
OCR datasets - PaddleOCR Documentation
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
Standard OCR dataset - AI Data Collection Company
Optical Character Recognition (OCR) models with our Standard OCR Dataset. Download now for improved AI research in text extraction.
TextOCR Dataset - Papers With Code
TextOCR is a dataset to benchmark text recognition on arbitrary shaped scene-text. TextOCR requires models to perform text-recognition on arbitrary shaped scene-text present on natural images.
- 某些结果已被删除