A Unified Toolkit for Deep Learning Based Document Image Analysis
-
Updated
Aug 15, 2024 - Python
A Unified Toolkit for Deep Learning Based Document Image Analysis
The official code for “DocTr: Document Image Transformer for Geometric Unwarping and Illumination Correction”, ACM MM, Oral Paper, 2021.
The official repo for “DocScanner: Robust Document Image Rectification with Progressive Learning”, IJCV, 2025.
Detectron2 for Document Layout Analysis
The official code for “Geometric Representation Learning for Document Image Rectification”, ECCV, 2022.
文档图像处理工具(Document image processing tool),包括漂白 / 文字方向矫正 / 清晰增强 / 笔记去噪美化 / 去阴影 / 扭曲矫正 / 切边增强(DocBleach / TextOrientationCorrection / DocSharpening / HandwritingDenoisingBeautifying / DocShadowRemoval / document_image_dewarping / DocTrimmingEnhancement)。
Process Caltech Archives' digital documents and photos, and annotate each page or image with information about its contents
The ScriptNet / competitions site.
This script automates the process of extracting text from various file formats (images, PDFs, DOCX) using Optical Character Recognition (OCR) powered by Azure Cognitive Services. The script supports image preprocessing, text extraction, and uploading of the processed files to Google Cloud Storage (GCP).
Sophia Trikoupi dataset (Collection of 46 handwritten, annotated pages)
Add a description, image, and links to the document-image-processing topic page so that developers can more easily learn about it.
To associate your repository with the document-image-processing topic, visit your repo's landing page and select "manage topics."