Multi-modal OCR pipeline optimized for ML training (text, figures, math, tables, diagrams)
machine-learning ocr openai multi-modal educational-data table-parsing ml-datasets exam-ocr doclayout paper-ocr
Updated Apr 8, 2025 - Python