PDF to Markdown with vision models
OCRmyPDF adds an OCR text layer to scanned PDF files
A high-quality tool for convert PDF to Markdown and JSON
PDF scientific paper translation with preserved formats
Open Source Document Management System for Digital Archives
A community-supported supercharged version of paperless
A Repo For Document AI
A Python application to add watermarks (text or image) to PDF files
A supercharged version of paperless, scan, index and archive docs
Easy-OCR solution and Tesseract trainer for GNU/Linux
Virtual Appliance of RadicalSpam