Open Source OCR Engine
A pure Javascript Multilingual OCR
PDF to Markdown with vision models
Formula recognition based on LaTeX-OCR and ONNXRuntime
OCRmyPDF adds an OCR text layer to scanned PDF files
A GUI tool for extracting hard-coded subtitle (hardsub) from videos
Awesome multilingual OCR toolkits based on PaddlePaddle
Ready-to-use OCR with 80+ supported languages
A community-supported supercharged version of paperless
Java interface to OpenCV, FFmpeg, and more
Web application that allows you to perform operations on PDF files
A framework to enable multimodal models to operate a computer
Library for OCR-related tasks powered by Deep Learning
PDF scientific paper translation with preserved formats
Open Source Document Management System for Digital Archives
Math OCR model that outputs LaTeX and markdown
Convert AI papers to GUI
A high-quality tool for convert PDF to Markdown and JSON
Open source clipboard management tools for Windows, Macos and Linux
Qwen3-VL, the multimodal large language model series by Alibaba Cloud
Qwen3-omni is a natively end-to-end, omni-modal LLM
Deep Learning API and Server in C++14 support for Caffe, PyTorch
Free Open Source Enterprise Grade RPA
WindowTextExtractor allows you to get a text from any OS
Converts PDF, DOC, DOCX, XML, HTML, RTF, etc to plain text