A comprehensive list of document parsers, covering PDF-to-text conversion and layout extraction. Each is tested for support of tables, equations, handwriting, two-column layouts, and multi-column layouts.
☆178 · Jul 14, 2025 · Updated 11 months ago
Alternatives and similar repositories for document-parsers-list
Users that are interested in document-parsers-list are comparing it to the libraries listed below.
- A simple streamlit app, dockerized, to do OCR on documents. I'm lazy, idk. ☆26 · Aug 18, 2025 · Updated 10 months ago
- A Python package for zero-shot text anonymization using Transformer-based NER models. ☆84 · Dec 16, 2025 · Updated 6 months ago
- ☆12 · May 30, 2025 · Updated last year
- Minimal web client for chatting and roleplay with AI characters ☆26 · Aug 21, 2025 · Updated 10 months ago
- Production-grade OpenClaw personal assistant setup. Security-hardened, 15+ custom tools, Purple-Team audited. Templates & architecture do… ☆82 · Mar 25, 2026 · Updated 3 months ago
- Empower anyone to code something useful ☆31 · Jun 9, 2025 · Updated last year
- 200+ pages/s pdf extractor (tables, bold, italic, jazz) ☆108 · May 24, 2026 · Updated last month
- A tool for adding function calling to llm api, available as a service by following the link ☆22 · Aug 11, 2025 · Updated 10 months ago
- High-Performance Text Deduplication Toolkit ☆61 · Aug 25, 2025 · Updated 10 months ago
- Data Science Foundations: Python Scientific Stack ☆11 · Jun 2, 2022 · Updated 4 years ago
- A unit test framework for prompts. ☆11 · Feb 9, 2023 · Updated 3 years ago
- A Demo of Running Sleep-time Compute to Reduce LLM Latency ☆16 · May 17, 2025 · Updated last year
- From-scratch implementation of OpenAI's GPT-OSS model in Python. No Torch, No GPUs. ☆108 · Nov 5, 2025 · Updated 7 months ago
- Intelligent file organization with computer vision, audio analysis, chunking, proactive AI-powered analysis, interactive classification, … ☆39 · Updated this week
- A simple CPU only OCR for pdf/images/word/excel to markdown. With streamlit. ☆51 · Jan 26, 2026 · Updated 5 months ago
- ☆13 · Jun 18, 2024 · Updated 2 years ago
- A simple Docker Compose boilerplate for deploying Open WebUI and LiteLLM with Traefik for personal LLM use. Securely manage and access la… ☆21 · Jun 3, 2025 · Updated last year
- Cookiecutter template for MCP servers with one-click Render.com deployment - Generate production-ready API integration servers in minutes ☆18 · Jul 4, 2025 · Updated 11 months ago
- Crashbench is a LLM benchmark to measure bug-finding and reporting capabilities of LLMs ☆14 · Mar 8, 2026 · Updated 3 months ago
- ☆14 · Aug 23, 2024 · Updated last year
- ☆17 · Mar 11, 2025 · Updated last year
- Reasoning Systems with tool use are strong zero-shot object detectors ☆58 · Oct 9, 2025 · Updated 8 months ago
- Byte-Vision is a privacy-first document intelligence platform that transforms static documents into an interactive, searchable knowledge … ☆70 · Nov 28, 2025 · Updated 7 months ago
- ☆39 · Sep 7, 2025 · Updated 9 months ago
- 🌳 MCTS-inspired parallel beam search for conversation optimization. Explore multiple dialogue strategies simultaneously, stress-test a… ☆36 · Jan 18, 2026 · Updated 5 months ago
- A proxy that hosts multiple single-model runners such as LLama.cpp and vLLM ☆12 · May 30, 2025 · Updated last year
- ☆17 · Feb 22, 2025 · Updated last year
- This workshop covers the entire process of using Milvus—from installation and basic concepts to core operations and practical application… ☆34 · Jun 4, 2026 · Updated 3 weeks ago
- A lightweight MCP server that integrates with Apple Notes to create a personal memory system for AI. Easily recall and save information f… ☆12 · Apr 7, 2025 · Updated last year
- LLM based map for research, exploration and discovery. ☆51 · Feb 11, 2025 · Updated last year
- A free, source available personal finance app: connect your bank, track spending, and build budgets; secure with clear, real‑time insight… ☆36 · Updated this week
- code for Towards Data Science article on prompt-loss-weight ☆11 · Jun 4, 2025 · Updated last year
- A simple streamlit app to play with qwen3-2b-VL to perform OCR. Dockerized set up, tested with 3060 12 GB. ☆32 · Nov 23, 2025 · Updated 7 months ago
- Try out HallOumi, a state-of-the-art claim verification model in a simple UI! ☆41 · Apr 2, 2025 · Updated last year
- ☆13 · Mar 10, 2025 · Updated last year
- The desktop app for ComfyUI ☆164 · Updated this week
- interactive semantic search demo using Qwen3-0.6B-Embedding in your browser ☆60 · Feb 25, 2026 · Updated 4 months ago
- Simple HTML template library for C++ ☆14 · Feb 3, 2021 · Updated 5 years ago
- Collaborative AI Model ☆11 · Nov 27, 2024 · Updated last year