A Comprehensive Toolkit for High-Quality PDF Content Extraction
☆9,402Jan 3, 2025Updated last year
Alternatives and similar repositories for PDF-Extract-Kit
Users that are interested in PDF-Extract-Kit are comparing it to the libraries listed below
Sorting:
- Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.☆54,870Updated this week
- Convert PDF to markdown + JSON quickly with high accuracy☆31,857Feb 9, 2026Updated 2 weeks ago
- OCR, layout analysis, reading order, table recognition in 90+ languages☆19,360Updated this week
- DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception☆2,017Apr 14, 2025Updated 10 months ago
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model☆8,084Feb 10, 2025Updated last year
- Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks☆6,800Dec 12, 2025Updated 2 months ago
- Using GPT to parse PDF☆3,562Apr 17, 2025Updated 10 months ago
- A modular graph-based Retrieval-Augmented Generation (RAG) system☆31,031Feb 20, 2026Updated last week
- Toolkit for linearizing PDFs for LLM datasets/training☆16,947Feb 19, 2026Updated last week
- 🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)☆6,777Jul 4, 2025Updated 7 months ago
- RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to creat…☆73,900Updated this week
- Get your documents ready for gen AI☆54,094Updated this week
- ☆548Jul 26, 2024Updated last year
- Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean…☆14,074Updated this week
- OCR & Document Extraction using vision models☆12,144May 20, 2025Updated 9 months ago
- Universal memory layer for AI Agents☆47,994Updated this week
- An open-source RAG-based tool for chatting with your documents.☆25,152Jul 4, 2025Updated 7 months ago
- Data annotation toolbox supports image, audio and video data.☆1,503Oct 1, 2025Updated 5 months ago
- UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition☆458Sep 28, 2025Updated 5 months ago
- A High-efficiency Open-source Toolkit for Table-to-Latex Task☆275Dec 6, 2025Updated 2 months ago
- An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.☆27,918Sep 30, 2025Updated 5 months ago
- Python tool for converting files and office documents to Markdown.☆87,527Feb 20, 2026Updated last week
- The Open-Source Data Annotation Platform☆1,181Feb 19, 2025Updated last year
- Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)☆67,659Updated this week
- Question and Answer based on Anything.☆13,859Mar 24, 2025Updated 11 months ago
- 🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN☆60,971Updated this week
- Implementation of Nougat Neural Optical Understanding for Academic Documents☆9,848Feb 21, 2025Updated last year
- Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek, Qwen, Llama, Gemma, TTS 2x faster with 70% less VRAM.☆52,724Updated this week
- A Gemini 2.5 Flash Level MLLM for Vision, Speech, and Full-Duplex Multimodal Live Streaming on Your Phone☆23,942Updated this week
- Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/…☆71,012Feb 16, 2026Updated last week
- Retrieval and Retrieval-augmented LLMs☆11,329Dec 15, 2025Updated 2 months ago
- File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.☆7,278Feb 21, 2025Updated last year
- Production-ready platform for agentic workflow development.☆130,029Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆71,234Updated this week
- Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing a…☆37,083Updated this week
- The programming language for agentic software. Build, run, and manage multi-agent systems at scale.☆38,104Updated this week
- LlamaIndex is the leading document agent and OCR platform☆47,210Updated this week
- SOTA Open Source TTS☆24,983Feb 2, 2026Updated 3 weeks ago
- mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding☆2,370May 30, 2025Updated 9 months ago