PDF Parser for AI-ready data. Automate PDF accessibility. Open-source.
☆11,082 Apr 3, 2026 Updated this week
Alternatives and similar repositories for opendataloader-pdf
Users that are interested in opendataloader-pdf are comparing it to the libraries listed below.
- Python tool for converting files and office documents to Markdown. ☆93,259 Mar 30, 2026 Updated last week
- A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive vi… ☆35,501 Updated this week
- Get your documents ready for gen AI ☆57,163 Updated this week
- Toolkit for linearizing PDFs for LLM datasets/training ☆17,100 Mar 25, 2026 Updated 2 weeks ago
- OpenViking is an open-source context database designed specifically for AI Agents (such as openclaw). OpenViking unifies the management of… ☆21,768 Updated this week
- 🕷️ An adaptive Web Scraping framework that handles everything from a single request to a full-scale crawl! ☆34,570 Updated this week
- Open-source AI coworker, with memory ☆9,355 Apr 3, 2026 Updated last week
- [MLsys2026]: RAG on Everything with LEANN. Enjoy 97% storage savings while running a fast, accurate, and 100% private RAG application on … ☆10,743 Updated this week
- Convert PDF to markdown + JSON quickly with high accuracy ☆33,352 Updated this week
- Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows. ☆58,131 Apr 3, 2026 Updated last week
- 🌐 Make websites accessible for AI agents. Automate tasks online with ease. ☆86,467 Updated this week
- OCR, layout analysis, reading order, table recognition in 90+ languages ☆19,557 Updated this week
- Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini and opensource models. ☆104,291 Apr 1, 2026 Updated last week
- An agentic skills framework & software development methodology that works. ☆135,360 Apr 2, 2026 Updated last week
- An open-source long-horizon SuperAgent harness that researches, codes, and creates. With the help of sandboxes, memories, tools, skill, s… ☆59,375 Updated this week
- Build, deploy, and orchestrate AI agents. Sim is the central intelligence layer for your AI workforce. ☆27,472 Apr 3, 2026 Updated last week
- 📑 PageIndex: Document Index for Vectorless, Reasoning-based RAG ☆24,755 Updated this week
- 🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN ☆63,500 Updated this week
- 🔥 The Web Data API for AI - Power AI agents with clean web data ☆104,217 Updated this week
- An open-source RAG-based tool for chatting with your documents. ☆25,251 Apr 3, 2026 Updated last week
- A Claude Code plugin that automatically captures everything Claude does during your coding sessions, compresses it with AI (using Claude'… ☆45,901 Updated this week
- Multilingual Document Layout Parsing in a Single Vision-Language Model ☆8,183 Mar 24, 2026 Updated 2 weeks ago
- Universal memory layer for AI Agents ☆52,137 Updated this week
- OCR model that handles complex tables, forms, handwriting with full layout. ☆8,268 Mar 18, 2026 Updated 3 weeks ago
- Lightpanda: the headless browser designed for AI and automation ☆27,576 Updated this week
- FULL Augment Code, Claude Code, Cluely, CodeBuddy, Comet, Cursor, Devin AI, Junie, Kiro, Leap.new, Lovable, Manus, NotionAI, Orchids.app,… ☆134,174 Mar 28, 2026 Updated last week
- mini cli search engine for your docs, knowledge bases, meeting notes, whatever. Tracking current sota approaches while being all local ☆17,547 Mar 30, 2026 Updated last week
- Build Real-Time Knowledge Graphs for AI Agents ☆24,507 Updated this week
- A lightweight, lightning-fast, in-process vector database ☆9,226 Apr 3, 2026 Updated last week
- 🪄 Create rich visualizations with AI ☆15,203 Updated this week
- File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs. ☆7,343 Feb 21, 2025 Updated last year
- Build, run, manage agentic software at scale. ☆39,153 Updated this week
- The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025. ☆8,888 Mar 25, 2026 Updated 2 weeks ago
- Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing a… ☆42,652 Updated this week
- A set of ready to use Agent Skills for research, science, engineering, analysis, finance and writing. ☆17,305 Updated this week
- The agent that grows with you ☆34,947 Updated this week
- Knowledge Engine for AI Agent Memory in 6 lines of code ☆14,989 Updated this week
- An open source, privacy focused alternative to NotebookLM for teams with no data limits. Join our Discord: https://discord.gg/ejRNvftDp9 ☆13,637 Apr 3, 2026 Updated last week
- Open Source AI Platform - AI Chat with advanced features that works with every LLM ☆25,551 Updated this week