Extract and convert data from any document, images, pdfs, word doc, ppt or URL into multiple formats (Markdown, JSON, CSV, HTML) with intelligent structured data extraction and advanced OCR.
☆1,383Oct 31, 2025Updated 4 months ago
Alternatives and similar repositories for docstrange
Users that are interested in docstrange are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- An on-premises, OCR-free unstructured data extraction, markdown conversion and benchmarking toolkit. (https://idp-leaderboard.org/)☆1,877Mar 17, 2026Updated last week
- CLI tool for managing Model Context Protocol (MCP) servers in one place & using them across them different clients☆25Apr 23, 2025Updated 11 months ago
- ☆1,290Updated this week
- MAESTRO is an AI-powered research application designed to streamline complex research tasks.☆1,452Oct 12, 2025Updated 5 months ago
- Modern, type-safe, zero-dependency Python library for serial port I/O access☆23Dec 16, 2025Updated 3 months ago
- Add AI capabilities to your file system using Ollama, Groq, OpenAi and other's api☆209Jan 4, 2025Updated last year
- The most accurate document search and store for building AI apps☆3,541Feb 25, 2026Updated 3 weeks ago
- A next-gen Python plotting library with SVG-first rendering, interactivity, themes, and clean defaults — better than matplotlib.pyplot☆206May 5, 2025Updated 10 months ago
- Get your documents ready for gen AI☆56,339Updated this week
- Senna is an advanced AI-powered search engine designed to provide users with immediate answers to their queries by leveraging natural lan…☆19Sep 5, 2024Updated last year
- A Multi-Agentic AI Assistant/Builder☆25Jan 23, 2026Updated 2 months ago
- Simple CLI for managing Postgres databases in Flask.☆21May 26, 2024Updated last year
- 🔍📃 LLM-powered PDF Table Extractor☆19Jun 26, 2025Updated 8 months ago
- Enable tool/function calling for any LLM, in OpenAI and Ollama API formats, adding universal function calling to models without native su…☆71Dec 9, 2025Updated 3 months ago
- The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.☆8,877Dec 17, 2025Updated 3 months ago
- ContextGem: Effortless LLM extraction from documents☆1,815Mar 16, 2026Updated last week
- Plano is an AI-native proxy and data plane for agentic apps — with built-in orchestration, safety, observability, and smart LLM routing s…☆5,996Updated this week
- CoexistAI is a modular, developer-friendly research assistant framework . It enables you to build, search, summarize, and automate resear…☆453Mar 10, 2026Updated 2 weeks ago
- Search movies using RAG and LLMs☆19Sep 4, 2024Updated last year
- Speakr is a personal, self-hosted web application designed for transcribing audio recordings☆2,898Updated this week
- A MCP server allowing LLM agents to easily connect and retrieve data from any database☆99Aug 1, 2025Updated 7 months ago
- Self-hosted web app that gamifies household chores — RPG-style health bars, coins, streaks & family leaderboard☆79Updated this week
- ☆39Aug 4, 2025Updated 7 months ago
- Pyloid: Electron for Python Developer • Modern Web-based desktop app framework☆507Nov 8, 2025Updated 4 months ago
- 💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows☆12,322Updated this week
- VidCrop is a video cropping tool that helps you easily crop video files. You can manually select any part of the video to crop, with an i…☆29Jan 5, 2026Updated 2 months ago
- Build, Evaluate, and Optimize AI Systems. Includes evals, RAG, agents, fine-tuning, synthetic data generation, dataset management, MCP, a…☆4,710Updated this week
- Data transformation framework for AI. Ultra performant, with incremental processing. 🌟 Star if you like it!☆6,571Updated this week
- Open source alternative to NotebookLM for teams. Join our Discord: https://discord.gg/ejRNvftDp9☆13,360Updated this week
- A visual playground for agentic workflows: Iterate over your agents 10x faster☆5,702Jul 20, 2025Updated 8 months ago
- Get clean data from tricky documents, powered by vision-language models ⚡☆1,524Mar 3, 2026Updated 3 weeks ago
- Application for Math formula detection in image/pdf and then recognition☆12Jan 14, 2025Updated last year
- Ever been told to RTFM only to find there is no FM to R? MCP-RTFM helps you CREATE the F*ing Manual that people keep telling everyone to …☆35Feb 18, 2025Updated last year
- OCR, layout analysis, reading order, table recognition in 90+ languages☆19,477Mar 1, 2026Updated 3 weeks ago
- Open source tool for transcirption and subtitling, alternative to happyscribe.☆34Feb 12, 2025Updated last year
- Build Real-Time Knowledge Graphs for AI Agents☆24,063Updated this week
- LLM-Driven Extraction of Unstructured Data — Built for API Deployments & ETL Pipeline Workflows☆6,504Updated this week
- Structured data extraction and instruction calling with ML, LLM and Vision LLM☆5,142Updated this week
- A flexible, adaptive classification system for dynamic text classification☆540Oct 7, 2025Updated 5 months ago