Extract and convert data from any document (images, PDFs, Word documents, PowerPoint files, or URLs) into multiple formats (Markdown, JSON, CSV, HTML) with intelligent structured data extraction and advanced OCR.
☆1,412 · Oct 31, 2025 · Updated 5 months ago
Alternatives and similar repositories for docstrange
Users that are interested in docstrange are comparing it to the libraries listed below.
- An on-premises, OCR-free unstructured data extraction, markdown conversion and benchmarking toolkit. (https://idp-leaderboard.org/) ☆1,960 · Mar 17, 2026 · Updated last month
- CLI tool for managing Model Context Protocol (MCP) servers in one place & using them across different clients ☆25 · Apr 23, 2025 · Updated 11 months ago
- ☆1,333 · Updated this week
- MAESTRO is an AI-powered research application designed to streamline complex research tasks. ☆1,456 · Updated this week
- Modern, type-safe, zero-dependency Python library for serial port I/O access ☆23 · Dec 16, 2025 · Updated 4 months ago
- Add AI capabilities to your file system using Ollama, Groq, OpenAI, and other APIs ☆208 · Jan 4, 2025 · Updated last year
- The most accurate document search and store for building AI apps ☆3,573 · Apr 2, 2026 · Updated 2 weeks ago
- Get your documents ready for gen AI ☆57,709 · Updated this week
- A next-gen Python plotting library with SVG-first rendering, interactivity, themes, and clean defaults — better than matplotlib.pyplot ☆210 · Updated this week
- Senna is an advanced AI-powered search engine designed to provide users with immediate answers to their queries by leveraging natural lan… ☆19 · Sep 5, 2024 · Updated last year
- A Multi-Agentic AI Assistant/Builder ☆26 · Jan 23, 2026 · Updated 2 months ago
- ☆33 · Jun 15, 2025 · Updated 10 months ago
- Simple CLI for managing Postgres databases in Flask. ☆21 · May 26, 2024 · Updated last year
- Plugboard is an event driven modelling and orchestration framework in Python for simulating and driving complex processes with many inter… ☆28 · Updated this week
- 🔍📃 LLM-powered PDF Table Extractor ☆19 · Jun 26, 2025 · Updated 9 months ago
- Speakr is a personal, self-hosted web application designed for transcribing audio recordings ☆2,937 · Mar 19, 2026 · Updated last month
- The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025. ☆8,886 · Mar 25, 2026 · Updated 3 weeks ago
- Plano is an AI-native proxy and data plane for agentic apps — with built-in orchestration, safety, observability, and smart LLM routing s… ☆6,287 · Apr 10, 2026 · Updated last week
- Enable tool/function calling for any LLM, in OpenAI and Ollama API formats, adding universal function calling to models without native su… ☆76 · Dec 9, 2025 · Updated 4 months ago
- ContextGem: Effortless LLM extraction from documents ☆1,824 · Mar 16, 2026 · Updated last month
- Search movies using RAG and LLMs ☆19 · Sep 4, 2024 · Updated last year
- An MCP server allowing LLM agents to easily connect and retrieve data from any database ☆99 · Aug 1, 2025 · Updated 8 months ago
- CoexistAI is a modular, developer-friendly research assistant framework. It enables you to build, search, summarize, and automate resear… ☆467 · Mar 10, 2026 · Updated last month
- Self-hosted web app that gamifies household chores — RPG-style health bars, coins, streaks & family leaderboard ☆90 · Apr 7, 2026 · Updated last week
- ☆39 · Aug 4, 2025 · Updated 8 months ago
- Pyloid: Electron for Python Developer • Modern Web-based desktop app framework ☆511 · Nov 8, 2025 · Updated 5 months ago
- Build, Evaluate, and Optimize AI Systems. Includes evals, RAG, agents, fine-tuning, synthetic data generation, dataset management, MCP, a… ☆4,755 · Updated this week
- 💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows ☆12,395 · Apr 8, 2026 · Updated last week
- Data transformation framework for AI. Ultra performant, with incremental processing. 🌟 Star if you like it! ☆6,889 · Updated this week
- Get clean data from tricky documents, powered by vision-language models ⚡ ☆1,522 · Mar 25, 2026 · Updated 3 weeks ago
- A visual playground for agentic workflows: Iterate over your agents 10x faster ☆5,706 · Jul 20, 2025 · Updated 8 months ago
- An open source, privacy focused alternative to NotebookLM for teams with no data limits. Join our Discord: https://discord.gg/ejRNvftDp9 ☆13,844 · Updated this week
- A picky test selector 🧐 ☆66 · Aug 4, 2025 · Updated 8 months ago
- Ever been told to RTFM only to find there is no FM to R? MCP-RTFM helps you CREATE the F*ing Manual that people keep telling everyone to … ☆35 · Apr 8, 2026 · Updated last week
- LLM-Driven Extraction of Unstructured Data — Built for API Deployments & ETL Pipeline Workflows ☆6,543 · Updated this week
- OCR, layout analysis, reading order, table recognition in 90+ languages ☆19,588 · Apr 10, 2026 · Updated last week
- Open source tool for transcription and subtitling, alternative to happyscribe. ☆34 · Feb 12, 2025 · Updated last year
- Build Real-Time Knowledge Graphs for AI Agents ☆24,798 · Apr 8, 2026 · Updated last week
- Awesome AI Benchmarks ☆29 · Jan 16, 2026 · Updated 3 months ago