Extract and convert data from any document (images, PDFs, Word documents, PowerPoint files, or URLs) into multiple formats (Markdown, JSON, CSV, HTML) with intelligent structured data extraction and advanced OCR.
☆1,477 · Oct 31, 2025 · Updated 6 months ago
Alternatives and similar repositories for docstrange
Users that are interested in docstrange are comparing it to the libraries listed below.
- An on-premises, OCR-free unstructured data extraction, markdown conversion and benchmarking toolkit. (https://idp-leaderboard.org/) ☆2,018 · Mar 17, 2026 · Updated 2 months ago
- ☆1,364 · Updated this week
- MAESTRO is an AI-powered research application designed to streamline complex research tasks. ☆1,492 · Apr 16, 2026 · Updated last month
- Modern, type-safe, zero-dependency Python library for serial port I/O access ☆23 · Dec 16, 2025 · Updated 5 months ago
- CLI tool for managing Model Context Protocol (MCP) servers in one place and using them across different clients ☆25 · Apr 23, 2025 · Updated last year
- Add AI capabilities to your file system using Ollama, Groq, OpenAI, and other APIs ☆210 · Jan 4, 2025 · Updated last year
- walterra's collections of helpers for agentic coding ☆34 · Mar 23, 2026 · Updated 2 months ago
- Get your documents ready for gen AI ☆59,909 · Updated this week
- The most accurate document search and store for building AI apps ☆3,599 · May 11, 2026 · Updated last week
- A next-gen Python plotting library with SVG-first rendering, interactivity, themes, and clean defaults, better than matplotlib.pyplot ☆213 · Apr 30, 2026 · Updated 3 weeks ago
- A Multi-Agentic AI Assistant/Builder ☆27 · May 15, 2026 · Updated last week
- Simple CLI for managing Postgres databases in Flask. ☆21 · May 26, 2024 · Updated last year
- A simple PDF viewer created with PyQt6 that you can use by itself or incorporate in other scripts. Hard to find! ☆17 · Mar 6, 2025 · Updated last year
- The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025. ☆8,983 · Mar 25, 2026 · Updated last month
- 🔍📃 LLM-powered PDF Table Extractor ☆19 · Jun 26, 2025 · Updated 10 months ago
- ContextGem: Effortless LLM extraction from documents ☆1,844 · May 7, 2026 · Updated 2 weeks ago
- Plano is an AI-native proxy and data plane for agentic apps, with built-in orchestration, safety, observability, and smart LLM routing s… ☆6,483 · May 15, 2026 · Updated last week
- Enable tool/function calling for any LLM, in OpenAI and Ollama API formats, adding universal function calling to models without native su… ☆76 · Dec 9, 2025 · Updated 5 months ago
- Speakr is a personal, self-hosted web application designed for transcribing audio recordings ☆3,114 · May 9, 2026 · Updated 2 weeks ago
- Search movies using RAG and LLMs ☆19 · Sep 4, 2024 · Updated last year
- An MCP server allowing LLM agents to easily connect to and retrieve data from any database ☆99 · Aug 1, 2025 · Updated 9 months ago
- IPython notebook copy of Andrej Karpathy's llama2.c ☆23 · Sep 5, 2023 · Updated 2 years ago
- ☆39 · Aug 4, 2025 · Updated 9 months ago
- Self-hosted web app that gamifies household chores: RPG-style health bars, coins, streaks & family leaderboard ☆100 · Updated this week
- Build, Evaluate, and Optimize AI Systems. Includes evals, RAG, agents, fine-tuning, synthetic data generation, dataset management, MCP, a… ☆4,843 · Updated this week
- Pyloid: Electron for Python Developers • Modern Web-based desktop app framework ☆510 · Nov 8, 2025 · Updated 6 months ago
- 💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows ☆12,577 · May 12, 2026 · Updated last week
- CoexistAI is a modular, developer-friendly research assistant framework. It enables you to build, search, summarize, and automate resear… ☆483 · Mar 10, 2026 · Updated 2 months ago
- Build Llama inference compute from scratch, using only torch/numpy base ops ☆15 · May 5, 2026 · Updated 2 weeks ago
- Get clean data from tricky documents, powered by vision-language models ⚡ ☆1,528 · Mar 25, 2026 · Updated last month
- An open source, privacy focused alternative to NotebookLM for teams with no data limits. Join our Discord: https://discord.gg/ejRNvftDp9 ☆14,275 · Updated this week
- OCR, layout analysis, reading order, table recognition in 90+ languages ☆19,756 · May 6, 2026 · Updated 2 weeks ago
- Incremental engine for long horizon agents 🌟 Star if you like it! ☆9,941 · Updated this week
- A visual playground for agentic workflows: Iterate over your agents 10x faster ☆5,721 · Jul 20, 2025 · Updated 10 months ago
- Ever been told to RTFM only to find there is no FM to R? MCP-RTFM helps you CREATE the F*ing Manual that people keep telling everyone to … ☆35 · Apr 20, 2026 · Updated last month
- Open source tool for transcription and subtitling, an alternative to happyscribe. ☆35 · Feb 12, 2025 · Updated last year
- LLM-Driven Extraction of Unstructured Data, Built for API Deployments & ETL Pipeline Workflows ☆6,585 · Updated this week
- Build Real-Time Knowledge Graphs for AI Agents ☆26,122 · May 14, 2026 · Updated last week
- Awesome AI Benchmarks ☆32 · Jan 16, 2026 · Updated 4 months ago