NanoNets / docstrangeLinks
Extract and convert data from any document, images, pdfs, word doc, ppt or URL into multiple formats (Markdown, JSON, CSV, HTML) with intelligent structured data extraction and advanced OCR.
☆1,335Updated 3 months ago
Alternatives and similar repositories for docstrange
Users that are interested in docstrange are comparing it to the libraries listed below
Sorting:
- MAESTRO is an AI-powered research application designed to streamline complex research tasks.☆1,433Updated 3 months ago
- CommonForms — open models to auto-detect PDF form fields☆961Updated 2 months ago
- The easiest way to build apps from your Python code☆582Updated 3 months ago
- ContextGem: Effortless LLM extraction from documents☆1,777Updated last month
- A fully open-source, LlamaCloud-backed alternative to NotebookLM☆1,764Updated 5 months ago
- Open-source spreadsheets platform for deep research and document processing☆368Updated 4 months ago
- ☆1,089Updated 3 months ago
- Turn your data into shareable RAG apps in minutes. All in pure Markdown. Zero boilerplate.☆817Updated last week
- An open-source Text2SQL tool that transforms natural language into SQL using graph-powered schema understanding. Ask your database questi…☆319Updated this week
- Building blocks for rapid development of GenAI applications☆1,615Updated this week
- Build, enrich, and transform datasets using AI models with no code☆1,623Updated 3 months ago
- An on-premises, OCR-free unstructured data extraction, markdown conversion and benchmarking toolkit. (https://idp-leaderboard.org/)☆1,851Updated 5 months ago
- AI prompt engineering workbench for crafting, testing, and systematically evaluating prompts with powerful analysis tools.☆769Updated 6 months ago
- ☆431Updated last month
- ➖ Stripped down, stable version of firecrawl optimized for self-hosting and ease of contribution. Billing logic and AI features are compl…☆629Updated 8 months ago
- Tensorlake is a Document Ingestion API and a serverless platform for building data processing and orchestration APIs☆878Updated this week
- Open-source AI-powered data science platform.☆491Updated 3 months ago
- Graph powered context harness for AI agents☆1,161Updated this week
- Chat with your data - with memory, rules, and observability built in. Deploy in 2 minutes☆412Updated this week
- ✨ AI interface for tinkerers (Ollama, Haystack RAG, Python)☆481Updated 5 months ago
- OCR Benchmark☆613Updated 3 months ago
- A list of AI memory projects☆632Updated last year
- Laddr is a python framework for building multi-agent systems where agents communicate, delegate tasks, and execute work in parallel. Thin…☆337Updated 2 months ago
- RAGLight is a modular framework for Retrieval-Augmented Generation (RAG). It makes it easy to plug in different LLMs, embeddings, and vec…☆636Updated 3 weeks ago
- Colivara is a suite of services that allows you to store, search, and retrieve documents based on their visual embedding. ColiVara has st…☆1,448Updated 9 months ago
- 🤖 An open-source AI assistant answering questions using your docs☆243Updated last month
- Python package and backend for the Elysia platform app.☆1,876Updated last week
- Edit PDF files with Nano Banana☆1,030Updated 2 months ago
- Local debugging agent that runs in your terminal☆396Updated 8 months ago
- Simple AI mind mapping for research, right in your browser☆239Updated 2 weeks ago