VikParuchuri / marker
Convert PDF to markdown + JSON quickly with high accuracy
☆20,555Updated this week
Alternatives and similar repositories for marker:
Users that are interested in marker are comparing it to the libraries listed below
- OCR, layout analysis, reading order, table recognition in 90+ languages☆16,189Updated this week
- Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sag…☆17,402Updated this week
- Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/☆7,792Updated this week
- 🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.☆25,340Updated this week
- Finetune Llama 3.3, DeepSeek-R1 & Reasoning LLMs 2x faster with 70% less memory! 🦥☆28,154Updated this week
- An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.☆21,760Updated 3 weeks ago
- A modular graph-based Retrieval-Augmented Generation (RAG) system☆22,315Updated this week
- Get your documents ready for gen AI☆20,730Updated this week
- Python scraper based on AI☆17,993Updated this week
- Retrieval Augmented Generation (RAG) chatbot powered by Weaviate☆6,787Updated last week
- Letta (formerly MemGPT) is a framework for creating LLM services with memory.☆14,427Updated this week
- Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚☆22,517Updated this week
- Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks☆6,130Updated 3 months ago
- Agno is a lightweight framework for building multi-modal Agents☆18,785Updated this week
- Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.☆18,642Updated 3 months ago
- A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。☆25,593Updated this week
- Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work t…☆26,435Updated this week
- LlamaIndex is the leading framework for building LLM-powered agents over your data.☆38,847Updated this week
- PDF to Markdown with vision models☆9,491Updated this week
- LLM based autonomous agent that conducts deep local and web research on any topic and generates a long report with citations.☆17,949Updated this week
- 🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using RAG 🔄.☆13,085Updated this week
- Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI☆19,747Updated last week
- DSPy: The framework for programming—not prompting—language models☆21,807Updated this week
- RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.☆34,518Updated this week
- Drag & drop UI to build your customized LLM flow☆35,001Updated this week
- ⏩ Continue is the leading open-source AI code assistant. You can connect any models and any context to build custom autocomplete and chat…☆23,107Updated this week
- Enhanced ChatGPT Clone: Features Agents, DeepSeek, Anthropic, AWS, OpenAI, Assistants API, Azure, Groq, o1, GPT-4o, Mistral, OpenRouter, …☆21,786Updated this week
- A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/aut…☆39,360Updated this week
- An open-source RAG-based tool for chatting with your documents.☆20,946Updated last week
- Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.☆10,101Updated this week