VikParuchuri / marker
Convert PDF to markdown quickly with high accuracy
☆17,845Updated this week
Related projects ⓘ
Alternatives and complementary repositories for marker
- OCR, layout analysis, reading order, table recognition in 90+ languages☆14,240Updated this week
- Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/☆6,985Updated this week
- RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.☆23,277Updated this week
- Finetune Llama 3.2, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 80% less memory☆18,263Updated this week
- Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sag…☆13,971Updated this week
- 🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.☆18,840Updated this week
- DSPy: The framework for programming—not prompting—language models☆18,885Updated this week
- Get your documents ready for gen AI☆9,923Updated this week
- A modular graph-based Retrieval-Augmented Generation (RAG) system☆19,247Updated this week
- Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.☆9,182Updated this week
- The Memory layer for your AI apps☆22,875Updated this week
- An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.☆13,436Updated 3 weeks ago
- Build AI Agents with memory, knowledge, tools and reasoning. Chat with them using a beautiful Agent UI.☆15,471Updated this week
- The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on…☆25,925Updated this week
- LLocalSearch is a completely locally running search aggregator using LLM Agents. The user can ask a question and the system will use a ch…☆5,677Updated 2 months ago
- PDF to Markdown with vision models☆6,324Updated this week
- LlamaIndex is a data framework for your LLM applications☆36,820Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆30,423Updated this week
- Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks☆5,648Updated 2 weeks ago
- Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI☆15,527Updated this week
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆10,734Updated last week
- A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。☆17,775Updated this week
- Gen-AI Chat for Teams - Think ChatGPT if it had access to your team's unique knowledge.☆10,666Updated this week
- An open-source RAG-based tool for chatting with your documents.☆17,436Updated this week
- Python scraper based on AI☆15,802Updated this week
- The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, and more.☆27,384Updated this week
- 🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using RAG 🔄.☆11,989Updated last week
- Retrieval Augmented Generation (RAG) chatbot powered by Weaviate☆6,347Updated last week
- Go ahead and axolotl questions☆7,930Updated this week
- the AI-native open-source embedding database☆15,448Updated this week