iamarunbrahma / pdf-to-markdownLinks
Conversion of PDF documents to structured Markdown, optimized for Retrieval Augmented Generation (RAG) and other NLP tasks. Extract text, tables, and images with preserved formatting for enhanced information retrieval and processing.
☆92Updated 9 months ago
Alternatives and similar repositories for pdf-to-markdown
Users that are interested in pdf-to-markdown are comparing it to the libraries listed below
Sorting:
- ☆67Updated 9 months ago
- Collection of PDF parsing libraries like AI based docling, claude, openai, gemini, meta's llama-vision, unstructured-io, and pdfminer, py…☆133Updated last week
- Long-Term Memory & Context Management for LLMs☆66Updated 2 weeks ago
- An advanced retrieval system that combines semantic vector search with token-based search, using contextual chunking and knowledge graphs…☆40Updated 11 months ago
- Chat with PDF files with source highlights☆145Updated 8 months ago
- Groqqle is a powerful web search and content summarization tool built with Python, leveraging Groq's LLM API for advanced natural languag…☆146Updated 5 months ago
- like firecrawl.dev but free☆48Updated 6 months ago
- A set of re-usable AI agent for document processing☆93Updated 8 months ago
- Reliable RAG setup that uses Semantic Double Merging Chunking from llamaindex, Qdrant Hybrid Search, colBERT for reranking and Google Gem…☆41Updated 8 months ago
- MCP server for enabling LLM applications to perform deep research via the MCP protocol☆235Updated 2 months ago
- PDF intelligence platform combining IBM Docling for document processing, LlamaIndex for data structuring, and Streamlit for a powerful UI…☆48Updated 8 months ago
- Corrective RAG demo powerd by Ollama☆104Updated last year
- an AI interaction tool with RAG hybrid search, conversation context, web content processing and structured data analysis with LLM / GPT☆200Updated 2 months ago
- A framework for agentic workflow creation and deployment☆250Updated 7 months ago
- An Automated AI-Powered Prompt Optimization Framework☆199Updated last year
- AI Document Assistant☆82Updated 2 months ago
- A Python package for converting PDFs to markdown while extracting images and tables, generate descriptive text descriptions for extracted…☆151Updated 2 months ago
- ☆111Updated last year
- Chat with your Documents(PDF, TXT, DOCX, ODT, PPTX etc), Websites and Youtube Chat too!, CSV files. Uses langchain, Ollama, Groq, Gemini,…☆55Updated last year
- LangGraph-GUI backend with fastapi☆59Updated 3 months ago
- CoexistAI is a modular, developer-friendly research assistant framework . It enables you to build, search, summarize, and automate resear…☆220Updated this week
- Automatically generate engaging AI podcasts from nothing but an episode title.☆126Updated last month
- Docling with Ollama - RAG on Local Files with Local Models☆73Updated 8 months ago
- Find your files with natural language and ask questions.☆51Updated last week
- Open Deep Researcher with openai compatible endpoint, now completely local with ollama, local playwright via searxng with citations and p…☆136Updated 5 months ago
- Dabarqus is incredibly fast RAG that runs everywhere.☆60Updated 7 months ago
- Vibe-coding tools for the LlamaIndex ecosystem☆71Updated this week
- ✨ AI interface for tinkerers (Ollama, Haystack RAG, Python)☆468Updated 2 months ago
- Multimodal Assistant. Human Interface for computers.☆104Updated 10 months ago
- A document analysis tool built with Streamlit and Microsoft MarkItDown. Extract and analyze content from multiple document formats with o…☆60Updated 8 months ago