iamarunbrahma / pdf-to-markdownLinks
Conversion of PDF documents to structured Markdown, optimized for Retrieval Augmented Generation (RAG) and other NLP tasks. Extract text, tables, and images with preserved formatting for enhanced information retrieval and processing.
☆84Updated 7 months ago
Alternatives and similar repositories for pdf-to-markdown
Users that are interested in pdf-to-markdown are comparing it to the libraries listed below
Sorting:
- A fun project where I use the power of AI to analyze a PDF. The AI extracts key information based on the user's instructions and selectio…☆72Updated 8 months ago
- A set of re-usable AI agent for document processing☆91Updated 5 months ago
- PDF intelligence platform combining IBM Docling for document processing, LlamaIndex for data structuring, and Streamlit for a powerful UI…☆44Updated 5 months ago
- LangGraph-GUI backend with fastapi☆56Updated 3 weeks ago
- Collection of PDF parsing libraries like AI based docling, claude, openai, llama-vision, unstructured-io, and pdfminer, pymupdf, pdfplumb…☆92Updated this week
- Chat with PDF files with source highlights☆142Updated 6 months ago
- Visual node-edge graph GUI editor for LangGraph and run with local LLM or online API☆185Updated 2 weeks ago
- It is a API responsible to intermediate the comunication between Langflow and Streamlit applications.☆32Updated 10 months ago
- Chat with your Documents(PDF, TXT, DOCX, ODT, PPTX etc), Websites and Youtube Chat too!, CSV files. Uses langchain, Ollama, Groq, Gemini,…☆55Updated last year
- PyMuPDF4LLM for Data Extraction. Build better and efficient RAG.☆34Updated 8 months ago
- An advanced retrieval system that combines semantic vector search with token-based search, using contextual chunking and knowledge graphs…☆37Updated 8 months ago
- Enhanced MCP server for deep web research☆67Updated 3 months ago
- An open source, Gradio-based chatbot app that combines the best of retrieval augmented generation and prompt engineering into an intellig…☆54Updated 10 months ago
- Corrective RAG demo powerd by Ollama☆100Updated last year
- Reliable RAG setup that uses Semantic Double Merging Chunking from llamaindex, Qdrant Hybrid Search, colBERT for reranking and Google Gem…☆39Updated 6 months ago
- ☆89Updated last year
- A MCP server connecting to managed indexes on LlamaCloud☆78Updated this week
- ☆38Updated last year
- Human-AI collaboration to produce a newstory about a meeting from minutes or transcript☆193Updated 6 months ago
- Groqqle is a powerful web search and content summarization tool built with Python, leveraging Groq's LLM API for advanced natural languag…☆144Updated 3 months ago
- An agentic AI application that allows you to chat with your papers and gather also information from papers on ArXiv and on PubMed☆135Updated last month
- Streamlined ingest using unstructured.io calls to partition, enrich and the chunk a complex PDF☆14Updated 7 months ago
- Simple Chainlit UI for running llms locally using Ollama and LangChain☆119Updated 10 months ago
- ☆85Updated 7 months ago
- Example Pipelines for Open-WebUI☆76Updated 3 months ago
- Open Deep Researcher with openai compatible endpoint, now completely local with ollama, local playwright via searxng with citations and p…☆118Updated 2 months ago
- like firecrawl.dev but free☆46Updated 3 months ago
- Docling with Ollama - RAG on Local Files with Local Models☆67Updated 5 months ago
- A tool for querying and interacting with PDF documents using AI. This application uses natural language processing to provide contextuall…☆113Updated 3 months ago
- ☆67Updated 6 months ago