iamarunbrahma / pdf-to-markdown
Conversion of PDF documents to structured Markdown, optimized for Retrieval Augmented Generation (RAG) and other NLP tasks. Extract text, tables, and images with preserved formatting for enhanced information retrieval and processing.
☆94 Updated 10 months ago
Alternatives and similar repositories for pdf-to-markdown
Users who are interested in pdf-to-markdown are comparing it to the libraries listed below.
- An AI interaction tool with RAG hybrid search, conversation context, web content processing and structured data analysis with LLM / GPT ☆201 Updated 3 months ago
- ☆69 Updated 9 months ago
- Chat with PDF files with source highlights ☆146 Updated 9 months ago
- ☆124 Updated 6 months ago
- PDF intelligence platform combining IBM Docling for document processing, LlamaIndex for data structuring, and Streamlit for a powerful UI… ☆49 Updated 8 months ago
- LangGraph-GUI backend with FastAPI ☆59 Updated 3 months ago
- A Python package for converting PDFs to markdown while extracting images and tables, and generating text descriptions for extracted… ☆153 Updated 2 months ago
- Reliable RAG setup that uses Semantic Double Merging Chunking from LlamaIndex, Qdrant Hybrid Search, ColBERT for reranking and Google Gem… ☆41 Updated 9 months ago
- Automatically generate engaging AI podcasts from nothing but an episode title. ☆128 Updated last month
- Chat with your documents (PDF, TXT, DOCX, ODT, PPTX, etc.), CSV files, websites, and YouTube too! Uses langchain, Ollama, Groq, Gemini,… ☆55 Updated last year
- Dabarqus is incredibly fast RAG that runs everywhere. ☆60 Updated 7 months ago
- Corrective RAG demo powered by Ollama ☆106 Updated last year
- A set of reusable AI agents for document processing ☆93 Updated 8 months ago
- This repository contains custom pipelines developed for the OpenWebUI framework, including advanced workflows such as long-term memory fi… ☆75 Updated 4 months ago
- A framework for agentic workflow creation and deployment ☆251 Updated 8 months ago
- A fun project where I use the power of AI to analyze a PDF. The AI extracts key information based on the user's instructions and selectio… ☆79 Updated 11 months ago
- ☆112 Updated last year
- TalkNexus: Ollama Chatbot Multi-Model & RAG Interface ☆62 Updated 6 months ago
- Contextual Doc Retrieval is a Python-based system leveraging OpenAI GPT-4o and Cohere for re-ranking and query expansion, combined with B… ☆49 Updated 11 months ago
- Groqqle is a powerful web search and content summarization tool built with Python, leveraging Groq's LLM API for advanced natural languag… ☆147 Updated 6 months ago
- Collection of PDF parsing libraries like AI-based docling, claude, openai, gemini, meta's llama-vision, unstructured-io, and pdfminer, py… ☆137 Updated 3 weeks ago
- An Automated AI-Powered Prompt Optimization Framework ☆202 Updated last year
- Convert Files / Folders / GitHub Repos Into AI / LLM-ready Files ☆160 Updated 7 months ago
- Co-create a PowerPoint presentation with Generative AI ☆264 Updated last week
- ☆67 Updated 8 months ago
- Vibe-coding tools for the LlamaIndex ecosystem ☆126 Updated last week
- Turn Webpage to LLM friendly input text. Similar to Firecrawl and Jina Reader API. Makes RAG, AI web scraping, image & webpage links extr… ☆217 Updated last month
- Graphy v1: A Realtime GraphRAG App using Langchain, Neo4j, GPT-4o, and Streamlit. ☆67 Updated 11 months ago
- AI Document Assistant ☆85 Updated 3 months ago
- ☆103 Updated 10 months ago