iamarunbrahma / pdf-to-markdownLinks
Conversion of PDF documents to structured Markdown, optimized for Retrieval Augmented Generation (RAG) and other NLP tasks. Extract text, tables, and images with preserved formatting for enhanced information retrieval and processing.
☆79Updated 6 months ago
Alternatives and similar repositories for pdf-to-markdown
Users that are interested in pdf-to-markdown are comparing it to the libraries listed below
Sorting:
- Reliable RAG setup that uses Semantic Double Merging Chunking from llamaindex, Qdrant Hybrid Search, colBERT for reranking and Google Gem…☆38Updated 5 months ago
- A set of re-usable AI agent for document processing☆87Updated 5 months ago
- A document analysis tool built with Streamlit and Microsoft MarkItDown. Extract and analyze content from multiple document formats with o…☆52Updated 5 months ago
- AI Document Assistant☆79Updated 2 months ago
- A fun project where I use the power of AI to analyze a PDF. The AI extracts key information based on the user's instructions and selectio…☆72Updated 8 months ago
- Example Pipelines for Open-WebUI☆75Updated 3 months ago
- WebRAgent is a retrieval-augmented generation (RAG) web application featuring agent-based query decomposition, vector search with Qdrant,…☆43Updated 2 months ago
- An advanced retrieval system that combines semantic vector search with token-based search, using contextual chunking and knowledge graphs…☆37Updated 8 months ago
- Parse PDFs into markdown using Vision LLMs☆379Updated 3 months ago
- ☆33Updated 5 months ago
- Adaptive Modular Network (AMN) a potentially novel machine learning architecture capable of producing models which can learn at inference…☆52Updated 2 months ago
- Chat with PDF files with source highlights☆138Updated 6 months ago
- A fully custom chatbot built with Agentic RAG (Retrieval-Augmented Generation), combining Gemini models with a local knowledge base for a…☆142Updated 3 months ago
- ☆48Updated 11 months ago
- Ingesting GraphRAG from microsoft into Neo4j for local visualisation. Using their Local and Global search and comparing the results in a …☆27Updated 7 months ago
- ☆59Updated 4 months ago
- ☆41Updated 2 months ago
- Long-Term Memory & Context Management for LLMs☆44Updated last week
- Automate complex business workflows with our Multi-AI-Agent Systems using crewAI. This framework leverages autonomous, role-specific AI a…☆94Updated last year
- A curated list of tools related to notebooklm as well as examples of great podcasts generated by notebooklm☆63Updated 7 months ago
- LangGraph-GUI backend with fastapi☆54Updated last week
- A MCP server connecting to managed indexes on LlamaCloud☆76Updated last month
- An open source, Gradio-based chatbot app that combines the best of retrieval augmented generation and prompt engineering into an intellig…☆53Updated 10 months ago
- A sophisticated biologically inspired memory system for AI agents. Provides organic, high quality, persistent memory with self-maintenanc…☆42Updated last week
- A MCP Server that will download any webpage as markdown in an instant. Download docs straight to your IDE for AI context. Powered by Jina…☆26Updated 2 weeks ago
- Contextual Doc Retrieval is a Python-based system leveraging OpenAI GPT-4o and Cohere for re-ranking and query expansion, combined with B…☆47Updated 7 months ago
- Open-Source RAG app with LLM Observability (Langfuse), support for 100+ providers (LiteLLM), Dockerized, Full Type-checking, 100% Test co…☆154Updated 3 months ago
- Example Agent Applications by Relari.ai☆62Updated 2 months ago
- An extension that lets the AI take the wheel, allowing it to use the mouse and keyboard, recognize UI elements, and prompt itself :3...no…☆120Updated 7 months ago
- learning resource of langgraph for dummy☆113Updated 4 months ago