iamarunbrahma / pdf-to-markdownLinks
Conversion of PDF documents to structured Markdown, optimized for Retrieval Augmented Generation (RAG) and other NLP tasks. Extract text, tables, and images with preserved formatting for enhanced information retrieval and processing.
☆101Updated 11 months ago
Alternatives and similar repositories for pdf-to-markdown
Users that are interested in pdf-to-markdown are comparing it to the libraries listed below
Sorting:
- Chat with PDF files with source highlights☆146Updated 11 months ago
- A set of re-usable AI agent for document processing☆97Updated 10 months ago
- ☆72Updated 11 months ago
- Collection of PDF parsing libraries like AI based docling, claude, openai, gemini, meta's llama-vision, unstructured-io, and pdfminer, py…☆151Updated 2 months ago
- like firecrawl.dev but free☆49Updated 8 months ago
- A simple script that can run in the background, uses the whisper API to transcribe text into ANY application☆89Updated last year
- A Python package for converting PDFs to markdown while extracting images and tables, generate descriptive text descriptions for extracted…☆161Updated 4 months ago
- 【Star-crossed coders unite!⭐️】Model Context Protocol (MCP) server implementation providing Google News search capabilities via SerpAPI, w…☆89Updated last week
- A fun project where I use the power of AI to analyze a PDF. The AI extracts key information based on the user's instructions and selectio…☆81Updated last year
- Automatically generate engaging AI podcasts from nothing but an episode title.☆134Updated 3 months ago
- Groqqle is a powerful web search and content summarization tool built with Python, leveraging Groq's LLM API for advanced natural languag…☆147Updated 8 months ago
- MCP server for enabling LLM applications to perform deep research via the MCP protocol☆276Updated this week
- An Automated AI-Powered Prompt Optimization Framework☆204Updated last year
- ☆55Updated last year
- LangGraph-GUI backend with fastapi☆61Updated 3 weeks ago
- A framework for agentic workflow creation and deployment☆253Updated 9 months ago
- A fork of OpenAI Swarm that supports Groq and Anthropic☆124Updated 8 months ago
- An advanced retrieval system that combines semantic vector search with token-based search, using contextual chunking and knowledge graphs…☆44Updated last year
- Automating prompt engineering using AI Agents.☆171Updated 2 months ago
- an AI interaction tool with RAG hybrid search, conversation context, web content processing and structured data analysis with LLM / GPT☆203Updated 4 months ago
- learning resource of langgraph for dummy☆144Updated 9 months ago
- ☆124Updated 8 months ago
- Open Deep Researcher with openai compatible endpoint, now completely local with ollama, local playwright via searxng with citations and p…☆145Updated 7 months ago
- AI Document Assistant☆87Updated 4 months ago
- Simple, Opinionated benchmark for testing the viability of Efficient Language Models (ELMs) for personal use cases.☆48Updated last year
- Corrective RAG demo powerd by Ollama☆108Updated last year
- ☆112Updated last year
- Declarative framework to build LLM-based applications☆128Updated 11 months ago
- Groq goes brrrrr... so had to make a basic Streamlit app you can build upon!☆83Updated 9 months ago
- Turn Webpage to LLM friendly input text. Similar to Firecrawl and Jina Reader API. Makes RAG, AI web scraping, image & webpage links extr…☆227Updated 2 weeks ago