iamarunbrahma / pdf-to-markdown
Conversion of PDF documents to structured Markdown, optimized for Retrieval Augmented Generation (RAG) and other NLP tasks. Extract text, tables, and images with preserved formatting for enhanced information retrieval and processing.
☆103 · Updated last year
Alternatives and similar repositories for pdf-to-markdown
Users who are interested in pdf-to-markdown are comparing it to the libraries listed below.
- Collection of PDF parsing libraries like AI-based docling, claude, openai, gemini, meta's llama-vision, unstructured-io, and pdfminer, py… ☆158 · Updated 3 months ago
- like firecrawl.dev but free ☆50 · Updated 9 months ago
- A Python package for converting PDFs to markdown while extracting images and tables, generate descriptive text descriptions for extracted… ☆174 · Updated 5 months ago
- Chat with PDF files with source highlights ☆149 · Updated last year
- PDF intelligence platform combining IBM Docling for document processing, LlamaIndex for data structuring, and Streamlit for a powerful UI… ☆51 · Updated 11 months ago
- An Automated AI-Powered Prompt Optimization Framework ☆207 · Updated last year
- MCP server for enabling LLM applications to perform deep research via the MCP protocol ☆289 · Updated last month
- ☆73 · Updated last year
- WebRAgent is a retrieval-augmented generation (RAG) web application featuring agent-based query decomposition, vector search with Qdrant,… ☆53 · Updated 8 months ago
- A Multi-Agent AI Tool that creates beautiful presentations with voice-overs 🎦🔥 ☆181 · Updated 9 months ago
- A framework for agentic workflow creation and deployment ☆254 · Updated 10 months ago
- A set of re-usable AI agents for document processing ☆97 · Updated 11 months ago
- Parse PDFs into markdown using Vision LLMs ☆449 · Updated 2 months ago
- Using GPT-4 Vision and GPT-4 Turbo, take a PDF as input and get a markdown file as output. ☆98 · Updated 10 months ago
- ☆89 · Updated last year
- An alternative AI assistant for Microsoft Office that works with your favorite LLM API ☆78 · Updated this week
- A fun project where I use the power of AI to analyze a PDF. The AI extracts key information based on the user's instructions and selectio… ☆84 · Updated last year
- Graphy v1: A Realtime GraphRAG App using Langchain, Neo4j, GPT-4o, and Streamlit. ☆69 · Updated last year
- Workflows are an event-driven, async-first, step-based way to control the execution flow of AI applications like agents. ☆277 · Updated this week
- Generate full-fledged PDF reports using LLMs like GPT, Claude, Llama ☆64 · Updated last year
- Custom Websearch Agent Built with Local Models, vLLM, and OpenAI ☆135 · Updated last year
- The Langflow Embedded Chat is a powerful web component that enables seamless communication with Langflow ☆217 · Updated 2 weeks ago
- Collection of rivet examples to get you going! (scroll down for more information) ☆65 · Updated last year
- A simple script that can run in the background, uses the whisper API to transcribe text into ANY application ☆94 · Updated last year
- ☆104 · Updated this week
- AI Document Assistant ☆88 · Updated 6 months ago
- A VSCode extension that generates markdown documentation from web pages and GitHub repositories. ☆56 · Updated 10 months ago
- Corrective RAG demo powered by Ollama ☆108 · Updated last year
- A document analysis tool built with Streamlit and Microsoft MarkItDown. Extract and analyze content from multiple document formats with o… ☆65 · Updated 11 months ago
- Sample .prompt files to use with Continue ☆107 · Updated last year