iamarunbrahma / pdf-to-markdown
Conversion of PDF documents to structured Markdown, optimized for Retrieval Augmented Generation (RAG) and other NLP tasks. Extract text, tables, and images with preserved formatting for enhanced information retrieval and processing.
☆105Updated last year
Alternatives and similar repositories for pdf-to-markdown
Users that are interested in pdf-to-markdown are comparing it to the libraries listed below
- like firecrawl.dev but free☆50Updated 10 months ago
- A document analysis tool built with Streamlit and Microsoft MarkItDown. Extract and analyze content from multiple document formats with o…☆64Updated last year
- Collection of PDF parsing libraries like AI based docling, claude, openai, gemini, meta's llama-vision, unstructured-io, and pdfminer, py…☆159Updated 4 months ago
- A simple script that can run in the background, uses the whisper API to transcribe text into ANY application☆98Updated last year
- LangGraph learning resource for dummies☆147Updated 11 months ago
- Groqqle is a powerful web search and content summarization tool built with Python, leveraging Groq's LLM API for advanced natural languag…☆148Updated 10 months ago
- PDF intelligence platform combining IBM Docling for document processing, LlamaIndex for data structuring, and Streamlit for a powerful UI…☆51Updated last year
- Automatically generate engaging AI podcasts from nothing but an episode title.☆142Updated 5 months ago
- A set of reusable AI agents for document processing☆97Updated last year
- Example Pipelines for Open-WebUI☆83Updated 10 months ago
- A fun project where I use the power of AI to analyze a PDF. The AI extracts key information based on the user's instructions and selectio…☆85Updated last year
- A framework for agentic workflow creation and deployment☆256Updated 11 months ago
- A Python package for converting PDFs to markdown while extracting images and tables, generate descriptive text descriptions for extracted…☆180Updated 6 months ago
- Chat with PDF files with source highlights☆150Updated last year
- SemanticPDF: Drag, Drop, Semantic Search - SemanticPDF is a simple, privacy-focused application that makes it easy to upload a PDF file a…☆77Updated last year
- an AI interaction tool with RAG hybrid search, conversation context, web content processing and structured data analysis with LLM / GPT☆210Updated 6 months ago
- An advanced retrieval system that combines semantic vector search with token-based search, using contextual chunking and knowledge graphs…☆45Updated last year
- GroqCasters is a Python application that generates podcast scripts and corresponding audio using AI technologies. It leverages PocketGroq…☆133Updated last year
- Find your files with natural language and ask questions.☆58Updated last month
- A fork of OpenAI Swarm that supports Groq and Anthropic☆125Updated 10 months ago
- An Automated AI-Powered Prompt Optimization Framework☆208Updated last year
- LangGraph-GUI backend with fastapi☆61Updated 2 months ago
- ☆74Updated last year
- ☆69Updated 11 months ago
- Sample .prompt files to use with Continue☆108Updated last year
- This repository contains custom pipelines developed for the OpenWebUI framework, including advanced workflows such as long-term memory fi…☆80Updated 8 months ago
- Collection of rivet examples to get you going! (scroll down for more information)☆66Updated last year
- Corrective RAG demo powered by Ollama☆109Updated last year
- Declarative framework to build LLM-based applications☆130Updated last year
- A tool for querying and interacting with PDF documents using AI. This application uses natural language processing to provide contextuall…☆126Updated 10 months ago