CatchTheTornado / pdf-extract-api

Document (PDF) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents. Remove PII. Convert any document or picture to structured JSON or Markdown

☆1,291

Related projects ⓘ

Alternatives and complementary repositories for pdf-extract-api

Nutlope / llama-ocr
Document to Markdown OCR library with Llama 3.2 vision
☆1,345Updated last week
lumina-ai-inc / chunkr
Vision model based document ingestion
☆1,242Updated this week
lamm-mit / PDF2Audio
☆1,083Updated last month
itsOwen / CyberScraper-2077
A Powerful web scraper powered by LLM | OpenAI, Gemini & Ollama
☆1,375Updated last week
felladrin / awesome-ai-web-search
A list of software that allows searching the web with the assistance of AI.
☆413Updated this week
SouthBridgeAI / offmute
An experiment in meeting transcription and diarization with just an LLM. Maybe I went a little overboard though
☆320Updated 2 weeks ago
yigitkonur / swift-ocr-llm-powered-pdf-to-markdown
An open-source OCR API that leverages OpenAI's powerful language models with optimized performance techniques like parallel processing an…
☆695Updated last month
fynnfluegge / rocketnotes
LLM-powered Markdown editor
☆1,043Updated this week
VikParuchuri / tabled
Detect and extract tables to markdown and csv
☆633Updated this week
theredsix / cerebellum
Browser automation system that uses AI-driven planning to navigate web pages and perform goals.
☆593Updated this week
bRAGAI / bRAG-langchain
Everything you need to know to build your own RAG application
☆625Updated this week
hinthornw / promptimizer
Prompt optimization scratch
☆413Updated this week
RamiAwar / dataline
Chat with your data - AI data analysis and visualization on CSV, Postgres, MySQL, Snowflake, SQLite...
☆903Updated this week
Nutlope / napkins
napkins.dev – from screenshot to app
☆963Updated last month
Nutlope / llamatutor
An AI personal tutor built with Llama 3.1
☆1,381Updated 3 months ago
YassKhazzan / openperplex_backend_os
openperplex is an opensource AI search engine
☆755Updated 3 months ago
supermemoryai / opensearch-ai
SearchGPT / Perplexity clone, but personalised for you.
☆948Updated 3 months ago
muratcankoylan / AI-Investigator
An automated AI system (Python framework) designed to analyze any type of website content and generate structured reports using Claude 3.…
☆345Updated 2 weeks ago
run-llama / create-llama
The easiest way to get started with LlamaIndex
☆1,044Updated this week
supermemoryai / markdowner
A fast tool to convert any website into LLM-ready markdown data. Built by https://supermemory.ai
☆922Updated 4 months ago
clemlesne / scrape-it-now
Web scraper made for AI and simplicity in mind. It runs as a CLI that can be parallelized and outputs high-quality markdown content.
☆487Updated this week
microsoft / data-formulator
🪄 Create rich visualizations with AI
☆1,326Updated last week
zaidmukaddam / miniperplx
A minimalistic AI-powered search engine that helps you find information on the internet. Powered by Vercel AI SDK Search with models like…
☆530Updated this week
gabrielchua / open-notebooklm
Convert any PDF into a podcast episode!
☆1,511Updated 2 weeks ago
e2b-dev / e2b-cookbook
Examples of using E2B
☆738Updated this week
QuivrHQ / MegaParse
File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.
☆681Updated this week
yoheinakajima / prettygraph
An experimental UI for text-to-knowledge-graph generation
☆746Updated 6 months ago
adithya-s-k / marker-api
Easily deployable 🚀 API to convert PDF to markdown quickly with high accuracy.
☆762Updated last month
misbahsy / meetingmind
AI Meeting Minutes analysis App built with NextJS, Langflow, Groq, and OpenAI
☆349Updated last month