CatchTheTornado / pdf-extract-api
Document (PDF) extraction and parsing API using state-of-the-art OCR engines and Ollama-supported models. Anonymizes documents, removes PII, and converts any document or picture to structured JSON or Markdown.
☆1,291 Updated this week
Related projects
Alternatives and complementary repositories for pdf-extract-api
- Document to Markdown OCR library with Llama 3.2 vision ☆1,345 Updated last week
- Vision model based document ingestion ☆1,242 Updated this week
- ☆1,083 Updated last month
- A powerful web scraper powered by LLM | OpenAI, Gemini & Ollama ☆1,375 Updated last week
- A list of software that allows searching the web with the assistance of AI. ☆413 Updated this week
- An experiment in meeting transcription and diarization with just an LLM. Maybe I went a little overboard though ☆320 Updated 2 weeks ago
- An open-source OCR API that leverages OpenAI's powerful language models with optimized performance techniques like parallel processing an… ☆695 Updated last month
- LLM-powered Markdown editor ☆1,043 Updated this week
- Detect and extract tables to markdown and csv ☆633 Updated this week
- Browser automation system that uses AI-driven planning to navigate web pages and perform goals. ☆593 Updated this week
- Everything you need to know to build your own RAG application ☆625 Updated this week
- Prompt optimization scratch ☆413 Updated this week
- Chat with your data - AI data analysis and visualization on CSV, Postgres, MySQL, Snowflake, SQLite... ☆903 Updated this week
- napkins.dev – from screenshot to app ☆963 Updated last month
- An AI personal tutor built with Llama 3.1 ☆1,381 Updated 3 months ago
- openperplex is an opensource AI search engine ☆755 Updated 3 months ago
- SearchGPT / Perplexity clone, but personalised for you. ☆948 Updated 3 months ago
- An automated AI system (Python framework) designed to analyze any type of website content and generate structured reports using Claude 3.… ☆345 Updated 2 weeks ago
- The easiest way to get started with LlamaIndex ☆1,044 Updated this week
- A fast tool to convert any website into LLM-ready markdown data. Built by https://supermemory.ai ☆922 Updated 4 months ago
- Web scraper made with AI and simplicity in mind. It runs as a CLI that can be parallelized and outputs high-quality markdown content. ☆487 Updated this week
- 🪄 Create rich visualizations with AI ☆1,326 Updated last week
- A minimalistic AI-powered search engine that helps you find information on the internet. Powered by Vercel AI SDK Search with models like… ☆530 Updated this week
- Convert any PDF into a podcast episode! ☆1,511 Updated 2 weeks ago
- Examples of using E2B ☆738 Updated this week
- File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs. ☆681 Updated this week
- An experimental UI for text-to-knowledge-graph generation ☆746 Updated 6 months ago
- Easily deployable 🚀 API to convert PDF to markdown quickly with high accuracy. ☆762 Updated last month
- AI Meeting Minutes analysis App built with NextJS, Langflow, Groq, and OpenAI ☆349 Updated last month