yigitkonur / swift-ocr-llm-powered-pdf-to-markdown
An open-source OCR API that leverages OpenAI's powerful language models with optimized performance techniques like parallel processing and batching to deliver high-quality text extraction from complex PDF documents. Ideal for businesses seeking efficient document digitization and data extraction solutions.
☆812Updated 3 months ago
Alternatives and similar repositories for swift-ocr-llm-powered-pdf-to-markdown:
Users that are interested in swift-ocr-llm-powered-pdf-to-markdown are comparing it to the libraries listed below
- A lightweight task engine for building stateful AI agents that prioritizes simplicity and flexibility.☆837Updated last week
- Web scraper made for AI and simplicity in mind. It runs as a CLI that can be parallelized and outputs high-quality markdown content.☆506Updated this week
- Open-source framework for exporting your personal data.☆1,401Updated 3 weeks ago
- Vision model based document ingestion☆1,302Updated this week
- Detect and extract tables to markdown and csv☆711Updated last week
- Visualise your CSV files in seconds without sending your data anywhere☆464Updated last week
- ☆430Updated 3 months ago
- ➖ Stripped down, stable version of firecrawl optimized for self-hosting and ease of contribution. Billing logic and AI features are compl…☆295Updated 2 weeks ago
- Local voice chatbot for engaging conversations, powered by Ollama, Hugging Face Transformers, and Coqui TTS Toolkit☆738Updated 5 months ago
- Open-source platform for extracting structured data from documents using AI.☆1,183Updated this week
- Browser automation system that uses AI-driven planning to navigate web pages and perform goals.☆693Updated last week
- DOM to Semantic-Markdown for use with LLMs☆710Updated last week
- An experiment in meeting transcription and diarization with just an LLM. Maybe I went a little overboard though☆368Updated 2 months ago
- Omni SenseVoice: High-Speed Speech Recognition with words timestamps 🗣️🎯☆789Updated 3 weeks ago
- pingcap/autoflow is a Graph RAG based and conversational knowledge base tool built with TiDB Serverless Vector Storage. Demo: https://tid…☆2,159Updated this week
- 🦛 CHONK your texts with Chonkie ✨ - The no-nonsense RAG chunking library☆2,249Updated this week
- 🥤 RAGLite is a Python toolkit for Retrieval-Augmented Generation (RAG) with PostgreSQL or SQLite☆678Updated this week
- ScribeWizard: Generate organized notes from audio using Groq, Whisper, and Llama3☆471Updated 4 months ago
- Create mind maps to learn new things using AI.☆522Updated 2 months ago
- High-performance retrieval engine for unstructured data☆1,110Updated this week
- RAG Logger is an open-source logging tool designed specifically for Retrieval-Augmented Generation (RAG) applications. It serves as a lig…☆213Updated 3 weeks ago
- Convert any PDF into a podcast episode!☆653Updated 2 months ago
- The only fully local production-grade Super SDK that provides a simple, unified, and powerful interface for calling more than 200+ LLMs.☆400Updated this week
- Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections.☆2,298Updated 4 months ago
- ☆276Updated 3 weeks ago
- Algolia alternative for technical docs☆464Updated 2 months ago
- An Open Source implementation of Notebook LM with more flexibility and features☆869Updated last month
- ai for jq☆236Updated 3 months ago
- No-code ETL and data pipelines with AI and NLP☆275Updated 2 months ago
- Company Researcher tool helps you instantly understand any company inside out.☆690Updated last week