yigitkonur / swift-ocr-llm-powered-pdf-to-markdown
An open-source OCR API that leverages OpenAI's powerful language models with optimized performance techniques like parallel processing and batching to deliver high-quality text extraction from complex PDF documents. Ideal for businesses seeking efficient document digitization and data extraction solutions.
☆695Updated last month
Related projects ⓘ
Alternatives and complementary repositories for swift-ocr-llm-powered-pdf-to-markdown
- Web scraper made for AI and simplicity in mind. It runs as a CLI that can be parallelized and outputs high-quality markdown content.☆487Updated this week
- Open-source framework for exporting and building applications off of your personal data.☆938Updated this week
- Omni SenseVoice: High-Speed Speech Recognition with words timestamps 🗣️🎯☆734Updated 2 weeks ago
- Visualise your CSV files in seconds without sending your data anywhere☆430Updated last week
- ☆416Updated 2 months ago
- Create mind maps to learn new things using AI.☆478Updated 2 weeks ago
- Vision model based document ingestion☆1,242Updated this week
- ➖ Stripped down, stable version of firecrawl optimized for self-hosting and ease of contribution. Billing logic and AI features are compl…☆217Updated last week
- Local voice chatbot for engaging conversations, powered by Ollama, Hugging Face Transformers, and Coqui TTS Toolkit☆718Updated 3 months ago
- Open-source platform for extracting structured data from documents using AI.☆689Updated this week
- With one command, create a natural-sounding audiobook from a variety of input formats (epub, mobi, txt, PDF, HTML and more!)☆580Updated last month
- DOM to Semantic-Markdown for use with LLMs☆673Updated last month
- 🦛 CHONK your texts with Chonkie ✨ - The no-nonsense RAG chunking library☆1,415Updated this week
- Browser automation system that uses AI-driven planning to navigate web pages and perform goals.☆593Updated this week
- This project collects GPU benchmarks from various cloud providers and compares them to fixed per token costs. Use our tool for efficient …☆210Updated last month
- Dropbase helps developers build and prototype web apps faster with AI. Dropbase is local-first and self hosted.☆1,094Updated last month
- ☆446Updated this week
- Detect whether or not an audio file was generated by NotebookLM☆120Updated 3 weeks ago
- ai for jq☆234Updated 2 months ago
- Claude Memory: Long-term memory for Claude☆357Updated last week
- Convert any PDF into a podcast episode!☆597Updated this week
- 🪄 Create rich visualizations with AI☆1,326Updated last week
- Things you can do with the token embeddings of an LLM☆1,376Updated last week
- High-performance retrieval engine for unstructured data☆982Updated last week
- WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide a seamless conversations with an AI.☆1,547Updated 3 months ago
- RAG that intelligently adapts to your use case, data, and queries☆1,811Updated this week
- Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections.☆2,186Updated 3 months ago
- Detect and extract tables to markdown and csv☆633Updated this week
- The Open Source Memory Layer For Autonomous Agents☆1,483Updated 3 weeks ago
- An experiment in meeting transcription and diarization with just an LLM. Maybe I went a little overboard though☆320Updated 2 weeks ago