yigitkonur / swift-ocr-llm-powered-pdf-to-markdown
An open-source OCR API that leverages OpenAI's powerful language models with optimized performance techniques like parallel processing and batching to deliver high-quality text extraction from complex PDF documents. Ideal for businesses seeking efficient document digitization and data extraction solutions.
☆688Updated last month
Related projects ⓘ
Alternatives and complementary repositories for swift-ocr-llm-powered-pdf-to-markdown
- Web scraper made for AI and simplicity in mind. It runs as a CLI that can be parallelized and outputs high-quality markdown content.☆486Updated this week
- Export your personal data in one click☆935Updated this week
- 🦛 CHONK your texts with Chonkie ✨ - The no-nonsense RAG chunking library☆789Updated this week
- Omni SenseVoice: High-Speed Speech Recognition with words timestamps 🗣️🎯☆699Updated this week
- Vision model based document ingestion☆1,226Updated this week
- This project collects GPU benchmarks from various cloud providers and compares them to fixed per token costs. Use our tool for efficient …☆208Updated last month
- ☆412Updated last month
- DOM to Semantic-Markdown for use with LLMs☆667Updated last month
- Create mind maps to learn new things using AI.☆462Updated last week
- Browser automation system that uses AI-driven planning to navigate web pages and perform goals.☆565Updated this week
- ai for jq☆234Updated last month
- Open-Source Web Automation library with any LLM☆1,508Updated this week
- Local voice chatbot for engaging conversations, powered by Ollama, Hugging Face Transformers, and Coqui TTS Toolkit☆714Updated 3 months ago
- Laminar - open-source all-in-one platform for engineering AI products. Traces, Evals, Datasets, Labels. YC S24.☆972Updated this week
- Chat with any codebase in under two minutes | Fully local or via third-party APIs☆1,034Updated this week
- OpenCV+YOLO+LLAVA powered video surveillance system☆687Updated 3 weeks ago
- 🪄 Create rich visualizations with AI☆1,223Updated this week
- Claude Memory: Long-term memory for Claude☆347Updated this week
- Detect and extract tables to markdown and csv☆617Updated this week
- The open-source AI-native IDE☆416Updated this week
- The only fully local production-grade Super SDK that provides a simple, unified, and powerful interface for calling more than 200+ LLMs.☆359Updated 2 weeks ago
- With one command, create a natural-sounding audiobook from a variety of input formats (epub, mobi, txt, PDF, HTML and more!)☆571Updated 3 weeks ago
- Open-Source Grammarly Alternative☆1,369Updated last week
- Detect whether or not an audio file was generated by NotebookLM☆119Updated 2 weeks ago
- An open source voice-enabled, compact, empathic AI hardware + software 🤖 framework for companionship, entertainment, education, pediatri…☆399Updated this week
- ☆162Updated 4 months ago
- Dropbase helps developers build and prototype web apps faster with AI. Dropbase is local-first and self hosted.☆1,084Updated last month
- Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections.☆2,166Updated 2 months ago
- Things you can do with the token embeddings of an LLM☆1,311Updated this week
- Visualise your CSV files in seconds without sending your data anywhere☆94Updated this week