yigitkonur / swift-ocr-llm-powered-pdf-to-markdown

An open-source OCR API that leverages OpenAI's powerful language models with optimized performance techniques like parallel processing and batching to deliver high-quality text extraction from complex PDF documents. Ideal for businesses seeking efficient document digitization and data extraction solutions.

☆852

Alternatives and similar repositories for swift-ocr-llm-powered-pdf-to-markdown:

Users that are interested in swift-ocr-llm-powered-pdf-to-markdown are comparing it to the libraries listed below

DonTizi / rlama
A powerful document AI question-answering tool that connects to your local Ollama models. Create, manage, and interact with RAG systems f…
☆979Updated last month
lmnr-ai / index
The SOTA Open-Source Browser Agent for autonomously performing complex tasks on the web
☆2,037Updated this week
ses4255 / Versatile-OCR-Program
Multi-modal OCR pipeline optimized for ML training (text, figure, math, tables, diagrams)
☆621Updated this week
PragmaticMachineLearning / probly
☆826Updated this week
punnerud / Local_Knowledge_Graph
☆438Updated 7 months ago
clemlesne / scrape-it-now
Web scraper made for AI and simplicity in mind. It runs as a CLI that can be parallelized and outputs high-quality markdown content.
☆516Updated 2 months ago
thiswillbeyourgithub / wdoc
Summarize and query from a lot of heterogeneous documents. Any LLM provider, any filetype, scalable (?), WIP
☆451Updated this week
vlm-run / vlmrun-hub
A hub for various industry-specific schemas to be used with VLMs.
☆503Updated this week
getomni-ai / benchmark
OCR Benchmark
☆470Updated 2 weeks ago
Surfer-Org / Protocol
Open-source framework for exporting your personal data.
☆1,429Updated 4 months ago
mohsen1 / llm-debugger-vscode-extension
VSCode extension that demonstrates the use of large language models (LLMs) for active debugging of programs
☆330Updated 2 months ago
DocumindHQ / documind
Open-source platform for extracting structured data from documents using AI.
☆1,303Updated last week
devflowinc / firecrawl-simple
➖ Stripped down, stable version of firecrawl optimized for self-hosting and ease of contribution. Billing logic and AI features are compl…
☆430Updated last month
stuzero / pg-mcp-server
☆442Updated last week
stanford-mast / blast
Browser-LLM Auto-Scaling Technology
☆428Updated this week
vlm-run / vlmrun-cookbook
Examples and guides for using the VLM Run API
☆275Updated this week
Bklieger / ScribeWizard
ScribeWizard: Generate organized notes from audio using Groq, Whisper, and Llama3
☆488Updated 3 months ago
mirth / chonky
Fully neural approach for text chunking
☆341Updated last week
herol3oy / austen
📚 discover story relationships
☆321Updated last week
visprex / visprex
Visualise your CSV files in seconds without sending your data anywhere
☆507Updated last month
VikParuchuri / tabled
Detect and extract tables to markdown and csv
☆743Updated 3 months ago
dhealy05 / frames_of_mind
Animating R1's thoughts.
☆380Updated 2 months ago
lifeiteng / OmniSenseVoice
Omni SenseVoice: High-Speed Speech Recognition with words timestamps 🗣️🎯
☆837Updated last month
SouthBridgeAI / offmute
An experiment in meeting transcription and diarization with just an LLM. Maybe I went a little overboard though
☆543Updated 3 weeks ago
Brandon-c-tech / RAG-logger
RAG Logger is an open-source logging tool designed specifically for Retrieval-Augmented Generation (RAG) applications. It serves as a lig…
☆222Updated 4 months ago
souzatharsis / tamingLLMs
Taming LLMs: A Practical Guide to LLM Pitfalls with Open Source Software
☆308Updated 3 months ago
codingmoh / open-codex
Fully open-source command-line AI assistant inspired by OpenAI Codex, supporting local language models.
☆461Updated this week
JigsawStack / insanely-fast-whisper-api
An API to transcribe audio with OpenAI's Whisper Large v3!
☆271Updated 5 months ago
matiasmolinas / evolving-agents
Your toolkit for autonomous, evolving agent ecosystems. Create, execute, govern, and evolve agents that learn from experience, collaborat…
☆426Updated 2 weeks ago
romansky / dom-to-semantic-markdown
DOM to Semantic-Markdown for use with LLMs
☆822Updated 3 months ago