wjbmattingly / dots.ocrLinks
Multilingual Document Layout Parsing in a Single Vision-Language Model
β30Updated last month
Alternatives and similar repositories for dots.ocr
Users that are interested in dots.ocr are comparing it to the libraries listed below
Sorting:
- Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. π¨π»βπ³β327Updated 3 months ago
- Easiest way to give context to LLMs; Attachments has the ambition to be the general funnel for any files to be transformed into images+teβ¦β279Updated last week
- β113Updated 9 months ago
- β59Updated 5 months ago
- Extract structured text from pdfs quicklyβ589Updated 2 months ago
- LitePali is a minimal, efficient implementation of ColPali for image retrieval and indexing, optimized for cloud deployment.β64Updated 10 months ago
- 90% of what you need for LLM app development. Nothing you don't.β265Updated last week
- Structured information extraction from documentsβ317Updated 11 months ago
- Convert a web page to markdownβ77Updated last year
- Extract and convert data from any document, images, pdfs, word doc, ppt or URL into multiple formats (Markdown, JSON, CSV, HTML) with intβ¦β503Updated this week
- Join 15k builders to the Real-World ML Newsletter β¬οΈβ¬οΈβ¬οΈβ48Updated last year
- β19Updated 6 months ago
- Fine tune Gemma 3 on an object detection taskβ79Updated last month
- AI powered local typing assistant built with Ollamaβ316Updated last year
- Open Source Note GPT. Turn your photos and images into text notes (in obsidian)β94Updated 6 months ago
- A bit of extra usability for sqliteβ209Updated last month
- TWIX is an open-source data extraction tool that reconstructs structured data from documents at scale, accurately and at low cost, by infβ¦β202Updated 3 months ago
- β140Updated 3 weeks ago
- Claudette is Claude's friendβ265Updated 3 weeks ago
- Simple UI for debugging correlations of text embeddingsβ290Updated 3 months ago
- ShellSage saves sysadminsβ sanity by solving shell script snafus super swiftlyβ363Updated 2 months ago
- Makes it easy to use altair from FastHTMLβ26Updated 10 months ago
- Packages whisper.cpp into pre-built, pip-installable wheels, for macOS and Linux.β174Updated last year
- Minimal example of MCP for parsing llms.txtβ40Updated 4 months ago
- groq-gradioβ18Updated 3 months ago
- β210Updated 2 months ago
- Porting Shadcn-ui components to python/js for use with fasthtmlβ160Updated 9 months ago
- Qwen 2.5 Coder 1.5B with Code Interpreterβ286Updated 10 months ago
- Unattended Lightweight Text Classifiers with LLM Embeddingsβ183Updated 11 months ago
- A fork of sqlite-utils with CLI etc removedβ16Updated 5 months ago