Nutlope / llama-ocr
Document to Markdown OCR library with Llama 3.2 vision
☆2,289Updated 3 months ago
Alternatives and similar repositories for llama-ocr
Users that are interested in llama-ocr are comparing it to the libraries listed below
Sorting:
- Document (PDF, Word, PPTX ...) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents…☆2,575Updated 2 weeks ago
- Vision infrastructure to turn complex documents into RAG/LLM-ready data☆2,152Updated this week
- Detect and extract tables to markdown and csv☆745Updated 3 months ago
- NVIDIA Ingest is an early access set of microservices for parsing hundreds of thousands of complex, messy unstructured PDFs and other ent…☆2,665Updated this week
- Colivara is a suite of services that allows you to store, search, and retrieve documents based on their visual embedding. ColiVara has st…☆945Updated 2 weeks ago
- ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexible and powerful document workflows.☆1,247Updated this week
- 🔥 Open Source Browser API for AI Agents & Apps. Steel Browser is a batteries-included browser instance that lets you automate the web wi…☆4,334Updated this week
- 🥤 RAGLite is a Python toolkit for Retrieval-Augmented Generation (RAG) with PostgreSQL or SQLite☆938Updated this week
- An open-source OCR API that leverages OpenAI's powerful language models with optimized performance techniques like parallel processing an…☆853Updated 7 months ago
- Lightweight library for scraping web-sites with LLMs☆1,083Updated 3 weeks ago
- Open Source Application for Advanced LLM Engineering: interact, train, fine-tune, and evaluate large language models on your own computer…☆3,237Updated last week
- ☆1,507Updated 2 months ago
- Easily deployable and scalable backend server that efficiently converts various document formats (pdf, docx, pptx, html, images, etc) int…☆579Updated 2 months ago
- An experiment in meeting transcription and diarization with just an LLM. Maybe I went a little overboard though☆544Updated last month
- ☆1,255Updated 3 weeks ago
- ContextGem: Effortless LLM extraction from documents☆914Updated last week
- napkins.dev – from screenshot to app☆1,297Updated last month
- Company Researcher tool helps you instantly understand any company inside out.☆1,187Updated 2 weeks ago
- Scira (Formerly MiniPerplx) is a minimalistic AI-powered search engine that helps you find information on the internet and cites it too. …☆7,971Updated this week
- Open source multi-modal RAG for building AI apps over private knowledge.☆2,266Updated this week
- open-source framework for creating and managing simulations populated with AI-powered agents. It provides an intuitive platform for desig…☆915Updated 3 months ago
- File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.☆6,416Updated 2 months ago
- The SOTA Open-Source Browser Agent for autonomously performing complex tasks on the web☆2,110Updated last week
- A curated list of resources about AI agents for Computer Use, including research papers, projects, frameworks, and tools.☆1,229Updated last month
- Sample apps to help developers get started with Structured Outputs☆636Updated 4 months ago
- AI-powered multi-agent builder☆2,787Updated this week
- Sim Studio is an open-source AI agent workflow builder. Sim Studio's interface is a lightweight, intuitive way to quickly build and deplo…☆3,318Updated this week
- Turn any webpage into structured data using LLMs☆4,837Updated 8 months ago
- A Python package that makes it easy for developers to create AI apps powered by various AI providers.☆1,605Updated last month
- Local realtime voice AI☆2,290Updated 2 months ago