Nutlope / llama-ocr
Document to Markdown OCR library with Llama 3.2 vision
☆2,262Updated 3 months ago
Alternatives and similar repositories for llama-ocr:
Users that are interested in llama-ocr are comparing it to the libraries listed below
- Document (PDF, Word, PPTX ...) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents…☆2,546Updated last week
- SOTA Open-Source Browser Agent for autonomously performing complex tasks on the web☆1,548Updated this week
- ☆1,464Updated last month
- 🔥 Open Source Browser API for AI Agents & Apps. Steel Browser is a batteries-included browser instance that lets you automate the web wi…☆4,239Updated this week
- Vision infrastructure to turn complex documents into RAG/LLM-ready data☆2,124Updated this week
- An open-source OCR API that leverages OpenAI's powerful language models with optimized performance techniques like parallel processing an…☆848Updated 7 months ago
- NVIDIA Ingest is an early access set of microservices for parsing hundreds of thousands of complex, messy unstructured PDFs and other ent…☆2,653Updated this week
- No-code LLM Platform to launch APIs and ETL Pipelines to structure unstructured documents☆5,116Updated this week
- Colivara is a suite of services that allows you to store, search, and retrieve documents based on their visual embedding. ColiVara has st…☆895Updated 2 months ago
- Scira (Formerly MiniPerplx) is a minimalistic AI-powered search engine that helps you find information on the internet. Powered by Vercel…☆7,761Updated last week
- Detect and extract tables to markdown and csv☆742Updated 3 months ago
- Instagram Ai Agent 🌸 is built using Node.js and TypeScript 🛠️, designed for seamless job execution 📸. It's lightweight, efficient, and…☆3,179Updated 2 months ago
- Fetch an entire site and save it as a text file (to be used with AI models).☆1,380Updated 3 months ago
- Open source multi-modal RAG for building AI apps over private knowledge.☆1,667Updated this week
- Convert any PDF into a podcast episode!☆2,221Updated 4 months ago
- Enable AI models for video production in the browser☆1,616Updated last month
- A fast tool to convert any website into LLM-ready markdown data. Built by https://supermemory.ai☆1,305Updated 9 months ago
- Easily deployable and scalable backend server that efficiently converts various document formats (pdf, docx, pptx, html, images, etc) int…☆552Updated last month
- Things you can do with the token embeddings of an LLM☆1,437Updated 3 weeks ago
- Open source Claude Artifacts – built with Llama 3.1 405B☆5,923Updated 2 weeks ago
- A visual playground for agentic workflows: Iterate over your agents 10x faster☆4,611Updated 2 weeks ago
- Sample apps to help developers get started with Structured Outputs☆629Updated 3 months ago
- OCR Benchmark☆464Updated last week
- A fast multimodal LLM for real-time voice☆3,855Updated 2 months ago
- Turn any webpage into structured data using LLMs☆4,762Updated 7 months ago
- Your agent in your terminal, equipped with local tools: writes code, uses the terminal, browses the web, vision.☆3,738Updated last week
- Knowledge Agents and Management in the Cloud☆3,906Updated this week
- ☆3,474Updated 5 months ago
- ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexible and powerful document workflows.☆1,205Updated this week
- Lightpanda: the headless browser designed for AI and automation☆8,583Updated this week