Nutlope / llama-ocr
Document to Markdown OCR library with Llama 3.2 vision
β2,075Updated last week
Alternatives and similar repositories for llama-ocr:
Users that are interested in llama-ocr are comparing it to the libraries listed below
- Document (PDF, Word, PPTX ...) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documentsβ¦β2,101Updated last week
- π¦ CHONK your texts with Chonkie β¨ - The no-nonsense RAG chunking libraryβ2,338Updated this week
- NVIDIA Ingest is an early access set of microservices for parsing hundreds of thousands of complex, messy unstructured PDFs and other entβ¦β2,335Updated this week
- pingcap/autoflow is a Graph RAG based and conversational knowledge base tool built with TiDB Serverless Vector Storage. Demo: https://tidβ¦β2,224Updated this week
- Vision model based document ingestionβ1,312Updated this week
- napkins.dev β from screenshot to appβ1,027Updated last week
- π₯ Open Source Browser API for AI Agents & Apps. Steel Browser is a batteries-included browser instance that lets you automate the web wiβ¦β3,289Updated this week
- File Parser optimised for LLM Ingestion with no loss π§ Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.β5,180Updated last week
- β712Updated 2 weeks ago
- π A better UX for chat, writing content, and coding with LLMs.β3,530Updated this week
- The open-source AI-native IDEβ1,648Updated this week
- Scira (Formerly MiniPerplx) is a minimalistic AI-powered search engine that helps you find information on the internet. Powered by Vercelβ¦β4,307Updated this week
- An Open Source implementation of Notebook LM with more flexibility and featuresβ927Updated 2 months ago
- An open-source OCR API that leverages OpenAI's powerful language models with optimized performance techniques like parallel processing anβ¦β813Updated 4 months ago
- RAG that intelligently adapts to your use case, data, and queriesβ2,798Updated last week
- π² An agent for sourcing, curating, and scheduling social media posts with human-in-the-loop.β822Updated this week
- A language model programming library.β5,568Updated last month
- A lightweight task engine for building stateful AI agents that prioritizes simplicity and flexibility.β878Updated 3 weeks ago
- Task-Aware Agent-driven Prompt Optimization Frameworkβ2,430Updated 2 weeks ago
- Parse files for optimal RAGβ3,573Updated this week
- Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections.β2,335Updated 5 months ago
- Convert any PDF into a podcast episode!β1,864Updated last month
- Fully local web research and report writing assistantβ1,154Updated this week
- Open-source Next.js template for building apps that are fully generated by AI. By E2B.β3,795Updated this week
- PDF to Markdown with vision modelsβ9,212Updated last month
- Flexible and powerful framework for managing multiple AI agents and handling complex conversationsβ3,917Updated last week
- Detect and extract tables to markdown and csvβ720Updated this week
- An AI web browsing framework focused on simplicity and extensibility.β6,824Updated this week
- A curated list of resources about AI agents for Computer Use, including research papers, projects, frameworks, and tools.β622Updated 2 weeks ago
- π₯€ RAGLite is a Python toolkit for Retrieval-Augmented Generation (RAG) with PostgreSQL or SQLiteβ702Updated 2 weeks ago