getomni-ai / zerox
PDF to Markdown with vision models
β8,298Updated last month
Alternatives and similar repositories for zerox:
Users that are interested in zerox are comparing it to the libraries listed below
- OCR, layout analysis, reading order, table recognition in 90+ languagesβ15,474Updated this week
- A language model programming library.β5,556Updated 3 weeks ago
- π₯ Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.β21,693Updated this week
- Convert PDF to markdown + JSON quickly with high accuracyβ19,314Updated this week
- Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/β7,430Updated this week
- File Parser optimised for LLM Ingestion with no loss π§ Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.β4,966Updated this week
- Ingest, parse, and optimize any data format β‘οΈ from documents to multimedia β‘οΈ for enhanced compatibility with GenAI frameworksβ5,974Updated 2 months ago
- Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AIβ18,641Updated last week
- A collection of notebooks/recipes showcasing some fun and effective ways of using Claude.β9,387Updated this week
- π A better UX for chat, writing content, and coding with LLMs.β3,443Updated 2 weeks ago
- Automate browser-based workflows with LLMs and Computer Visionβ11,426Updated this week
- The easiest way to use Agentic RAG in any enterpriseβ3,972Updated 2 weeks ago
- π An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)β5,719Updated last week
- Anthropic's educational coursesβ8,665Updated last month
- Get your documents ready for gen AIβ18,239Updated this week
- A Comprehensive Toolkit for High-Quality PDF Content Extractionβ6,398Updated 2 weeks ago
- Build multi-modal Agents with memory, knowledge, tools and reasoning. Chat with them using a beautiful Agent UI.β17,869Updated this week
- Python scraper based on AIβ17,181Updated this week
- Document to Markdown OCR library with Llama 3.2 visionβ2,035Updated 2 months ago
- Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagβ¦β16,235Updated this week
- A simple screen parsing tool towards pure vision based GUI agentβ5,509Updated last week
- An open-source RAG-based tool for chatting with your documents.β20,315Updated this week
- library & platform to build, distribute, monetize ai apps that have the full context (like rewind, granola, etc.), open source, 100% locaβ¦β11,575Updated this week
- Document (PDF, Word, PPTX ...) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documentsβ¦β2,046Updated this week
- π₯ Open Source Browser API for AI Agents & Apps. Steel Browser is a batteries-included browser instance that lets you automate the web wiβ¦β2,898Updated last week
- An AI-powered search engine with a generative UIβ6,626Updated this week
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Modelβ6,576Updated this week
- Retrieval Augmented Generation (RAG) chatbot powered by Weaviateβ6,639Updated this week
- A collection of projects designed to help developers quickly get started with building deployable applications using the Anthropic APIβ7,411Updated 3 weeks ago
- No-code LLM Platform to launch APIs and ETL Pipelines to structure unstructured documentsβ3,346Updated this week