getomni-ai / zerox
PDF to Markdown with vision models
☆6,324 · Updated this week
Related projects
Alternatives and complementary repositories for zerox
- 🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API. ☆18,840 · Updated this week
- Get your documents ready for gen AI ☆9,923 · Updated this week
- OCR, layout analysis, reading order, table recognition in 90+ languages ☆14,240 · Updated this week
- Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/ ☆6,985 · Updated this week
- Build real-time multimodal AI applications 🤖🎙️📹 ☆4,010 · Updated this week
- rewind.ai x cursor.com = your AI assistant that has all the context. 24/7 screen & voice recording for the age of super intelligence. get… ☆9,010 · Updated this week
- An open-source RAG-based tool for chatting with your documents. ☆17,436 · Updated this week
- Anthropic's educational courses ☆8,064 · Updated last month
- Open-source Next.js template for building apps that are fully generated by AI. By E2B. ☆3,471 · Updated this week
- Automate browser-based workflows with LLMs and Computer Vision ☆10,475 · Updated this week
- 📃 A better UX for chat, writing content, and coding with LLMs. ☆2,602 · Updated last week
- Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections. ☆2,186 · Updated 3 months ago
- LLM-powered multiagent persona simulation for imagination enhancement and business insights. ☆3,677 · Updated last week
- An AI-powered search engine with a generative UI ☆6,304 · Updated 2 weeks ago
- A simple screen parsing tool towards pure vision based GUI agent ☆4,768 · Updated 2 weeks ago
- The easiest way to use Agentic RAG in any enterprise ☆3,866 · Updated this week
- Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks ☆5,648 · Updated 2 weeks ago
- A collection of notebooks/recipes showcasing some fun and effective ways of using Claude. ☆6,899 · Updated last week
- Build a Perplexity-Inspired Answer Engine Using Next.js, Groq, Llama-3, Langchain, OpenAI, Upstash, Brave & Serper ☆4,660 · Updated last month
- Document to Markdown OCR library with Llama 3.2 vision ☆1,345 · Updated last week
- 🔍 AI search engine - self-host with local or cloud LLMs ☆2,749 · Updated last month
- Make websites accessible for AI agents ☆2,094 · Updated this week
- Build AI Agents with memory, knowledge, tools and reasoning. Chat with them using a beautiful Agent UI. ☆15,471 · Updated this week
- Open source Claude Artifacts – built with Llama 3.1 405B ☆3,555 · Updated this week
- Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your p… ☆12,286 · Updated last week
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model ☆6,053 · Updated this week
- Chat first code editor. To download the packaged app: ☆5,124 · Updated last week
- Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI ☆15,527 · Updated this week
- A language model programming library. ☆5,295 · Updated this week
- Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sag… ☆13,971 · Updated this week