adithya-s-k / omniparseLinks
Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks
☆6,558Updated this week
Alternatives and similar repositories for omniparse
Users that are interested in omniparse are comparing it to the libraries listed below
Sorting:
- Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/☆8,806Updated 3 weeks ago
- This project is a template for my help my pupils from SENAC build your own project Html5☆11Updated 10 months ago
- OCR & Document Extraction using vision models☆11,232Updated last week
- 🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)☆6,378Updated 4 months ago
- 🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.☆38,945Updated this week
- Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you ne…☆7,930Updated this week
- Question and Answer based on Anything.☆13,213Updated 2 months ago
- 🤖 Open-source GenBI AI Agent that empowers data-driven teams to chat with their databases to generate Text-to-SQL, BI and embedded AI. �…☆7,900Updated this week
- A modular graph-based Retrieval-Augmented Generation (RAG) system☆25,439Updated last week
- Retrieval Augmented Generation (RAG) chatbot powered by Weaviate☆7,133Updated 2 months ago
- RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry☆4,084Updated 3 months ago
- ChatOllama is an open source chatbot based on LLMs. It supports a wide range of language models, and knowledge base management.☆3,159Updated last week
- File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.☆6,449Updated 3 months ago
- A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。☆34,203Updated this week
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model☆7,581Updated 3 months ago
- 🔍 AI search engine - self-host with local or cloud LLMs☆3,315Updated 8 months ago
- 💬 MaxKB is an open-source AI assistant for enterprise. It seamlessly integrates RAG pipelines, supports robust workflows, and provides M…☆16,688Updated this week
- Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean…☆11,355Updated this week
- FreeAskInternet is a completely free, PRIVATE and LOCALLY running search aggregator & answer generate using MULTI LLMs, without GPU neede…☆8,706Updated last year
- Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.☆6,074Updated last week
- Toolkit for linearizing PDFs for LLM datasets/training☆12,482Updated this week
- 🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using RAG 🔄.☆17,856Updated last month
- Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.☆8,967Updated last week
- GraphRAG using Local LLMs - Features robust API and multiple apps for Indexing/Prompt Tuning/Query/Chat/Visualizing/Etc. This is meant to…☆2,151Updated 6 months ago
- An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.☆24,399Updated 3 weeks ago
- The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, MCP compatibility, and more.☆44,523Updated this week
- Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI☆22,098Updated this week
- OCR, layout analysis, reading order, table recognition in 90+ languages☆17,508Updated this week
- Python scraper based on AI☆19,817Updated this week
- Build a Perplexity-Inspired Answer Engine Using Next.js, Groq, Llama-3, Langchain, OpenAI, Upstash, Brave & Serper☆4,896Updated 8 months ago