DS4SD / docling
Get your documents ready for gen AI
☆18,239 · Updated this week
Alternatives and similar repositories for docling:
Users who are interested in docling are comparing it to the libraries listed below.
- File Parser optimised for LLM Ingestion with no loss. Parse PDFs, Docx, PPTx in a format that is ideal for LLMs. ☆4,966 · Updated this week
- Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API. ☆21,693 · Updated this week
- Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper ☆25,127 · Updated this week
- OCR, layout analysis, reading order, table recognition in 90+ languages ☆15,474 · Updated this week
- Automate browser-based workflows with LLMs and Computer Vision ☆11,426 · Updated this week
- Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sag… ☆16,235 · Updated this week
- Agent Framework / shim to use Pydantic with LLMs ☆5,346 · Updated this week
- An open-source RAG-based tool for chatting with your documents. ☆20,315 · Updated this week
- Convert PDF to markdown + JSON quickly with high accuracy ☆19,314 · Updated this week
- The Memory layer for your AI apps ☆23,953 · Updated this week
- library & platform to build, distribute, monetize ai apps that have the full context (like rewind, granola, etc.), open source, 100% loca… ☆11,575 · Updated this week
- Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI ☆18,641 · Updated this week
- Composable building blocks to build Llama Apps ☆6,036 · Updated this week
- Open-source AI Agent that empowers data-driven teams to chat with their data to generate Text-to-SQL, charts, spreadsheets, reports, a… ☆4,336 · Updated this week
- Retrieval Augmented Generation (RAG) chatbot powered by Weaviate ☆6,639 · Updated this week
- PDF to Markdown with vision models ☆8,298 · Updated last month
- A collection of notebooks/recipes showcasing some fun and effective ways of using Claude. ☆9,387 · Updated this week
- smolagents: a barebones library for agents. Agents write python code to call tools and orchestrate other agents. ☆5,197 · Updated this week
- Build multi-modal Agents with memory, knowledge, tools and reasoning. Chat with them using a beautiful Agent UI. ☆17,869 · Updated this week
- LLM-powered multiagent persona simulation for imagination enhancement and business insights. ☆5,233 · Updated 2 weeks ago
- Flexible and powerful framework for managing multiple AI agents and handling complex conversations ☆3,835 · Updated this week
- Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini and opensource models. ☆12,576 · Updated this week
- A Comprehensive Toolkit for High-Quality PDF Content Extraction ☆6,398 · Updated 2 weeks ago
- Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your p… ☆22,748 · Updated this week
- structured outputs for llms ☆8,909 · Updated this week
- Ingest, parse, and optimize any data format, from documents to multimedia, for enhanced compatibility with GenAI frameworks ☆5,974 · Updated 2 months ago
- Crawlee: a web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Dow… ☆5,077 · Updated this week
- Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines. ☆9,785 · Updated this week
- From RAG chatbots to code assistants to complex agentic pipelines and beyond, build LLM systems that run better, faster, and cheaper with… ☆4,430 · Updated this week
- Scira (Formerly MiniPerplx) is a minimalistic AI-powered search engine that helps you find information on the internet. Powered by Vercel… ☆3,891 · Updated this week