Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website to learn more about our enterprise grade Platform product for production grade workflows, partitioning, enrichments, chunking and embedding.
☆14,214Mar 4, 2026Updated last week
Alternatives and similar repositories for unstructured
Users that are interested in unstructured are comparing it to the libraries listed below
Sorting:
- LlamaIndex is the leading document agent and OCR platform☆47,608Updated this week
- DSPy: The framework for programming—not prompting—language models☆32,696Updated this week
- Open-source AI orchestration framework for building context-engineered, production-ready LLM applications. Design modular pipelines and a…☆24,436Updated this week
- Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing a…☆38,879Updated this week
- Supercharge Your LLM Application Evaluations 🚀☆12,927Feb 24, 2026Updated 2 weeks ago
- structured outputs for llms☆12,512Updated this week
- A modular graph-based Retrieval-Augmented Generation (RAG) system☆31,474Updated this week
- Universal memory layer for AI Agents☆49,365Updated this week
- The agent engineering platform☆129,503Updated this week
- A programming framework for agentic AI☆55,559Updated this week
- Structured Outputs☆13,539Updated this week
- Convert PDF to markdown + JSON quickly with high accuracy☆32,278Mar 4, 2026Updated last week
- Get your documents ready for gen AI☆55,513Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆72,827Updated this week
- 💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows☆12,266Mar 4, 2026Updated last week
- Letta is the platform for building stateful agents: AI with advanced memory that can learn and self-improve over time.☆21,579Updated this week
- Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work t…☆45,821Updated this week
- Build, run, manage agentic software at scale.☆38,700Updated this week
- Open-source search and retrieval database for AI applications.☆26,523Mar 9, 2026Updated last week
- 🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with Open…☆23,113Updated this week
- A guidance language for controlling large language models.☆21,346Updated this week
- Build AI Agents, Visually☆50,762Updated this week
- Build Conversational AI in minutes ⚡️☆11,691Mar 5, 2026Updated last week
- RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to creat…☆74,968Updated this week
- Knowledge Agents and Management in the Cloud☆4,244Feb 17, 2026Updated 3 weeks ago
- OCR, layout analysis, reading order, table recognition in 90+ languages☆19,431Mar 1, 2026Updated 2 weeks ago
- Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with struc…☆15,795Updated this week
- Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cl…☆29,491Updated this week
- An autonomous agent that conducts deep research on any data using any LLM providers