Unstructured-IO / unstructured
Convert documents to structured data effortlessly. Unstructured is an open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website to learn more about our enterprise-grade Platform product for production-grade workflows, partitioning, enrichments, chunking, and embedding.
☆11,525 Updated this week
Alternatives and similar repositories for unstructured
Users who are interested in unstructured are comparing it to the libraries listed below.
- Build resilient language agents as graphs. ☆14,228 Updated last week
- structured outputs for llms ☆10,747 Updated last week
- DSPy: The framework for programming—not prompting—language models ☆25,466 Updated this week
- Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sag… ☆24,159 Updated this week
- Structured Text Generation ☆11,843 Updated this week
- Supercharge Your LLM Application Evaluations 🚀 ☆9,535 Updated this week
- A guidance language for controlling large language models. ☆20,336 Updated this week
- Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls) ☆12,165 Updated this week
- LlamaIndex is the leading framework for building LLM-powered agents over your data. ☆42,374 Updated this week
- the AI-native open-source embedding database ☆20,571 Updated this week
- Letta (formerly MemGPT) is the stateful agents framework with memory, reasoning, and context management. ☆16,917 Updated this week
- Build Conversational AI in minutes ⚡️ ☆9,967 Updated 2 weeks ago
- 🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with Open… ☆12,700 Updated this week
- A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/aut… ☆46,091 Updated this week
- Adding guardrails to large language models. ☆5,104 Updated 2 weeks ago
- 💡 All-in-one open-source AI framework for semantic search, LLM orchestration and language model workflows ☆11,110 Updated last week
- ☆5,926 Updated 2 weeks ago
- Retrieval Augmented Generation (RAG) chatbot powered by Weaviate ☆7,169 Updated 2 months ago
- 😎 Awesome list of tools and projects with the awesome LangChain framework ☆8,410 Updated last month
- Semantic cache for LLMs. Fully integrated with LangChain and llama_index. ☆7,595 Updated 9 months ago
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale. ☆12,293 Updated this week
- Open-source vector similarity search for Postgres ☆16,098 Updated 2 weeks ago
- Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work t… ☆32,974 Updated this week
- Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud. ☆11,366 Updated this week
- tiktoken is a fast BPE tokeniser for use with OpenAI's models. ☆14,829 Updated 3 months ago
- Knowledge Agents and Management in the Cloud ☆4,014 Updated this week
- Large Language Model Text Generation Inference ☆10,236 Updated this week
- Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with struc… ☆13,660 Updated this week
- Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team. ☆19,933 Updated 3 months ago
- A modular graph-based Retrieval-Augmented Generation (RAG) system ☆25,911 Updated this week