Unstructured-IO / unstructured
Convert documents to structured data effortlessly. Unstructured is an open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website to learn more about our enterprise-grade Platform product for production-grade workflows, partitioning, enrichments, chunking, and embedding.
☆11,759 Updated this week
Alternatives and similar repositories for unstructured
Users who are interested in unstructured are comparing it to the libraries listed below.
- Structured Text Generation ☆11,963 Updated this week
- structured outputs for llms ☆10,824 Updated this week
- LlamaIndex is the leading framework for building LLM-powered agents over your data. ☆42,633 Updated this week
- Supercharge Your LLM Application Evaluations 🚀 ☆9,703 Updated this week
- the AI-native open-source embedding database ☆20,659 Updated this week
- DSPy: The framework for programming—not prompting—language models ☆25,821 Updated this week
- A modular graph-based Retrieval-Augmented Generation (RAG) system ☆26,017 Updated this week
- Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sag… ☆24,658 Updated this week
- Build Conversational AI in minutes ⚡️ ☆10,013 Updated 3 weeks ago
- Build resilient language agents as graphs. ☆14,845 Updated this week
- Knowledge Agents and Management in the Cloud ☆4,031 Updated this week
- Test your prompts, agents, and RAGs. Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Ge… ☆7,366 Updated this week
- Adding guardrails to large language models. ☆5,171 Updated 3 weeks ago
- 🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with Open… ☆13,170 Updated this week
- AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file convert… ☆21,281 Updated this week
- A guidance language for controlling large language models. ☆20,372 Updated this week
- Letta (formerly MemGPT) is the stateful agents framework with memory, reasoning, and context management. ☆17,020 Updated this week
- Semantic cache for LLMs. Fully integrated with LangChain and llama_index. ☆7,603 Updated 9 months ago
- 💡 All-in-one open-source AI framework for semantic search, LLM orchestration and language model workflows ☆11,140 Updated this week
- [EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLMs' perception of key information, compress the prompt and KV-Cache, which ach… ☆5,201 Updated 3 months ago
- Memory for AI Agents; Announcing OpenMemory MCP - local and secure memory management. ☆35,516 Updated this week
- Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls) ☆12,194 Updated last week
- Go ahead and axolotl questions ☆9,760 Updated this week
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale. ☆12,343 Updated last week
- Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud. ☆11,443 Updated last week
- Agent Framework / shim to use Pydantic with LLMs ☆10,487 Updated this week
- tiktoken is a fast BPE tokeniser for use with OpenAI's models. ☆14,935 Updated 3 months ago
- SoTA production-ready AI retrieval system. Agentic Retrieval-Augmented Generation (RAG) with a RESTful API. ☆6,993 Updated this week
- A Bulletproof Way to Generate Structured JSON from Language Models ☆4,757 Updated last year
- Retrieval Augmented Generation (RAG) chatbot powered by Weaviate ☆7,177 Updated 3 months ago