Unstructured-IO / unstructured
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
☆10,581Updated this week
Alternatives and similar repositories for unstructured:
Users that are interested in unstructured are comparing it to the libraries listed below
- Build resilient language agents as graphs.☆10,484Updated this week
- the AI-native open-source embedding database☆18,797Updated this week
- Build Conversational AI in minutes ⚡️☆8,944Updated this week
- LlamaIndex is the leading framework for building LLM-powered agents over your data.☆40,106Updated this week
- DSPy: The framework for programming—not prompting—language models☆22,574Updated this week
- Supercharge Your LLM Application Evaluations 🚀☆8,514Updated last week
- 💡 All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows☆10,606Updated this week
- 🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with Open…☆9,536Updated this week
- Structured Text Generation☆11,109Updated this week
- Semantic cache for LLMs. Fully integrated with LangChain and llama_index.☆7,468Updated 6 months ago
- ☆5,771Updated last week
- Knowledge Agents and Management in the Cloud☆3,791Updated this week
- Retrieval Augmented Generation (RAG) chatbot powered by Weaviate☆6,955Updated this week
- Adding guardrails to large language models.☆4,646Updated last week
- Open-source vector similarity search for Postgres☆14,641Updated last month
- A modular graph-based Retrieval-Augmented Generation (RAG) system☆23,734Updated this week
- Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sag…☆19,333Updated this week
- structured outputs for llms☆9,808Updated this week
- Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)☆11,909Updated this week
- tiktoken is a fast BPE tokeniser for use with OpenAI's models.☆13,852Updated this week
- [EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which ach…☆4,957Updated last week
- Developer-friendly, serverless vector database for AI applications. Easily add long-term memory to your LLM apps!☆5,890Updated this week
- A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/aut…☆41,803Updated this week
- Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cl…☆22,553Updated this week
- AI Observability & Evaluation☆5,075Updated this week
- A guidance language for controlling large language models.☆19,918Updated this week
- 🦜🔗 Build context-aware reasoning applications☆103,849Updated this week
- 😎 Awesome list of tools and projects with the awesome LangChain framework☆8,103Updated 2 weeks ago
- AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file convert…☆19,905Updated this week
- SoTA production-ready AI retrieval system. Agentic Retrieval-Augmented Generation (RAG) with a RESTful API.☆5,735Updated this week