truera / trulens
Evaluation and Tracking for LLM Experiments
☆2,047Updated this week
Related projects: ⓘ
- AI Observability & Evaluation☆3,465Updated this week
- Evaluation framework for your Retrieval Augmented Generation (RAG) pipelines☆6,560Updated this week
- The LLM Evaluation Framework☆2,981Updated this week
- Superfast AI decision making and intelligent processing of multi-modal data.☆1,890Updated last week
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆2,817Updated 2 weeks ago
- LangServe 🦜️🏓☆1,877Updated this week
- Adding guardrails to large language models.☆3,873Updated this week
- A comprehensive guide to building RAG-based LLM applications for production.☆1,671Updated last month
- Parse files for optimal RAG☆2,450Updated this week
- Harness LLMs with Multi-Agent Programming☆2,293Updated this week
- An awesome & curated list of best LLMOps tools for developers☆3,730Updated last week
- Build resilient language agents as graphs.☆5,662Updated this week
- Open-source tool to visualise your RAG 🔮☆1,059Updated 6 months ago
- ☆772Updated 10 months ago
- ☆746Updated 2 weeks ago
- 🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with Llam…☆5,596Updated this week
- Efficient Retrieval Augmentation and Generation Framework☆1,255Updated last week
- To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which achieves up to 20x com…☆4,435Updated 3 weeks ago
- structured outputs for llms☆7,529Updated this week
- A real world full-stack application using LlamaIndex☆2,321Updated last month
- Developer APIs to Accelerate LLM Projects☆1,329Updated last month
- NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.☆3,978Updated this week
- ☆1,705Updated this week
- Doing simple retrieval from LLM models at various context lengths to measure accuracy☆1,451Updated last month
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs☆2,080Updated this week
- Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.☆8,446Updated this week
- Reference implementations of several LangChain agents as Streamlit apps☆1,240Updated last month
- Build Conversational AI in minutes ⚡️☆6,762Updated this week
- 🔍 LangKit: An open-source toolkit for monitoring Large Language Models (LLMs). 📚 Extracts signals from prompts & responses, ensuring sa…☆816Updated last month
- The official implementation of RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval☆847Updated 2 weeks ago