cvs-health / uqlm
UQLM (Uncertainty Quantification for Language Models) is a Python package for UQ-based LLM hallucination detection
☆1,052 Updated this week
Alternatives and similar repositories for uqlm
Users who are interested in uqlm are comparing it to the libraries listed below
- ColiVara is a suite of services that allows you to store, search, and retrieve documents based on their visual embedding. ColiVara has st… ☆1,263 Updated 5 months ago
- Tool for generating high-quality synthetic datasets ☆1,255 Updated last week
- Semantic Chunker is a lightweight Python package for semantically-aware chunking and clustering of text. ☆273 Updated 6 months ago
- LettuceDetect is a hallucination detection framework for RAG applications. ☆504 Updated last month
- Build datasets using natural language ☆529 Updated 3 weeks ago
- An open-source tool for LLM prompt optimization. ☆642 Updated last week
- ☆1,113 Updated last week
- Create large-scale synthetic training data for model distillation and evaluation ☆591 Updated this week
- A single interface to use and evaluate different agent frameworks ☆977 Updated this week
- Readymade evaluators for your LLM apps ☆744 Updated last month
- Open source RAG evaluation package ☆310 Updated this week
- Doctor is a tool for discovering, crawling, and indexing web sites to be exposed as an MCP server for LLM agents. ☆460 Updated 4 months ago
- Python package and backend for the Elysia platform app. ☆1,731 Updated this week
- Readymade evaluators for agent trajectories ☆345 Updated last month
- For your multi-agent needs ☆1,167 Updated last week
- ☆1,073 Updated last month
- ☆309 Updated 5 months ago
- This repository shares end-to-end notebooks on how to use various Weaviate features and integrations! ☆896 Updated this week
- 🥤 RAGLite is a Python toolkit for Retrieval-Augmented Generation (RAG) with DuckDB or PostgreSQL ☆1,086 Updated last week
- 📝 Automatically annotate papers using LLMs ☆354 Updated 5 months ago
- Agent File (.af): An open file format for serializing stateful AI agents with persistent memory and behavior. Share, checkpoint, and vers… ☆939 Updated 4 months ago
- Fast State-of-the-Art Static Embeddings ☆1,858 Updated last week
- 🤗 Benchmark Large Language Models Reliably On Your Data ☆402 Updated last week
- ☆452 Updated last month
- Open-source framework for creating and managing simulations populated with AI-powered agents. It provides an intuitive platform for desig… ☆933 Updated 8 months ago
- OCR Benchmark ☆572 Updated 4 months ago
- This package, developed as part of our research detailed in the Chroma Technical Report, provides tools for text chunking and evaluation.… ☆425 Updated 6 months ago
- ☆584 Updated 7 months ago
- Cache-Augmented Generation: A Simple, Efficient Alternative to RAG ☆1,410 Updated 4 months ago
- A Kubernetes deployable instance of GroundX for document parsing, storage, and search. ☆798 Updated this week