cleanlab / cleanlab-studio
Client interface to Cleanlab Studio
☆32 Updated 9 months ago
Alternatives and similar repositories for cleanlab-studio
Users who are interested in cleanlab-studio are comparing it to the libraries listed below.
- Fiddler Auditor is a tool to evaluate language models. ☆188 Updated last year
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models ☆115 Updated 8 months ago
- ☆80 Updated last year
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte… ☆78 Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute… ☆51 Updated last year
- Solving data for LLMs - Create quality synthetic datasets! ☆150 Updated 10 months ago
- A Lightweight Library for AI Observability ☆252 Updated 9 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡ ☆68 Updated 3 weeks ago
- Simple examples using Argilla tools to build AI ☆56 Updated last year
- Initiative to evaluate and rank the most popular LLMs across common task types based on their propensity to hallucinate. ☆116 Updated 4 months ago
- This is the reproduction repository for my 🤗 Hugging Face blog post on synthetic data ☆68 Updated last year
- A python implementation of priompt - a neat way of managing context from diverse sources for LLM applications. ☆113 Updated 5 months ago
- Verbosity control for AI agents ☆64 Updated last year
- Python SDK for experimenting, testing, evaluating & monitoring LLM-powered applications - Parea AI (YC S23) ☆81 Updated 10 months ago
- Explore the use of DSPy for extracting features from PDFs 🔎 ☆49 Updated last year
- Doing simple retrieval from LLM models at various context lengths to measure accuracy ☆107 Updated 2 months ago
- Dynamic Metadata based RAG Framework ☆78 Updated last week
- Dataset Viber is your chill repo for data collection, annotation and vibe checks. ☆45 Updated last year
- Simple UI for debugging correlations of text embeddings ☆302 Updated 6 months ago
- Streamlit app for recommending eval functions using prompt diffs ☆30 Updated last year
- ☆40 Updated last year
- Synthetic Text Dataset Generation for LLM projects ☆52 Updated 2 weeks ago
- Writing Blog Posts with Generative Feedback Loops! ☆50 Updated last year
- ☆148 Updated last year
- ☆210 Updated 5 months ago
- Simple AI agents / assistants ☆51 Updated last year
- This open-source repository offers reference code for integrating workplace datastores with Cohere's LLMs, enabling developers and busine… ☆154 Updated last year
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo) ☆89 Updated 2 weeks ago
- Rank LLMs, RAG systems, and prompts using automated head-to-head evaluation ☆108 Updated 11 months ago
- ☆160 Updated last year