cleanlab / cleanlab-tools
Cookbooks showcasing various applications of Cleanlab
☆15Updated this week
Alternatives and similar repositories for cleanlab-tools
Users that are interested in cleanlab-tools are comparing it to the libraries listed below
Sorting:
- Simple examples using Argilla tools to build AI☆52Updated 5 months ago
- Just a bunch of benchmark logs for different LLMs☆119Updated 9 months ago
- ☆29Updated last year
- ☆28Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆49Updated 10 months ago
- Train your own SOTA deductive reasoning model☆92Updated 2 months ago
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo)☆77Updated last month
- Framework for building, orchestrating and deploying multi-agent systems. Managed by OpenAI Solutions team. Experimental framework.☆91Updated 7 months ago
- Simple AI agents / assistants☆45Updated 7 months ago
- Python SDK for experimenting, testing, evaluating & monitoring LLM-powered applications - Parea AI (YC S23)☆76Updated 3 months ago
- A framework for evaluating function calls made by LLMs☆37Updated 9 months ago
- LlamaWorksDB is a Retrieval Augmented Generation (RAG) product designed to interact with the documentation of various products such as Ll…☆16Updated last year
- Mixing Language Models with Self-Verification and Meta-Verification☆104Updated 5 months ago
- Testing speed and accuracy of RAG with, and without Cross Encoder Reranker.☆48Updated last year
- The next evolution of Agents☆48Updated 3 weeks ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆53Updated 3 months ago
- ☆45Updated last year
- ☆114Updated 4 months ago
- MLX port for xjdr's entropix sampler (mimics jax implementation)☆64Updated 6 months ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆90Updated 3 months ago
- Code for ScribeAgent paper☆57Updated 2 months ago
- Claude API Test Project☆87Updated last year
- Dynamic Metadata based RAG Framework☆75Updated 9 months ago
- Function Calling Benchmark & Testing☆87Updated 10 months ago
- Simple GRPO scripts and configurations.☆58Updated 3 months ago
- Automatic Prompt Optimization☆35Updated last year
- Training an LLM to use a calculator with multi-turn reinforcement learning, achieving a **62% absolute increase in evaluation accuracy**.☆37Updated last week
- auto fine tune of models with synthetic data☆75Updated last year
- RAG example using DSPy, Gradio, FastAPI☆79Updated last year
- Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT)☆110Updated 3 months ago