cleanlab / cleanlab-tlmLinks
Python client library for Cleanlab Trustworthy Language Model
☆24Updated 2 months ago
Alternatives and similar repositories for cleanlab-tlm
Users that are interested in cleanlab-tlm are comparing it to the libraries listed below
Sorting:
- Explore/examine/explain/expose your model with the explabox!☆19Updated 3 months ago
- Extending Conformal Prediction to LLMs☆69Updated last year
- Learning to route instances for Human vs AI Feedback (ACL Main '25)☆26Updated 6 months ago
- Interpret text data with LLMs (sklearn compatible).☆176Updated 2 weeks ago
- Interpretable and efficient predictors using pre-trained language models. Scikit-learn compatible.☆44Updated 3 months ago
- Fairness toolkit for pytorch, scikit learn and autogluon☆33Updated 2 months ago
- PyTorch package to train and audit ML models for Individual Fairness☆66Updated 4 months ago
- ☆82Updated last year
- Codebase accompanying the Summary of a Haystack paper.☆80Updated last year
- The Official Repository for "Bring Your Own Data! Self-Supervised Evaluation for Large Language Models"☆107Updated 2 years ago
- Flexible library for merging large language models (LLMs) via evolutionary optimization (ACL 2025 Demo).☆98Updated 6 months ago
- Efficient multi-prompt evaluation of LLMs☆28Updated last year
- Experimental library integrating LLM capabilities to support causal analyses☆287Updated last month
- Python package to compute interaction indices that extend the Shapley Value. AISTATS 2023.☆19Updated 2 years ago
- Codebase the paper "The Remarkable Robustness of LLMs: Stages of Inference?"☆19Updated 7 months ago
- Public code repo for paper "SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales"☆112Updated last year
- Code for the ICLR 2024 paper "How to catch an AI liar: Lie detection in black-box LLMs by asking unrelated questions"☆71Updated last year
- The code and data for "Are Large Pre-Trained Language Models Leaking Your Personal Information?" (Findings of EMNLP '22)☆27Updated 3 years ago
- ☆261Updated 10 months ago
- Client interface to Cleanlab Studio☆31Updated 11 months ago
- Evaluation of neuro-symbolic engines☆41Updated last year
- LLM Attributor: Attribute LLM's Generated Text to Training Data☆72Updated 4 months ago
- Cross-field empirical trends analysis of XAI literature☆22Updated 2 years ago
- Aligning with Human Judgement: The Role of Pairwise Preference in Large Language Model Evaluators (Liu et al.; COLM 2024)☆47Updated last year
- Unofficial implementation of Conformal Language Modeling by Quach et al☆29Updated 2 years ago
- RAGElo is a set of tools that helps you selecting the best RAG-based LLM agents by using an Elo ranker☆126Updated 3 months ago
- TARGET is a benchmark for evaluating Table Retrieval for Generative Tasks such as Fact Verification and Text-to-SQL☆28Updated 6 months ago
- TalkToModel gives anyone with the powers of XAI through natural language conversations 💬!☆126Updated 2 years ago
- Evaluate uncertainty, calibration, accuracy, and fairness of LLMs on real-world survey data!☆26Updated last month
- LangFair is a Python library for conducting use-case level LLM bias and fairness assessments☆252Updated last month