cleanlab / cleanlab-tlm
Python client library for Cleanlab Trustworthy Language Model
☆23Updated this week
Alternatives and similar repositories for cleanlab-tlm
Users who are interested in cleanlab-tlm are comparing it to the libraries listed below.
- Extending Conformal Prediction to LLMs☆67Updated last year
- ☆74Updated last year
- Experimental library integrating LLM capabilities to support causal analyses☆248Updated last month
- Interpret text data using LLMs (scikit-learn compatible).☆170Updated last month
- RAGElo is a set of tools that helps you select the best RAG-based LLM agents by using an Elo ranker☆117Updated this week
- Interpretable and efficient predictors using pre-trained language models. Scikit-learn compatible.☆44Updated 6 months ago
- Codebase accompanying the Summary of a Haystack paper.☆79Updated last year
- Explore/examine/explain/expose your model with the explabox!☆18Updated 3 weeks ago
- We develop benchmarks and analysis tools to evaluate the causal reasoning abilities of LLMs.☆126Updated last year
- Evaluate uncertainty, calibration, accuracy, and fairness of LLMs on real-world survey data!☆25Updated 5 months ago
- The Official Repository for "Bring Your Own Data! Self-Supervised Evaluation for Large Language Models"☆107Updated 2 years ago
- Learning to route instances for Human vs AI Feedback (ACL Main '25)☆24Updated 2 months ago
- Efficient multi-prompt evaluation of LLMs☆22Updated 9 months ago
- Code for the ICLR 2024 paper "How to catch an AI liar: Lie detection in black-box LLMs by asking unrelated questions"☆72Updated last year
- Data and code for the Corr2Cause paper (ICLR 2024)☆111Updated last year
- Landing page for MIB: A Mechanistic Interpretability Benchmark☆19Updated last month
- LangFair is a Python library for conducting use-case level LLM bias and fairness assessments☆232Updated 2 weeks ago
- Codebase for the paper "The Remarkable Robustness of LLMs: Stages of Inference?"☆18Updated 3 months ago
- LLM Attributor: Attribute LLM's Generated Text to Training Data☆60Updated last week
- ☆249Updated 6 months ago
- A curated list of awesome academic research, books, code of ethics, data sets, institutes, maturity models, newsletters, principles, podc…☆84Updated this week
- Responsible AI knowledge base☆107Updated 2 years ago
- Ranking of fine-tuned HF models as base models.☆36Updated last week
- Code for Benchmarking Language Model Agents for Data-Driven Science☆31Updated 11 months ago
- [EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search☆97Updated 9 months ago
- ☆80Updated last year
- ☆81Updated this week
- Notebooks for training universal 0-shot classifiers on many different tasks☆136Updated 8 months ago
- Finding semantically meaningful and accurate prompts.☆48Updated last year
- TARGET is a benchmark for evaluating Table Retrieval for Generative Tasks such as Fact Verification and Text-to-SQL☆23Updated 2 months ago