apple / ml-aura
Whispering Experts: Neural Interventions for Toxicity Mitigation in Language Models, ICML 2024
☆18Updated 6 months ago
Alternatives and similar repositories for ml-aura:
Users that are interested in ml-aura are comparing it to the libraries listed below
- ☆12Updated 9 months ago
- ☆45Updated 10 months ago
- ☆12Updated 6 months ago
- Generating and validating natural-language explanations.☆46Updated 3 weeks ago
- ☆29Updated 7 months ago
- ☆23Updated 2 months ago
- Minimum Description Length probing for neural network representations☆18Updated this week
- Official code repo for paper "Great Memory, Shallow Reasoning: Limits of kNN-LMs"☆21Updated 5 months ago
- Aioli: A unified optimization framework for language model data mixing☆19Updated last week
- Tasks for describing differences between text distributions.☆16Updated 5 months ago
- Multilingual Knowledge Graph Enhancement (EMNLP 2023)☆21Updated last year
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆53Updated 4 months ago
- ☆39Updated 6 months ago
- Code Prompting Elicits Conditional Reasoning Abilities in Text+Code LLMs. EMNLP 2024☆20Updated 2 months ago
- ☆12Updated 5 months ago
- This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions.☆20Updated 7 months ago
- Implementation of the general framework for AMIE, from the paper "Towards Conversational Diagnostic AI", out of Google Deepmind☆55Updated 4 months ago
- Understanding how features learned by neural networks evolve throughout training☆32Updated 3 months ago
- ☆18Updated 7 months ago
- ☆19Updated last year
- Lottery Ticket Adaptation☆37Updated 2 months ago
- The repository contains code for Adaptive Data Optimization☆20Updated last month
- ☆30Updated 10 months ago
- This repository contains the official implementation for the ECCV'22 paper, "SPIN: An Empirical Evaluation on Sharing Parameters of Isotr…☆20Updated last year
- ☆34Updated last year
- ☆17Updated 3 months ago
- ☆28Updated last year
- ☆48Updated 11 months ago
- ☆31Updated last year
- ☆24Updated 5 months ago