apple / ml-aura
Whispering Experts: Neural Interventions for Toxicity Mitigation in Language Models, ICML 2024
☆18Updated 8 months ago
Alternatives and similar repositories for ml-aura:
Users that are interested in ml-aura are comparing it to the libraries listed below
- ☆12Updated 11 months ago
- ☆45Updated 11 months ago
- ☆29Updated 9 months ago
- ☆13Updated 8 months ago
- Tasks for describing differences between text distributions.☆16Updated 7 months ago
- ☆25Updated last year
- ☆12Updated 6 months ago
- This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions.☆20Updated 9 months ago
- Official code repo for paper "Great Memory, Shallow Reasoning: Limits of kNN-LMs"☆22Updated 6 months ago
- ☆31Updated last year
- Minimum Description Length probing for neural network representations☆19Updated last month
- Generating and validating natural-language explanations for the brain.☆49Updated this week
- This repo contains code and data for ICLR 2025 paper MIA-Bench: Towards Better Instruction Following Evaluation of Multimodal LLMs☆24Updated this week
- ☆17Updated 5 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆55Updated 6 months ago
- Code for "Merging Text Transformers from Different Initializations"☆19Updated last month
- ReBase: Training Task Experts through Retrieval Based Distillation☆28Updated last month
- ☆39Updated 7 months ago
- This is official project in our paper: Is Bigger and Deeper Always Better? Probing LLaMA Across Scales and Layers☆30Updated last year
- Ranking of fine-tuned HF models as base models.☆35Updated last year
- Measuring and Controlling Persona Drift in Language Model Dialogs☆17Updated last year
- ☆17Updated 3 months ago
- Official repository of "LiNeS: Post-training Layer Scaling Prevents Forgetting and Enhances Model Merging"☆24Updated 4 months ago
- code for paper "Accessing higher dimensions for unsupervised word translation"☆21Updated last year
- Understanding how features learned by neural networks evolve throughout training☆33Updated 4 months ago
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"☆23Updated last month