apple / ml-auraLinks

Whispering Experts: Neural Interventions for Toxicity Mitigation in Language Models, ICML 2024

☆21

Alternatives and similar repositories for ml-aura

Users that are interested in ml-aura are comparing it to the libraries listed below

Sorting:

ContextualAI / CLAIR_and_APO
Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment
☆60Updated 11 months ago
apple / ml-planner
☆54Updated last year
apple / ml-entity-deduction-arena
☆33Updated last year
jaehunjung1 / cascaded-selective-evaluation
☆26Updated 5 months ago
r-three / phatgoose
Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization"
☆86Updated last year
ShiZhengyan / PowerfulPromptFT
[NeurIPS 2023 Main Track] This is the repository for the paper titled "Don’t Stop Pretraining? Make Prompt-based Fine-tuning Powerful Lea…
☆74Updated last year
EleutherAI / semantic-memorization
☆44Updated 8 months ago
r-three / RAD
Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model
☆44Updated last year
kyegomez / Infini-attention
Implementation of the paper: "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" from Google in pyTO…
☆56Updated last week
ahans30 / goldfish-loss
[NeurIPS 2024] Goldfish Loss: Mitigating Memorization in Generative LLMs
☆91Updated 8 months ago
justinlovelace / Diffusion-Guided-LM
☆27Updated last year
srush / LLM-Talk
☆51Updated last year
kaistAI / factual-knowledge-acquisition
☆21Updated 3 months ago
nuochenpku / LLaMA_Analysis
This is official project in our paper: Is Bigger and Deeper Always Better? Probing LLaMA Across Scales and Layers
☆30Updated last year
Nix07 / finetuning
This repository contains the code used for the experiments in the paper "Fine-Tuning Enhances Existing Mechanisms: A Case Study on Entity…
☆27Updated last year
para-lost / ReBase
ReBase: Training Task Experts through Retrieval Based Distillation
☆29Updated 6 months ago
mlfoundations / scaling
Language models scale reliably with over-training and on downstream tasks
☆97Updated last year
google-research-datasets / QAmeleon
QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P…
☆34Updated last year
Zyphra / Zyda_processing
☆37Updated last year
JoshEngels / MultiDimensionalFeatures
Code for reproducing our paper "Not All Language Model Features Are Linear"
☆77Updated 8 months ago
facebookresearch / mexma
MEXMA: Token-level objectives improve sentence representations
☆41Updated 7 months ago
lucidrains / AMIE-pytorch
Implementation of the general framework for AMIE, from the paper "Towards Conversational Diagnostic AI", out of Google Deepmind
☆66Updated 10 months ago
adihaviv / nopos
☆22Updated 2 years ago
hadasah / btm
☆75Updated last year
IBM / benchbench
A package dedicated for running benchmark agreement testing
☆17Updated 2 months ago
google-research-datasets / swim-ir
SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…
☆49Updated last year
mcleish7 / gemstone-scaling-laws
☆27Updated 5 months ago
allenai / EmbeddingRecycling
Embedding Recycling for Language models
☆39Updated 2 years ago
bminixhofer / tokenkit
A toolkit implementing advanced methods to transfer models and model knowledge across tokenizers.
☆40Updated last month
HazyResearch / aioli
Aioli: A unified optimization framework for language model data mixing
☆27Updated 6 months ago