apple / ml-auraLinks
Whispering Experts: Neural Interventions for Toxicity Mitigation in Language Models, ICML 2024
☆23Updated last year
Alternatives and similar repositories for ml-aura
Users that are interested in ml-aura are comparing it to the libraries listed below
Sorting:
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆60Updated last year
- Source code of "Dr.LLM: Dynamic Layer Routing in LLMs"☆39Updated last month
- ☆56Updated last year
- ReBase: Training Task Experts through Retrieval Based Distillation☆29Updated 9 months ago
- ☆43Updated 4 years ago
- A package dedicated for running benchmark agreement testing☆18Updated 2 months ago
- Understanding the correlation between different LLM benchmarks☆29Updated last year
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P…☆35Updated 2 years ago
- Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model☆44Updated last month
- ☆82Updated this week
- Minimum Description Length probing for neural network representations☆20Updated 9 months ago
- Codebase accompanying the Summary of a Haystack paper.☆79Updated last year
- ☆26Updated 8 months ago
- ☆55Updated last year
- ☆52Updated last year
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer☆44Updated last year
- Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization"☆91Updated last year
- Code for the ICLR 2024 paper "How to catch an AI liar: Lie detection in black-box LLMs by asking unrelated questions"☆71Updated last year
- [NeurIPS 2023 Main Track] This is the repository for the paper titled "Don’t Stop Pretraining? Make Prompt-based Fine-tuning Powerful Lea…☆76Updated last year
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"☆24Updated this week
- ☆39Updated last year
- ☆34Updated last year
- Aioli: A unified optimization framework for language model data mixing☆28Updated 10 months ago
- ☆26Updated last year
- ☆78Updated 3 months ago
- Implementation of the general framework for AMIE, from the paper "Towards Conversational Diagnostic AI", out of Google Deepmind☆68Updated last year
- This is official project in our paper: Is Bigger and Deeper Always Better? Probing LLaMA Across Scales and Layers☆31Updated last year
- Code for the examples presented in the talk "Training a Llama in your backyard: fine-tuning very large models on consumer hardware" given…☆14Updated 2 years ago
- ☆44Updated last year
- MEXMA: Token-level objectives improve sentence representations☆42Updated 10 months ago