apple / ml-auraLinks
Whispering Experts: Neural Interventions for Toxicity Mitigation in Language Models, ICML 2024
☆22Updated last year
Alternatives and similar repositories for ml-aura
Users that are interested in ml-aura are comparing it to the libraries listed below
Sorting:
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆60Updated last year
- Aioli: A unified optimization framework for language model data mixing☆27Updated 8 months ago
- Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model☆44Updated last year
- Codebase accompanying the Summary of a Haystack paper.☆79Updated last year
- ☆29Updated last month
- ☆44Updated 10 months ago
- Code and Dataset for Learning to Solve Complex Tasks by Talking to Agents☆24Updated 3 years ago
- ☆55Updated last year
- Python package for generating datasets to evaluate reasoning and retrieval of large language models☆19Updated this week
- ☆57Updated 11 months ago
- ☆26Updated 6 months ago
- Supercharge huggingface transformers with model parallelism.☆77Updated last month
- This is the official PyTorch repo for "UNIREX: A Unified Learning Framework for Language Model Rationale Extraction" (ICML 2022).☆26Updated 2 years ago
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P…☆34Updated 2 years ago
- Embedding Recycling for Language models☆39Updated 2 years ago
- ☆34Updated last year
- [NeurIPS 2023 Main Track] This is the repository for the paper titled "Don’t Stop Pretraining? Make Prompt-based Fine-tuning Powerful Lea…☆75Updated last year
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"☆24Updated last week
- ReBase: Training Task Experts through Retrieval Based Distillation☆29Updated 7 months ago
- ☆16Updated 5 months ago
- ☆54Updated 10 months ago
- ☆39Updated last year
- ☆43Updated 3 years ago
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he…☆31Updated 2 years ago
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…☆49Updated last year
- EMNLP 2024 "Re-reading improves reasoning in large language models". Simply repeating the question to get bidirectional understanding for…☆27Updated 9 months ago
- Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization"☆88Updated last year
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer☆43Updated last year
- Code for the ICLR 2024 paper "How to catch an AI liar: Lie detection in black-box LLMs by asking unrelated questions"☆72Updated last year
- Implementation of the paper: "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" from Google in pyTO…☆56Updated this week