apple / ml-aura
Whispering Experts: Neural Interventions for Toxicity Mitigation in Language Models, ICML 2024
☆25 Updated last year
Alternatives and similar repositories for ml-aura
Users who are interested in ml-aura are comparing it to the repositories listed below
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment ☆61 Updated last year
- ☆57 Updated last year
- ☆29 Updated 11 months ago
- [NeurIPS 2024] Goldfish Loss: Mitigating Memorization in Generative LLMs ☆94 Updated last year
- Minimum Description Length probing for neural network representations ☆20 Updated last year
- ☆44 Updated last year
- Embedding Recycling for Language models ☆38 Updated 2 years ago
- ☆91 Updated last month
- ☆44 Updated 4 years ago
- Understanding the correlation between different LLM benchmarks ☆29 Updated 2 years ago
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P… ☆35 Updated 2 years ago
- Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model ☆45 Updated 4 months ago
- MEXMA: Token-level objectives improve sentence representations ☆42 Updated last year
- Aioli: A unified optimization framework for language model data mixing ☆32 Updated last year
- Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization" ☆91 Updated last year
- Code for the ACL 2023 paper: "Rethinking the Role of Scale for In-Context Learning: An Interpretability-based Case Study at 66 Billion Sc… ☆35 Updated 2 years ago
- Code for the ICLR 2024 paper "How to catch an AI liar: Lie detection in black-box LLMs by asking unrelated questions" ☆71 Updated last year
- Source code of "Dr.LLM: Dynamic Layer Routing in LLMs" ☆41 Updated 3 months ago
- PyTorch library for Active Fine-Tuning ☆96 Updated 4 months ago
- ☆59 Updated last year
- Finding semantically meaningful and accurate prompts. ☆48 Updated 2 years ago
- Generating Summaries with Controllable Readability Levels (EMNLP 2023) ☆14 Updated 6 months ago
- ReBase: Training Task Experts through Retrieval Based Distillation ☆29 Updated last year
- EMNLP 2024 "Re-reading improves reasoning in large language models". Simply repeating the question to get bidirectional understanding for… ☆28 Updated last year
- Supercharge huggingface transformers with model parallelism. ☆78 Updated 6 months ago
- Code and Dataset for Learning to Solve Complex Tasks by Talking to Agents ☆24 Updated 3 years ago
- ☆53 Updated 2 years ago
- ☆77 Updated last year
- Codebase accompanying the Summary of a Haystack paper. ☆80 Updated last year
- The Official Repository for "Bring Your Own Data! Self-Supervised Evaluation for Large Language Models" ☆107 Updated 2 years ago