apple / ml-entity-deduction-arena
☆26 Updated 5 months ago
Related projects
Alternatives and complementary repositories for ml-entity-deduction-arena
- ☆40 Updated 7 months ago
- Whispering Experts: Neural Interventions for Toxicity Mitigation in Language Models, ICML 2024 ☆15 Updated 4 months ago
- Advantage Leftover Lunch Reinforcement Learning (A-LoL RL): Improving Language Models with Advantage-based Offline Policy Gradients ☆24 Updated 2 months ago
- Implementation of the general framework for AMIE, from the paper "Towards Conversational Diagnostic AI", out of Google DeepMind ☆52 Updated last month
- Evaluation of neuro-symbolic engines ☆33 Updated 3 months ago
- Repository for the code of the "PPL-MCTS: Constrained Textual Generation Through Discriminator-Guided Decoding" paper, NAACL'22 ☆64 Updated 2 years ago
- Repository for the paper Stream of Search: Learning to Search in Language ☆84 Updated 3 months ago
- ☆24 Updated 6 months ago
- ☆44 Updated last year
- ☆73 Updated 4 months ago
- ☆50 Updated last week
- Language models scale reliably with over-training and on downstream tasks ☆94 Updated 7 months ago
- Minimum Description Length probing for neural network representations ☆16 Updated last week
- Embedding Recycling for Language models ☆38 Updated last year
- Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model ☆41 Updated 9 months ago
- ☆50 Updated last month
- This repository includes code for the paper "Does Localization Inform Editing? Surprising Differences in Where Knowledge Is Stored vs. Ca… ☆55 Updated last year
- Generating and validating natural-language explanations. ☆40 Updated this week
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment ☆46 Updated 2 months ago
- [ICML 24 NGSM workshop] Associative Recurrent Memory Transformer implementation and scripts for training and evaluating ☆29 Updated this week
- NanoGPT-like codebase for LLM training ☆73 Updated this week
- Critique-out-Loud Reward Models ☆36 Updated 3 weeks ago
- Yet another random morning idea to be quickly tried and architecture shared if it works; to allow the transformer to pause for any amount… ☆49 Updated last year
- Sparse and discrete interpretability tool for neural networks ☆53 Updated 8 months ago
- Repository for NPHardEval, a quantified-dynamic benchmark of LLMs ☆48 Updated 7 months ago
- Official repository of paper "RNNs Are Not Transformers (Yet): The Key Bottleneck on In-context Retrieval" ☆24 Updated 6 months ago
- Flow of Reasoning: Training LLMs for Divergent Problem Solving with Minimal Examples ☆38 Updated last month
- Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl… ☆56 Updated 2 months ago