google-deepmind / exedec
☆12Updated last year
Alternatives and similar repositories for exedec
Users that are interested in exedec are comparing it to the libraries listed below
Sorting:
- Official implementation of the transformer (TF) architecture suggested in a paper entitled "Looped Transformers as Programmable Computers…☆26Updated 2 years ago
- [ICLR 2025] "Training LMs on Synthetic Edit Sequences Improves Code Synthesis" (Piterbarg, Pinto, Fergus)☆19Updated 3 months ago
- Rewarded soups official implementation☆57Updated last year
- ☆40Updated last year
- Code for EMNLP'24 paper - On Diversified Preferences of Large Language Model Alignment☆16Updated 9 months ago
- ☆15Updated 6 months ago
- Code for "Echo Chamber: RL Post-training Amplifies Behaviors Learned in Pretraining"☆15Updated last month
- The code of paper "Toward Optimal LLM Alignments Using Two-Player Games".☆16Updated 10 months ago
- ☆14Updated last year
- ☆31Updated last year
- ☆31Updated 6 months ago
- A testbed for agents and environments that can automatically improve models through data generation.☆24Updated 2 months ago
- Self-Supervised Alignment with Mutual Information☆18Updated 11 months ago
- ☆25Updated 8 months ago
- ☆18Updated 10 months ago
- Preprint: Asymmetry in Low-Rank Adapters of Foundation Models☆36Updated last year
- Advantage Leftover Lunch Reinforcement Learning (A-LoL RL): Improving Language Models with Advantage-based Offline Policy Gradients☆26Updated 8 months ago
- Code and data for the paper "Understanding Hidden Context in Preference Learning: Consequences for RLHF"☆29Updated last year
- ☆18Updated last year
- ☆18Updated 3 months ago
- ☆34Updated last year
- [ICLR 2025] Cheating Automatic LLM Benchmarks: Null Models Achieve High Win Rates (Oral)☆78Updated 6 months ago
- PaCE: Parsimonious Concept Engineering for Large Language Models (NeurIPS 2024)☆35Updated 6 months ago
- ☆13Updated 8 months ago
- This is the repository for "Model Merging by Uncertainty-Based Gradient Matching", ICLR 2024.☆27Updated last year
- [ICML 2024] Junk DNA Hypothesis: A Task-Centric Angle of LLM Pre-trained Weights through Sparsity; Lu Yin*, Ajay Jaiswal*, Shiwei Liu, So…☆16Updated 3 weeks ago
- ☆28Updated 2 months ago
- Offcial Repo of Paper "Eliminating Position Bias of Language Models: A Mechanistic Approach""☆14Updated 8 months ago
- ☆22Updated 3 months ago
- ☆16Updated 8 months ago