kiddyboots216 / lottery-ticket-adaptation
Lottery Ticket Adaptation
☆39Updated last year
Alternatives and similar repositories for lottery-ticket-adaptation
Users that are interested in lottery-ticket-adaptation are comparing it to the libraries listed below
- A repository for research on medium sized language models.☆77Updated last year
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆61Updated last year
- When Reasoning Meets Its Laws☆34Updated 2 weeks ago
- ☆33Updated last year
- ☆91Updated last year
- Understanding the correlation between different LLM benchmarks☆29Updated 2 years ago
- Official repository of "LiNeS: Post-training Layer Scaling Prevents Forgetting and Enhances Model Merging"☆31Updated last year
- Resa: Transparent Reasoning Models via SAEs☆47Updated 3 months ago
- This is the official repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆17Updated last year
- The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling☆41Updated 3 weeks ago
- ☆55Updated last year
- Official repository for "BLEUBERI: BLEU is a surprisingly effective reward for instruction following"☆31Updated 7 months ago
- Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model☆45Updated 3 months ago
- ☆19Updated last year
- The repository contains code for Adaptive Data Optimization☆30Updated last year
- Plug in & Play PyTorch Implementation of the paper: "Evolutionary Optimization of Model Merging Recipes" by Sakana AI☆31Updated last year
- Fork of Flame repo for training of some new stuff in development☆19Updated 2 weeks ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's PyTorch Lightning suite.☆35Updated last year
- Exploration of automated dataset selection approaches at large scales.☆53Updated 10 months ago
- Code Implementation, Evaluations, Documentation, Links and Resources for Min P paper☆46Updated 5 months ago
- Official implementation of the ICML 2024 paper RoSA (Robust Adaptation)☆44Updated last year
- NeurIPS 2024 tutorial on LLM Inference☆47Updated last year
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models☆41Updated last year
- [ACL2025 Findings] Benchmarking Multihop Multimodal Internet Agents☆47Updated 10 months ago
- Official implementation of Regularized Policy Gradient (RPG) (https://arxiv.org/abs/2505.17508)☆64Updated 2 weeks ago
- [NeurIPS'24 LanGame workshop] On The Planning Abilities of OpenAI's o1 Models: Feasibility, Optimality, and Generalizability☆41Updated 6 months ago
- [ACL 2025] Are Your LLMs Capable of Stable Reasoning?☆32Updated 5 months ago
- ☆39Updated last year
- ☆29Updated 2 months ago
- Code for reproducing our paper "Low Rank Adapting Models for Sparse Autoencoder Features"☆17Updated 9 months ago