r-three / realistic_evaluation_of_model_merging_for_compositional_generalization
☆12Updated 9 months ago
Alternatives and similar repositories for realistic_evaluation_of_model_merging_for_compositional_generalization
Users who are interested in realistic_evaluation_of_model_merging_for_compositional_generalization are comparing it to the libraries listed below
- ☆20Updated last year
- ☆50Updated last year
- ☆89Updated last year
- ☆27Updated 5 months ago
- Is In-Context Learning Sufficient for Instruction Following in LLMs? [ICLR 2025]☆31Updated 6 months ago
- Code for NeurIPS 2024 Spotlight: "Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations"☆81Updated 9 months ago
- ☆14Updated last year
- PyTorch codes for the paper "An Empirical Study of Multimodal Model Merging"☆37Updated last year
- ☆17Updated last year
- Self-Supervised Alignment with Mutual Information☆21Updated last year
- Long Is More for Alignment: A Simple but Tough-to-Beat Baseline for Instruction Fine-Tuning [ICML 2024]☆18Updated last year
- Official Repo of Paper "Eliminating Position Bias of Language Models: A Mechanistic Approach"☆14Updated last month
- ☆34Updated 7 months ago
- ☆23Updated 6 months ago
- Exploration of automated dataset selection approaches at large scales.☆47Updated 5 months ago
- The repository contains code for Adaptive Data Optimization☆25Updated 8 months ago
- [ACL 2023]: Training Trajectories of Language Models Across Scales https://arxiv.org/pdf/2212.09803.pdf☆24Updated last year
- ☆27Updated 2 years ago
- ☆15Updated last year
- Online Adaptation of Language Models with a Memory of Amortized Contexts (NeurIPS 2024)☆65Updated last year
- Stanford NLP Python library for benchmarking the utility of LLM interpretability methods☆112Updated last month
- Code for reproducing our paper "Low Rank Adapting Models for Sparse Autoencoder Features"☆12Updated 4 months ago
- ☆71Updated 3 years ago
- Official implementation of Bootstrapping Language Models via DPO Implicit Rewards☆44Updated 3 months ago
- [NeurIPS 2024 Spotlight] Code and data for the paper "Finding Transformer Circuits with Edge Pruning".☆59Updated 2 weeks ago
- ☆51Updated 4 months ago
- Data for "Datamodels: Predicting Predictions with Training Data"☆97Updated 2 years ago
- Official code repo for paper "Great Memory, Shallow Reasoning: Limits of kNN-LMs"☆23Updated 3 months ago