Thinklab-SJTU / BiLAFLinks
Official implementation of Our NeurIPS 2024 Paper "Boundary Matters: A Bi-Level Active Finetuning Method"
☆14Updated 11 months ago
Alternatives and similar repositories for BiLAF
Users that are interested in BiLAF are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2023] Code release for "Going Beyond Linear Mode Connectivity: The Layerwise Linear Feature Connectivity"☆19Updated 2 years ago
- [NeurIPS 2024 Datasets and Benchmarks Track] Benchmarking PtO and PnO Methods in the Predictive Combinatorial Optimization Regime☆23Updated 10 months ago
- [ICLR 2023, ICLR DG oral] PAIR, the optimizer and model selection criteria for OOD Generalization☆54Updated last year
- [ICLR 2025] Code&Data for the paper "Super(ficial)-alignment: Strong Models May Deceive Weak Models in Weak-to-Strong Generalization"☆13Updated last year
- [ICML 2023] Taxonomy-Structured Domain Adaptation☆12Updated 2 years ago
- PyTorch implementation of "Online Hyperparameter Optimization for Class-Incremental Learning" (AAAI 2023 Oral)☆17Updated 2 years ago
- [NeurIPS 2025] What Makes a Reward Model a Good Teacher? An Optimization Perspective☆42Updated 4 months ago
- ☆11Updated last year
- Representation Surgery for Multi-Task Model Merging. ICML, 2024.☆47Updated last year
- Code for ICLR'24 workshop ME-FoMo-How Well Does GPT-4V(ision) Adapt to Distribution Shifts? A Preliminary Investigation☆38Updated last year
- This repo contains the source code for VB-LoRA: Extreme Parameter Efficient Fine-Tuning with Vector Banks (NeurIPS 2024).☆42Updated last year
- code for Learning the Unlearned: Mitigating Feature Suppression in Contrastive Learning☆20Updated last year
- ☆12Updated last year
- [ICLR 2025] Linear Combination of Saved Checkpoints Makes Consistency and Diffusion Models Better☆16Updated 11 months ago
- Unofficial Implementation of Selective Attention Transformer☆20Updated last year
- PyTorch implementation of the paper "Discovering and Explaining the Representation Bottleneck of DNNs" (ICLR 2022 Oral)☆37Updated last year
- AdaMerging: Adaptive Model Merging for Multi-Task Learning. ICLR, 2024.☆99Updated last year
- (NeurIPS 2024) What Makes CLIP More Robust to Long-Tailed Pre-Training Data? A Controlled Study for Transferable Insights☆28Updated last year
- [ICML 2024] Code release for "On the Emergence of Cross-Task Linearity in Pretraining-Finetuning Paradigm"☆11Updated 11 months ago
- Official implementation for Sparse MetA-Tuning (SMAT)☆17Updated 6 months ago
- Official Code for ICLR 2024 Paper: Non-negative Contrastive Learning☆47Updated last year
- Official Code for NeurIPS 2022 Paper: How Mask Matters: Towards Theoretical Understandings of Masked Autoencoders☆68Updated 2 years ago
- Applies ROME and MEMIT on Mamba-S4 models☆14Updated last year
- exploring whether LLMs perform case-based or rule-based reasoning☆30Updated last year
- Minimalist RL for Diffusion LLMs with SOTA reasoning performance (89.1% GSM8K). Official implementation of "The Flexibility Trap".☆111Updated 2 weeks ago
- [CVPR 2022] "The Principle of Diversity: Training Stronger Vision Transformers Calls for Reducing All Levels of Redundancy" by Tianlong C…☆25Updated 3 years ago
- [ICLR 2026] "Co-rewarding: Stable Self-supervised RL for Eliciting Reasoning in Large Language Models"☆48Updated this week
- This is the project for IRM methods☆12Updated 4 years ago
- ICLR2024 statistics☆47Updated 2 years ago
- [ICML 2024] How Interpretable Are Interpretable Graph Neural Networks?☆15Updated last year