Principled Data Selection for Alignment: The Hidden Risks of Difficult Examples
☆44 · Jul 16, 2025 · Updated 7 months ago
Alternatives and similar repositories for SelectiveDPO
Users who are interested in SelectiveDPO are comparing it to the libraries listed below.
- Code for AAAI21 paper "Scalable and Explainable 1-Bit Matrix Completion via Graph Signal Learning" ☆11 · Feb 15, 2022 · Updated 4 years ago
- Code for ICML21 paper "Learning Self-Modulating Attention in Continuous Time Space with Applications to Sequential Recommendation" ☆12 · Feb 8, 2023 · Updated 3 years ago
- Structure-based out-of-distribution (OOD) material property prediction: a benchmark study ☆15 · May 17, 2025 · Updated 9 months ago
- DYNAIL: Dynamics Adapted Imitation Learning ☆14 · Jul 11, 2023 · Updated 2 years ago
- Reproducing R1 for Code with Reliable Rewards ☆12 · Apr 9, 2025 · Updated 10 months ago
- EMNLP 2024: Model Editing Harms General Abilities of Large Language Models: Regularization to the Rescue ☆38 · May 26, 2025 · Updated 9 months ago
- Group-conditional DRO to alleviate spurious correlations ☆15 · Jul 15, 2021 · Updated 4 years ago
- Code repository for the paper "Invariant and Transportable Representations for Anti-Causal Domain Shifts" ☆16 · Jul 4, 2022 · Updated 3 years ago
- [ICLR 2023] Learnable Randomness Injection (LRI) for interpretable Geometric Deep Learning. ☆24 · Jul 18, 2023 · Updated 2 years ago
- NeurIPS'22 Oral: EquiVSet - Learning Neural Set Functions Under the Optimal Subset Oracle ☆21 · Dec 23, 2022 · Updated 3 years ago
- Pytorch implementation of EvenNet. ☆20 · Oct 25, 2022 · Updated 3 years ago
- This is the official code for the paper "Lazy Safety Alignment for Large Language Models against Harmful Fine-tuning" (NeurIPS2024) ☆25 · Sep 10, 2024 · Updated last year
- Cross-Domain Imitation Learning via Optimal Transport ☆25 · Jun 24, 2022 · Updated 3 years ago
- ☆23 · Feb 8, 2022 · Updated 4 years ago
- ☆23 · Jun 15, 2022 · Updated 3 years ago
- Domain-specific preference (DSP) data and customized RM fine-tuning. ☆25 · Mar 7, 2024 · Updated last year
- This is the implementation of our CVPR'23 paper "Class-Conditional Sharpness-Aware Minimization for Deep Long-Tailed Recognition". ☆21 · Dec 16, 2023 · Updated 2 years ago
- DiWA: Diverse Weight Averaging for Out-of-Distribution Generalization ☆31 · Jan 31, 2023 · Updated 3 years ago
- [ICLR 2025] Unintentional Unalignment: Likelihood Displacement in Direct Preference Optimization ☆32 · Jan 7, 2026 · Updated last month
- [ICLR'22] Self-supervised learning optimally robust representations for domain shift. ☆25 · Feb 2, 2022 · Updated 4 years ago
- ☆29 · Jul 12, 2022 · Updated 3 years ago
- Benchmark for Natural Temporal Distribution Shift (NeurIPS 2022) ☆68 · Mar 29, 2023 · Updated 2 years ago
- Collection of papers and resources for data augmentation (DA) in visual reinforcement learning (RL). ☆80 · Mar 27, 2024 · Updated last year
- Rethinking Graph Regularization for Graph Neural Networks (AAAI2021) ☆34 · Jun 6, 2021 · Updated 4 years ago
- Molecular Out-Of-Distribution ☆39 · Apr 16, 2025 · Updated 10 months ago
- Code for Paper (Preserving Diversity in Supervised Fine-tuning of Large Language Models) ☆52 · May 12, 2025 · Updated 9 months ago
- [ICML 2022] pGNN, p-Laplacian Based Graph Neural Networks ☆27 · Aug 26, 2025 · Updated 6 months ago
- ☆38 · Jul 13, 2022 · Updated 3 years ago
- Code for the paper "Distinguishing the Knowable from the Unknowable with Language Models" ☆11 · Apr 15, 2024 · Updated last year
- ☆13 · Jun 18, 2025 · Updated 8 months ago
- G-NIA model from "Single Node Injection Attack against Graph Neural Networks" (CIKM 2021) ☆29 · Jan 11, 2022 · Updated 4 years ago
- [NeurIPS 2023 D&B Track] Code and data for paper "Revisiting Out-of-distribution Robustness in NLP: Benchmarks, Analysis, and LLMs Evalua… ☆36 · Jun 8, 2023 · Updated 2 years ago
- [NeurIPS 2023] "Rethinking Tokenizer and Decoder in Masked Graph Modeling for Molecules" ☆40 · Mar 16, 2024 · Updated last year
- ☆40 · Aug 12, 2024 · Updated last year
- [ICLR 2022] Understanding and Improving Graph Injection Attack by Promoting Unnoticeability ☆38 · Nov 27, 2023 · Updated 2 years ago
- This is the source code for Detecting Adversarial Data by Probing Multiple Perturbations Using Expected Perturbation Score (ICML2023). ☆40 · Oct 15, 2024 · Updated last year
- grpo to train long form QA and instructions with long-form reward model ☆17 · Jul 17, 2025 · Updated 7 months ago
- Systematic Multi-Trait AAV Capsid Engineering for Efficient Gene Delivery (Eid et al., Nature Communications, 2024) ☆11 · Aug 26, 2024 · Updated last year
- Graphical intuition to MOSFET square-law ☆11 · Jan 5, 2021 · Updated 5 years ago