Principled Data Selection for Alignment: The Hidden Risks of Difficult Examples
☆44Jul 16, 2025Updated 8 months ago
Alternatives and similar repositories for SelectiveDPO
Users who are interested in SelectiveDPO are comparing it to the libraries listed below
- DYNAIL: Dynamics Adapted Imitation Learning☆14Jul 11, 2023Updated 2 years ago
- Reproducing R1 for Code with Reliable Rewards☆12Apr 9, 2025Updated 11 months ago
- [ICLR 2025] Unintentional Unalignment: Likelihood Displacement in Direct Preference Optimization☆32Jan 7, 2026Updated 2 months ago
- EMNLP 2024: Model Editing Harms General Abilities of Large Language Models: Regularization to the Rescue☆38May 26, 2025Updated 9 months ago
- Some examples of drawing illustration plots for papers using the seaborn package☆15Sep 22, 2019Updated 6 years ago
- Systematic Multi-Trait AAV Capsid Engineering for Efficient Gene Delivery (Eid et al., Nature Communications, 2024)☆11Aug 26, 2024Updated last year
- Code for ICML21 paper "Learning Self-Modulating Attention in Continuous Time Space with Applications to Sequential Recommendation"☆12Feb 8, 2023Updated 3 years ago
- Structure-based out-of-distribution (OOD) material property prediction: a benchmark study☆15May 17, 2025Updated 10 months ago
- This is the official code for the paper "Lazy Safety Alignment for Large Language Models against Harmful Fine-tuning" (NeurIPS2024)☆26Sep 10, 2024Updated last year
- AAAI2025☆11Apr 18, 2025Updated 11 months ago
- This is the implementation of our CVPR'23 paper "Class-Conditional Sharpness-Aware Minimization for Deep Long-Tailed Recognition".☆21Dec 16, 2023Updated 2 years ago
- Data Files for "Deep diversification of an AAV capsid protein by machine learning"☆18Mar 9, 2021Updated 5 years ago
- Domain-specific preference (DSP) data and customized RM fine-tuning.☆25Mar 7, 2024Updated 2 years ago
- [TPAMI 2024] The official implementation of "Revisiting Realistic Test-Time Training: Sequential Inference and Adaptation by Anchored Clu…☆11Mar 19, 2024Updated 2 years ago
- A repository containing a collection of resources and papers on Imbalance Learning On Graphs☆96May 29, 2025Updated 9 months ago
- [CIKM 2025] Constraint Back-translation Improves Complex Instruction Following of Large Language Models☆17May 23, 2025Updated 9 months ago
- ☆11Dec 8, 2022Updated 3 years ago
- Website for release of TellMeWhy dataset for why question answering☆14Nov 11, 2022Updated 3 years ago
- [ICML 2022] pGNN, p-Laplacian Based Graph Neural Networks☆27Aug 26, 2025Updated 6 months ago
- Source code of an ICML 2021 paper, A Bit More Bayesian: Domain-Invariant Learning with Uncertainty☆12Jan 20, 2022Updated 4 years ago
- ☆12Nov 15, 2022Updated 3 years ago
- ☆12Sep 15, 2021Updated 4 years ago
- This repo explores how to use AMR to address tasks difficult for LLMs☆13Jan 15, 2024Updated 2 years ago
- ☆14Jun 6, 2023Updated 2 years ago
- Code for EMNLP 2021 paper "Measuring Association Between Labels and Free-Text Rationales"☆12Sep 12, 2023Updated 2 years ago
- MUFFIN: Curating Multi-Faceted Instructions for Improving Instruction-Following☆16Oct 31, 2024Updated last year
- NeurIPS'22 Oral: EquiVSet - Learning Neural Set Functions Under the Optimal Subset Oracle☆21Dec 23, 2022Updated 3 years ago
- [ICLR 2024] Adaptive Replay Ratio implementation from 'Revisiting Plasticity in Visual RL: Data, Modules and Training Stages'.☆13Oct 9, 2024Updated last year
- Unofficial baselines for ManiSkill, including RL and BC algorithms.☆18Jun 6, 2024Updated last year
- Official implementation of "Fair Resource Allocation in Multi-Task Learning" [ICML 2024]☆17Oct 22, 2024Updated last year
- Code for our paper "Auxiliary Task Reweighting for Minimum-data Learning" (NeurIPS 2020)☆18Dec 21, 2020Updated 5 years ago
- Guangdong Province "Pearl River Talent Program": Service Robot Intelligent Engine Platform☆56May 14, 2024Updated last year
- A System for Morphology-Task Generalization via Unified Representation and Behavior Distillation (ICLR2023)☆14Feb 3, 2023Updated 3 years ago
- Group-conditional DRO to alleviate spurious correlations☆15Jul 15, 2021Updated 4 years ago
- ☆25Dec 13, 2024Updated last year
- [AAAI 21] Utilizing meta-learning to correct the noisy labels.☆15Apr 26, 2021Updated 4 years ago
- This repo contains the code of "Structure-Aware Transformer Policy for Inhomogeneous Multi-Task Reinforcement Learning".☆13May 20, 2022Updated 3 years ago
- ☆11Jun 2, 2022Updated 3 years ago
- Code for the paper "Stable Gradients for Stable Learning at Scale in Deep Reinforcement Learning". Great performance in many environments…☆38Oct 24, 2025Updated 4 months ago