[ICLR2025 Spotlight] Advantage-Guided Distillation for Preference Alignment in Small Language Models
☆24Feb 10, 2025Updated last year
Alternatives and similar repositories for ADPA
Users that are interested in ADPA are comparing it to the libraries listed below
Sorting:
- [ICLR 2025] Weighted-Reward Preference Optimization for Implicit Model Fusion☆14Mar 17, 2025Updated 11 months ago
- ☆24Jul 25, 2024Updated last year
- [ICLR 2022] "Sparsity Winning Twice: Better Robust Generalization from More Efficient Training" by Tianlong Chen*, Zhenyu Zhang*, Pengjun…☆40Mar 20, 2022Updated 3 years ago
- Official PyTorch Implementation of Federated Learning with Positive and Unlabeled Data☆10Aug 12, 2022Updated 3 years ago
- ☆16Jun 25, 2025Updated 8 months ago
- NuART-Py: Python Library of Adaptive Resonance Theory Neural Network☆10Jan 26, 2020Updated 6 years ago
- ☆22Dec 11, 2025Updated 2 months ago
- A Caffe/C++ implementation of Deep Deterministic Policy Gradient☆10Feb 1, 2019Updated 7 years ago
- ☆14Aug 12, 2024Updated last year
- Debiasing Through Data Attribution☆12May 23, 2024Updated last year
- ☆31Feb 3, 2026Updated 3 weeks ago
- Part of a research scholarship. I built a basic 2d driving sim with simulated lidar data to train Deep Q Neural Network. So far after abo…☆11Feb 15, 2017Updated 9 years ago
- ADAPTIVE RESONANCE THEORY. Gail A. Carpenter and Stephen Grossberg☆10Feb 10, 2015Updated 11 years ago
- [CVPR 2025] Silence is Golden: Leveraging Adversarial Examples to Nullify Audio Control in LDM-based Talking-Head Generation☆19Dec 18, 2025Updated 2 months ago
- Source code for journal paper "Multiagent Reinforcement Learning With Sparse Interactions by Negotiation and Knowledge Transfer"☆13Dec 26, 2017Updated 8 years ago
- Code for the AAAI 2024 paper: "AGS: Affordable and Generalizable Substitute Training for Transferable Adversarial Attack" (accepted).☆12Mar 28, 2024Updated last year
- A python implementation of PROCLUS: PROjected CLUStering algorithm.☆10Jan 12, 2015Updated 11 years ago
- This project implements two dynamic spatiotemporal interpolation (DST) methods, i.e., coarse-grained DST (CGDST) and fine-grained DST (FG…☆11Apr 15, 2022Updated 3 years ago
- Bidirectional Likelihood Estimation with Multi-Modal Large Language Models for Text-Video Retrieval (ICCV 2025 Highlight)☆20Aug 1, 2025Updated 6 months ago
- A new multi-task learning framework using Vision Transformers☆11Jun 19, 2024Updated last year
- ☆10Jul 20, 2020Updated 5 years ago
- Official implementation for the paper "Sample-Then-Optimize Batch Neural Thompson Sampling", published at NeurIPS 2022.☆10Oct 13, 2022Updated 3 years ago
- Repository for score-based transport modeling.☆11Jul 22, 2023Updated 2 years ago
- Official code for the paper "Does CLIP's Generalization Performance Mainly Stem from High Train-Test Similarity?" (ICLR 2024)☆10Aug 26, 2024Updated last year
- core placement optimization☆13Dec 25, 2021Updated 4 years ago
- ☆11Oct 20, 2023Updated 2 years ago
- [Neurips 2024] This repository is the official implementation of the Spatio-hemispherical equivariant convolution for dMRI deconvolution …☆10Dec 24, 2024Updated last year
- Code for Federated Generalized Bayesian Learning via Distributed Stein Variational Gradient Descent☆10Nov 19, 2020Updated 5 years ago
- Experiments with reasoning models, training techniques, papers☆24Updated this week
- Implementation of our NeurIPS 2019 paper: Subspace Attack: Exploiting Promising Subspaces for Query-Efficient Black-box Attacks☆10Dec 16, 2019Updated 6 years ago
- Code associated with our paper "Estimating Risk and Uncertainty in Reinforcement Learning"☆11Oct 3, 2023Updated 2 years ago
- Implementation of our ICLR 2021 paper: Policy-Driven Attack: Learning to Query for Hard-label Black-box Adversarial Examples.☆11Mar 9, 2021Updated 4 years ago
- TransMix: Transformer-based Value Function Decomposition for Cooperative Multi-agent Reinforcement Learning☆11Oct 18, 2022Updated 3 years ago
- Google AI Research☆10Mar 11, 2020Updated 5 years ago
- Code of paper "HyperVLA: Efficient Inference in Vision-Language-Action Models via Hypernetworks"☆22Oct 8, 2025Updated 4 months ago
- ☆13Feb 21, 2025Updated last year
- Code repo for the ICML 2021 paper "Making Paper Reviewing Robust to Bid Manipulation Attacks".☆10Sep 15, 2021Updated 4 years ago
- Multi-view Broad Learning Systerm☆10Mar 20, 2022Updated 3 years ago
- ☆13Mar 16, 2025Updated 11 months ago