Code accompanying the paper "Noise Contrastive Alignment of Language Models with Explicit Rewards" (NeurIPS 2024)
☆58Nov 8, 2024Updated last year
Alternatives and similar repositories for Noise-Contrastive-Alignment
Users that are interested in Noise-Contrastive-Alignment are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Codes accompanying the paper "Toward Guidance-Free AR Visual Generation via Condition Contrastive Alignment"☆36Feb 11, 2025Updated last year
- Official implementation of HEGNN, a novel high-degree equivariant graph neural network proposed in the NeurIPS 2024 paper 'Are High-Degre…☆33Nov 8, 2024Updated last year
- Self-Supervised Alignment with Mutual Information☆20May 24, 2024Updated last year
- Official repository for ALT (ALignment with Textual feedback).☆10Jul 25, 2024Updated last year
- [ICML 2025] Official code of "AlphaDPO: Adaptive Reward Margin for Direct Preference Optimization"☆30Jan 10, 2026Updated 2 months ago
- [NeurIPS 2024] The implementation for the paper "Geometric Trajectory Diffusion Models".☆35Jul 22, 2025Updated 8 months ago
- Equivariant Diffusion for Crystal Structure Prediction (ICML 2024)☆29Aug 20, 2024Updated last year
- Codes accompanying the paper "Offline Reinforcement Learning with Value-Based Episodic Memory" (ICLR 2022 https://arxiv.org/abs/2110.0979…☆15Mar 9, 2022Updated 4 years ago
- ☆131Oct 1, 2024Updated last year
- Source code for our paper: "ARIA: Training Language Agents with Intention-Driven Reward Aggregation".☆28Aug 9, 2025Updated 7 months ago
- Short RL☆18May 26, 2025Updated 9 months ago
- [ICML 2025 Spotlight] Direct Discriminative Optimization: Reinforcing Diffusion/Autoregressive with GAN Discrimination☆118Jan 27, 2026Updated last month
- Jax implementation of VIT-VQGAN☆10Jan 25, 2024Updated 2 years ago
- The code for paper "ProQA: Structural Prompt-based Pre-training for Unified Question Answering"☆11Feb 7, 2023Updated 3 years ago
- ☆59Aug 22, 2024Updated last year
- Source code to accompany research paper on training multi token prediction language models using self-distillation.☆27Feb 21, 2026Updated last month
- ☆21Feb 15, 2024Updated 2 years ago
- ☆46Jun 11, 2025Updated 9 months ago
- PyTorch implementation for our paper "Efficient Meta Reinforcement Learning for Preference-based Fast Adaptation"☆13Apr 19, 2023Updated 2 years ago
- Understanding R1-Zero-Like Training: A Critical Perspective☆1,232Aug 27, 2025Updated 6 months ago
- Official implementation of the paper "From Complex to Simple: Enhancing Multi-Constraint Complex Instruction Following Ability of Large L…☆53Jun 24, 2024Updated last year
- Implementation of VALOR (Variational Option Discovery Algorithms)☆10Jun 28, 2019Updated 6 years ago
- Use time-splits for Materials Project entries for generative modeling benchmarking.☆12Mar 12, 2026Updated last week
- implementation of Wasserstein Natural Policy Gradients and Wasserstein Natural Evolution Strategies☆13Mar 9, 2021Updated 5 years ago
- Collection of forcing related autoregressive video Gen☆96Feb 27, 2026Updated 3 weeks ago
- Project code for training LLMs to write better unit tests + code☆21May 19, 2025Updated 10 months ago
- Welcome to the 'In Context Learning Theory' Reading Group☆30Nov 8, 2024Updated last year
- Ultra-minimal autoregressive diffusion model for image generation☆21Dec 26, 2025Updated 2 months ago
- Corpus to accompany: "Selective Vision is the Challenge for Visual Reasoning: A Benchmark for Visual Argument Understanding"☆11Apr 11, 2025Updated 11 months ago
- ☆27Updated this week
- Code for the paper "AsFT: Anchoring Safety During LLM Fune-Tuning Within Narrow Safety Basin".☆36Jul 10, 2025Updated 8 months ago
- Official code for "Maximum Likelihood Training for Score-Based Diffusion ODEs by High-Order Denoising Score Matching" (ICML 2022)☆65Sep 14, 2022Updated 3 years ago
- ☆28Nov 10, 2025Updated 4 months ago
- ☆52Jun 13, 2025Updated 9 months ago
- A list of research resources that I've appreciated.☆12Dec 10, 2019Updated 6 years ago
- Official repository for the paper "Peptide design through binding interface mimicry with PepMimic" accepted by Nature Biomedical Engineer…☆29Oct 20, 2025Updated 5 months ago
- Fully open reproduction of DeepSeek-R1☆11Mar 24, 2025Updated last year
- Codebase for Context-aware Meta-learned Loss Scaling (CaMeLS). https://arxiv.org/abs/2305.15076.☆25Jan 23, 2024Updated 2 years ago
- This repo explores how AMR to address tasks difficult for LLMs☆13Jan 15, 2024Updated 2 years ago