Code accompanying the paper "Noise Contrastive Alignment of Language Models with Explicit Rewards" (NeurIPS 2024)
☆58Nov 8, 2024Updated last year
Alternatives and similar repositories for Noise-Contrastive-Alignment
Users who are interested in Noise-Contrastive-Alignment are comparing it to the libraries listed below.
- Codes accompanying the paper "Toward Guidance-Free AR Visual Generation via Condition Contrastive Alignment"☆38Feb 11, 2025Updated last year
- Official implementation of HEGNN, a novel high-degree equivariant graph neural network proposed in the NeurIPS 2024 paper 'Are High-Degre…☆34Nov 8, 2024Updated last year
- Self-Supervised Alignment with Mutual Information☆20May 24, 2024Updated 2 years ago
- Official repository for ALT (ALignment with Textual feedback).☆10Jul 25, 2024Updated last year
- [ICML 2025] Official code of "AlphaDPO: Adaptive Reward Margin for Direct Preference Optimization"☆31Jan 10, 2026Updated 5 months ago
- [NeurIPS 2024] The implementation for the paper "Geometric Trajectory Diffusion Models".☆39Jul 22, 2025Updated 11 months ago
- Equivariant Diffusion for Crystal Structure Prediction (ICML 2024)☆29Aug 20, 2024Updated last year
- Codes accompanying the paper "Offline Reinforcement Learning with Value-Based Episodic Memory" (ICLR 2022 https://arxiv.org/abs/2110.0979…☆15Mar 9, 2022Updated 4 years ago
- ☆131Oct 1, 2024Updated last year
- Source code for our paper: "ARIA: Training Language Agents with Intention-Driven Reward Aggregation".☆30Aug 9, 2025Updated 10 months ago
- Short RL☆18Apr 16, 2026Updated 2 months ago
- [ICML 2025 Spotlight] Direct Discriminative Optimization: Reinforcing Diffusion/Autoregressive with GAN Discrimination☆123Jan 27, 2026Updated 5 months ago
- The code for paper "ProQA: Structural Prompt-based Pre-training for Unified Question Answering"☆11Feb 7, 2023Updated 3 years ago
- ☆61Aug 22, 2024Updated last year
- ☆21Feb 15, 2024Updated 2 years ago
- PyTorch implementation for our paper "Efficient Meta Reinforcement Learning for Preference-based Fast Adaptation"☆13Apr 19, 2023Updated 3 years ago
- ☆47Jun 11, 2025Updated last year
- Implementation for NeurIPS 2023 paper "Equivariant Flow Matching with Hybrid Probability Transport for 3D Molecule Generation"☆44May 30, 2024Updated 2 years ago
- Understanding R1-Zero-Like Training: A Critical Perspective☆1,262Aug 27, 2025Updated 10 months ago
- Official implementation of the paper "From Complex to Simple: Enhancing Multi-Constraint Complex Instruction Following Ability of Large L…☆54Jun 24, 2024Updated 2 years ago
- Implementation of VALOR (Variational Option Discovery Algorithms)☆10Jun 28, 2019Updated 7 years ago
- Use time-splits for Materials Project entries for generative modeling benchmarking.☆13Mar 12, 2026Updated 3 months ago
- implementation of Wasserstein Natural Policy Gradients and Wasserstein Natural Evolution Strategies☆13Mar 9, 2021Updated 5 years ago
- Project code for training LLMs to write better unit tests + code☆22May 19, 2025Updated last year
- Collection of forcing related autoregressive video Gen☆98Mar 31, 2026Updated 3 months ago
- Welcome to the 'In Context Learning Theory' Reading Group☆31Nov 8, 2024Updated last year
- ☆12Dec 13, 2023Updated 2 years ago
- ☆27Jun 2, 2026Updated last month
- A list of research resources that I've appreciated.☆12Dec 10, 2019Updated 6 years ago
- The official repository of "Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint"☆39Jan 12, 2024Updated 2 years ago
- Fully open reproduction of DeepSeek-R1☆11Mar 24, 2025Updated last year
- A fork of the PEFT library, supporting Robust Adaptation (RoSA)☆15Aug 16, 2024Updated last year
- Centrally manage all prompts.☆14Nov 27, 2024Updated last year
- Extensive Self-Contrast Enables Feedback-Free Language Model Alignment☆20Apr 2, 2024Updated 2 years ago
- This is the official implementation of the paper "S²R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning"☆73Apr 22, 2025Updated last year
- EMNLP'2023 (Findings): Large Language Model Is Not a Good Few-shot Information Extractor, but a Good Reranker for Hard Samples!☆47Apr 12, 2024Updated 2 years ago
- [NCMMSC'2024] Emotion-Aware Prosodic Phrasing for Expressive Text-to-Speech☆22Aug 20, 2024Updated last year
- Code to reproduce the experiments in the paper: Does CLIP Bind Concepts? Probing Compositionality in Large Image Models.☆16Oct 14, 2023Updated 2 years ago
- ☆51May 16, 2026Updated last month