Code accompanying the paper "Noise Contrastive Alignment of Language Models with Explicit Rewards" (NeurIPS 2024)
☆58Nov 8, 2024Updated last year
Alternatives and similar repositories for Noise-Contrastive-Alignment
Users that are interested in Noise-Contrastive-Alignment are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official Codebase for "Aligning Diffusion Behaviors with Q-functions for Efficient Continuous Control" (NeurIPS 2024)☆15Oct 29, 2024Updated last year
- Codes accompanying the paper "Toward Guidance-Free AR Visual Generation via Condition Contrastive Alignment"☆38Feb 11, 2025Updated last year
- Official implementation of HEGNN, a novel high-degree equivariant graph neural network proposed in the NeurIPS 2024 paper 'Are High-Degre…☆34Nov 8, 2024Updated last year
- Self-Supervised Alignment with Mutual Information☆20May 24, 2024Updated 2 years ago
- Official repository for ALT (ALignment with Textual feedback).☆10Jul 25, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- [ICML 2025] Official code of "AlphaDPO: Adaptive Reward Margin for Direct Preference Optimization"☆31Jan 10, 2026Updated 4 months ago
- Codes accompanying the paper "Offline Reinforcement Learning with Value-Based Episodic Memory" (ICLR 2022 https://arxiv.org/abs/2110.0979…☆15Mar 9, 2022Updated 4 years ago
- ☆131Oct 1, 2024Updated last year
- Source code for our paper: "ARIA: Training Language Agents with Intention-Driven Reward Aggregation".☆30Aug 9, 2025Updated 9 months ago
- Short RL☆18Apr 16, 2026Updated last month
- The code for paper "ProQA: Structural Prompt-based Pre-training for Unified Question Answering"☆11Feb 7, 2023Updated 3 years ago
- ☆59Aug 22, 2024Updated last year
- Source code to accompany research paper on training multi token prediction language models using self-distillation.☆37Feb 21, 2026Updated 3 months ago
- ☆47Jun 11, 2025Updated 11 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Implementation for NeurIPS 2023 paper "Equivariant Flow Matching with Hybrid Probability Transport for 3D Molecule Generation"☆44May 30, 2024Updated last year
- Understanding R1-Zero-Like Training: A Critical Perspective☆1,259Aug 27, 2025Updated 8 months ago
- Official implementation of the paper "From Complex to Simple: Enhancing Multi-Constraint Complex Instruction Following Ability of Large L…☆53Jun 24, 2024Updated last year
- Use time-splits for Materials Project entries for generative modeling benchmarking.☆12Mar 12, 2026Updated 2 months ago
- Project code for training LLMs to write better unit tests + code☆22May 19, 2025Updated last year
- Collection of forcing related autoregressive video Gen☆98Mar 31, 2026Updated last month
- Welcome to the 'In Context Learning Theory' Reading Group☆31Nov 8, 2024Updated last year
- Corpus to accompany: "Selective Vision is the Challenge for Visual Reasoning: A Benchmark for Visual Argument Understanding"☆11Apr 11, 2025Updated last year
- v1: Learning to Point Visual Tokens for Multimodal Grounded Reasoning☆19Oct 6, 2025Updated 7 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Code for the paper "AsFT: Anchoring Safety During LLM Fune-Tuning Within Narrow Safety Basin".☆36Jul 10, 2025Updated 10 months ago
- ☆23Sep 19, 2024Updated last year
- ☆13Dec 13, 2023Updated 2 years ago
- ☆53Jun 13, 2025Updated 11 months ago
- Official repository for the paper "Peptide design through binding interface mimicry with PepMimic" accepted by Nature Biomedical Engineer…☆34Oct 20, 2025Updated 7 months ago
- This repo explores how AMR to address tasks difficult for LLMs☆13Jan 15, 2024Updated 2 years ago
- Codebase for Context-aware Meta-learned Loss Scaling (CaMeLS). https://arxiv.org/abs/2305.15076.☆26Jan 23, 2024Updated 2 years ago
- The official repository of "Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint"☆39Jan 12, 2024Updated 2 years ago
- Fully open reproduction of DeepSeek-R1☆11Mar 24, 2025Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- A fork of the PEFT library, supporting Robust Adaptation (RoSA)☆15Aug 16, 2024Updated last year
- 集中管理所有的prompt。☆14Nov 27, 2024Updated last year
- Extensive Self-Contrast Enables Feedback-Free Language Model Alignment☆20Apr 2, 2024Updated 2 years ago
- EMNLP'2023 (Findings): Large Language Model Is Not a Good Few-shot Information Extractor, but a Good Reranker for Hard Samples!☆47Apr 12, 2024Updated 2 years ago
- This is the official implementation of the paper "S²R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning"☆73Apr 22, 2025Updated last year
- [NCMMSC'2024] Emotion-Aware Prosodic Phrasing for Expressive Text-to-Speech☆22Aug 20, 2024Updated last year
- The Good, The Bad, and The Greedy: Evaluation of LLMs Should Not Ignore Non-Determinism☆30Jul 17, 2024Updated last year