The official code of Multi-player Nash Preference Optimization [ICLR 2026]
☆35Feb 4, 2026Updated 2 months ago
Alternatives and similar repositories for MNPO
Users that are interested in MNPO are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code to replicate the Representation Noising paper and tools for evaluating defences against harmful fine-tuning☆24Dec 12, 2024Updated last year
- [ICLR 2025] Code&Data for the paper "Super(ficial)-alignment: Strong Models May Deceive Weak Models in Weak-to-Strong Generalization"☆15Jun 21, 2024Updated last year
- Fingerprint large language models☆51Jul 11, 2024Updated last year
- [ICLR 2024] This is the official implementation for the paper: "Beyond imitation: Leveraging fine-grained quality signals for alignment"☆10May 5, 2024Updated last year
- [NeurIPS 2023] Rewrite Caption Semantics: Bridging Semantic Gaps for Language-Supervised Semantic Segmentation☆20Jan 3, 2024Updated 2 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- An official implementation of "Rethinking Graph Backdoor Attacks: A Distribution-Preserving Perspective" (KDD 2024)☆12Sep 16, 2024Updated last year
- [ICLR 2025 Spotlight] Weak-to-strong preference optimization: stealing reward from weak aligned model☆18Feb 24, 2025Updated last year
- Code for experiments on self-prediction as a way to measure introspection in LLMs☆16Dec 10, 2024Updated last year
- [EMNLP 2025 Main] The official repo of MMLU-ProX benchmark.☆27Aug 26, 2025Updated 7 months ago
- Code for ICLR 2025 Paper "GenARM: Reward Guided Generation with Autoregressive Reward Model for Test-time Alignment"☆21Feb 10, 2025Updated last year
- This is the code for our paper: PLACES: Prompting Language Models for Social Conversation Synthesis☆11Feb 17, 2023Updated 3 years ago
- [KDD'23] This is the code repo for our KDD'23 paper "DyGen: Learning from Noisy Labels via Dynamics-Enhanced Generative Modeling".☆11Jun 14, 2023Updated 2 years ago
- ☆19May 3, 2025Updated 11 months ago
- [ICLR 2025] On Evluating the Durability of Safegurads for Open-Weight LLMs☆13Jun 20, 2025Updated 9 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Code for the paper "Data Feedback Loops: Model-driven Amplification of Dataset Biases"☆18Sep 9, 2022Updated 3 years ago
- Project of ACL 2025 "UAlign: Leveraging Uncertainty Estimations for Factuality Alignment on Large Language Models"☆14Mar 25, 2025Updated last year
- [ICLR 2025] FLAT: LLM Unlearning via Loss Adjustment with Only Forget Data☆14Feb 26, 2025Updated last year
- Test and benchmark your Rust library on mobile devices with ease.☆13Jul 17, 2023Updated 2 years ago
- ☆10Jun 14, 2025Updated 10 months ago
- [ICLR 26] The official code repository for the paper "Mirage or Method? How Model–Task Alignment Induces Divergent RL Conclusions".☆17Feb 9, 2026Updated 2 months ago
- An official implementation of "Catastrophic Failure of LLM Unlearning via Quantization" (ICLR 2025)☆37Feb 22, 2025Updated last year
- PAHF Personalized Agent from Human Feedback☆45Mar 6, 2026Updated last month
- Code and Hummingbird dataset for EMNLP 2021 paper "Does BERT Learn as Humans Perceive? Understanding Linguistic Styles through Lexica"☆14Apr 13, 2022Updated 4 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Make reasoning models scalable☆49May 31, 2025Updated 10 months ago
- Repo for paper: Examining LLMs' Uncertainty Expression Towards Questions Outside Parametric Knowledge☆14Feb 20, 2024Updated 2 years ago
- ☆30Feb 18, 2026Updated last month
- A geometric deep learning method for refining and assessing protein complex structures.☆16Oct 22, 2022Updated 3 years ago
- Build A Simple Web App With Sveltekit and Appwrite☆11Apr 3, 2023Updated 3 years ago
- Code for Representation Bending Paper☆17Jul 15, 2025Updated 9 months ago
- ☆15May 28, 2024Updated last year
- CMPhysBench: A Benchmark for Evaluating Large Language Models in Condensed Matter Physics☆28Nov 1, 2025Updated 5 months ago
- Official website for the TRON (Token Reduced Object Notation) format☆38Nov 29, 2025Updated 4 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Agentic Virtual Lab☆19Nov 30, 2025Updated 4 months ago
- Code for "CREAM: Consistency Regularized Self-Rewarding Language Models", ICLR 2025.☆29Feb 17, 2025Updated last year
- [NeurIPS 2024] Can LLMs Learn by Teaching for Better Reasoning? A Preliminary Study☆59Nov 24, 2024Updated last year
- PyCausalSim is a Python framework for discovering and validating causal relationships through simulation. Unlike traditional analytics th…☆32Dec 8, 2025Updated 4 months ago
- BC-Design: A Biochemistry-Aware Framework for High-Precision Inverse Protein Folding https://www.biorxiv.org/content/10.1101/2024.10.28.6…☆21Nov 24, 2025Updated 4 months ago
- Official repo for "Foundation Models for Remote Sensing and Earth Observation: A Survey"☆50Nov 25, 2024Updated last year
- [CVPR 2023] 3D Representation Learning via Foreground Aware Feature Contrast☆42Mar 26, 2024Updated 2 years ago