NeurIPS 2025: Discriminative Constrained Optimization for Reinforcing Large Reasoning Models
☆51Feb 3, 2026Updated 3 weeks ago
Alternatives and similar repositories for DisCO
Users that are interested in DisCO are comparing it to the libraries listed below
Sorting:
- Trust Region Preference Approximation: A simple and stable reinforcement learning algorithm for LLM reasoning☆14Jun 28, 2025Updated 8 months ago
- [IROS'25] COCMT☆12Aug 14, 2025Updated 6 months ago
- Code and models for EMNLP 2024 paper "WPO: Enhancing RLHF with Weighted Preference Optimization"☆41Sep 24, 2024Updated last year
- ArcherCodeR is an open-source initiative enhancing code reasoning in large language models through scalable, rule-governed reinforcement …☆44Aug 6, 2025Updated 6 months ago
- The OlymMATH dataset☆23Jun 1, 2025Updated 9 months ago
- ☆18Dec 12, 2025Updated 2 months ago
- Code for paper "Unraveling Cross-Modality Knowledge Conflicts in Large Vision-Language Models."☆52Oct 19, 2024Updated last year
- ☆28Feb 27, 2025Updated last year
- Entropy-Driven GRPO with Guided Error Correction for Advantage Diversity☆22Aug 28, 2025Updated 6 months ago
- ☆11Mar 31, 2022Updated 3 years ago
- This repository is the implementation of our paper: Local Correntropy Matrix Representation for Hyperspectral Image Classification, which…☆10Apr 21, 2022Updated 3 years ago
- MemRec☆37Jan 16, 2026Updated last month
- Deploying a custom pytorch model to AWS Sagemaker using terraform and FastAPI☆10Nov 10, 2023Updated 2 years ago
- Multimodal data loader compatible with pytorch and tensorflow☆12Aug 14, 2024Updated last year
- ☆13Nov 5, 2024Updated last year
- An IOT based mobile application to monitor the vitals such as ECG, Body Temperature, Blood Pressure using an ESP32 DevKit and React Nativ…☆11Nov 14, 2024Updated last year
- SLTrain: a sparse plus low-rank approach for parameter and memory efficient pretraining (NeurIPS 2024)☆39Nov 1, 2024Updated last year
- [JMLR] Gradual Domain Adaptation: Theory and Algorithms☆11Jan 14, 2025Updated last year
- HFMF: Hierarchical Fusion Meets Multi-Stream Models for Deepfake Detection☆13Jan 6, 2025Updated last year
- ☆11Jan 28, 2024Updated 2 years ago
- Official repository for "EduBench: A Comprehensive Benchmarking Dataset for Evaluating Large Language Models in Diverse Educational Scena…☆19May 28, 2025Updated 9 months ago
- A Spherical Hidden Markov Model for Semantics-Rich Human Mobility Modeling (AAAI 2018)☆10Oct 23, 2020Updated 5 years ago
- ☆14Jul 27, 2025Updated 7 months ago
- Go package that wraps around OpenAI HTTP APIs☆12Mar 2, 2023Updated 3 years ago
- Umap is a Python library that transforms OpenStreetMap data into customized maps with minimal code. Create minimalist or multi-layered ma…☆13Updated this week
- Code for paper "Open Relation and Event Type Discovery with Type Abstraction". EMNLP 22'☆16Nov 30, 2022Updated 3 years ago
- ☆18Mar 2, 2025Updated last year
- [ICML-2025] We introduce Lie group Relative position Encodings (LieRE) that goes beyond RoPE in supporting n-dimensional inputs.☆14Aug 8, 2025Updated 6 months ago
- This example shows how a multitenant service can distribute requests evenly among multiple Azure OpenAI Service instances and manage toke…☆13Jan 9, 2024Updated 2 years ago
- Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" from Pytorch and Zeta☆13Nov 11, 2024Updated last year
- ☆12Mar 25, 2024Updated last year
- ☆10Aug 6, 2024Updated last year
- 해커그라운드 해커톤 2024☆12Aug 26, 2024Updated last year
- ☆14Aug 5, 2022Updated 3 years ago
- [ICML 2023] Protecting Language Generation Models via Invisible Watermarking☆13Sep 8, 2023Updated 2 years ago
- Accelerating RL for LLM Reasoning with Optimal Advantage Regression☆35May 30, 2025Updated 9 months ago
- Official repository for Targeted Unlearning with Single Layer Unlearning Gradient (SLUG), ICML 2025☆15Aug 10, 2025Updated 6 months ago
- [COLM '25] Single-Pass Document Scanning for Question Answering☆12Aug 20, 2025Updated 6 months ago
- Danfeng Hong, Naoto Yokoya, Jian Xu, Xiaoxiang Zhu. Joint & Progressive Learning from High-Dimensional Data for Multi-Label Classificatio…☆11Nov 14, 2021Updated 4 years ago