Consistent Paths Lead to Truth: Self-Rewarding Reinforcement Learning for LLM Reasoning
☆24Jun 25, 2025Updated 10 months ago
Alternatives and similar repositories for CoVo
Users that are interested in CoVo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆18Mar 2, 2026Updated 2 months ago
- ☆17Mar 14, 2025Updated last year
- Official repo for "StreamingVLA: Streaming Vision-Language-Action Model with Action Flow Matching and Adaptive Early Observation"☆25Apr 22, 2026Updated last month
- Official implementation for Text Generation Beyond Discrete Token Sampling☆25Aug 11, 2025Updated 9 months ago
- RFTT: Reasoning with Reinforced Functional Token Tuning☆29Feb 12, 2026Updated 3 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Code for Evolving Language Models without Labels: Majority Drives Selection, Novelty Promotes Variation (EVOL-RL).☆50Mar 31, 2026Updated last month
- An exploration of LLM steering☆26Jun 15, 2024Updated last year
- [arXiv 2024] FairVision: Equitable Deep Learning for Eye Disease Screening via Fair Identity Scaling☆16Apr 15, 2026Updated last month
- The repo contains the code and dataset for the World Models Track of GigaBrain Challenge 2026 CVPR Workshop.☆59Apr 8, 2026Updated last month
- [EMNLP-2025] R1-Zero on ANY TASK☆30Nov 9, 2025Updated 6 months ago
- End-to-end optimal quadcopter control through Supervised Learning☆25Oct 6, 2024Updated last year
- ☆39Jul 16, 2025Updated 10 months ago
- ☆14Feb 24, 2025Updated last year
- OSWorld-Human: Benchmarking the Efficiency of Computer-Use Agents☆25Updated this week
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Pytorch code for Sampling in Combinatorial Spaces with SurVAE Flow Augmented MCMC☆11Mar 1, 2021Updated 5 years ago
- the official repo for EMNLP 2024 (main) paper "EFUF: Efficient Fine-grained Unlearning Framework for Mitigating Hallucinations in Multimo…☆21Apr 9, 2025Updated last year
- Skill-Inject: Measuring Agent Vulnerability to Skill File Attacks☆72May 7, 2026Updated 2 weeks ago
- [NeurIPS2024] Official code for (IMA) Implicit Multimodal Alignment: On the Generalization of Frozen LLMs to Multimodal Inputs☆23Oct 15, 2024Updated last year
- Official code for SA-Solver: Stochastic Adams Solver for Fast Sampling of Diffusion Models (NeurIPS 2023)☆13Mar 4, 2024Updated 2 years ago
- Curated LLM (ICML 2024)☆14Oct 23, 2024Updated last year
- Example Code for the Conditional Action Trees Paper☆12May 24, 2021Updated 4 years ago
- ☆70Dec 7, 2025Updated 5 months ago
- [ICLR 2025] Official codebase for the ICLR 2025 paper "Multimodal Situational Safety"☆35Jun 23, 2025Updated 10 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆34Sep 19, 2025Updated 8 months ago
- ☆10Feb 22, 2023Updated 3 years ago
- ☆24Jun 13, 2024Updated last year
- Source code for "Continuous Regularized Wasserstein Barycenters" [NeurIPS 2020].☆16Nov 4, 2020Updated 5 years ago
- Unofficial implementation of Variational Diffusion Models in PyTorch (Lightning)☆11Aug 31, 2023Updated 2 years ago
- ☆15Dec 3, 2024Updated last year
- 利用Airsim做无人机编队仿真,持续更新中。☆32Mar 26, 2021Updated 5 years ago
- Code for "Semantic Perturbations with Normalizing Flows for Improved Generalization"☆11Jul 13, 2021Updated 4 years ago
- NeurIPS'23: Energy Discrepancies: A Score-Independent Loss for Energy-Based Models☆17Oct 22, 2024Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆17Mar 2, 2023Updated 3 years ago
- This repo contains the code for the paper "Understanding and Mitigating Hallucinations in Large Vision-Language Models via Modular Attrib…☆39Jul 14, 2025Updated 10 months ago
- Code for the ACL 2023 paper: "Rethinking the Role of Scale for In-Context Learning: An Interpretability-based Case Study at 66 Billion Sc…☆35Sep 16, 2023Updated 2 years ago
- Gradient Estimation with Discrete Stein Operators (NeurIPS 2022)☆17Nov 14, 2023Updated 2 years ago
- This is a repository for DKI group concerning the LLM-related papers alongside with code.☆38Updated this week
- The implementation of "An Imitative Reinforcement Learning Framework for Pursuit-Lock-Launch Missions"☆35Oct 29, 2025Updated 6 months ago
- [ACL 2024] Learning to Edit: Aligning LLMs with Knowledge Editing☆37Aug 19, 2024Updated last year