Consistent Paths Lead to Truth: Self-Rewarding Reinforcement Learning for LLM Reasoning
☆25Jun 25, 2025Updated last year
Alternatives and similar repositories for CoVo
Users that are interested in CoVo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆18Mar 2, 2026Updated 4 months ago
- ☆18Mar 14, 2025Updated last year
- Official repo for "StreamingVLA: Streaming Vision-Language-Action Model with Action Flow Matching and Adaptive Early Observation"☆27Updated this week
- Official implementation for Text Generation Beyond Discrete Token Sampling☆25Aug 11, 2025Updated 10 months ago
- RFTT: Reasoning with Reinforced Functional Token Tuning☆29Feb 12, 2026Updated 4 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Code for Evolving Language Models without Labels: Majority Drives Selection, Novelty Promotes Variation (EVOL-RL).☆51Mar 31, 2026Updated 3 months ago
- An exploration of LLM steering☆28Jun 15, 2024Updated 2 years ago
- [arXiv 2024] FairVision: Equitable Deep Learning for Eye Disease Screening via Fair Identity Scaling☆18Apr 15, 2026Updated 2 months ago
- The repo contains the code and dataset for the World Models Track of GigaBrain Challenge 2026 CVPR Workshop.☆59Apr 8, 2026Updated 2 months ago
- [EMNLP-2025] R1-Zero on ANY TASK☆31Nov 9, 2025Updated 7 months ago
- End-to-end optimal quadcopter control through Supervised Learning☆26Oct 6, 2024Updated last year
- ☆43Jul 16, 2025Updated 11 months ago
- ☆14Feb 24, 2025Updated last year
- AI-powered Decision Tracing For Financial Institutions☆83Mar 2, 2026Updated 3 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- OSWorld-Human: Benchmarking the Efficiency of Computer-Use Agents☆27May 17, 2026Updated last month
- Pytorch code for Sampling in Combinatorial Spaces with SurVAE Flow Augmented MCMC☆11Mar 1, 2021Updated 5 years ago
- the official repo for EMNLP 2024 (main) paper "EFUF: Efficient Fine-grained Unlearning Framework for Mitigating Hallucinations in Multimo…☆21Apr 9, 2025Updated last year
- [NeurIPS2024] Official code for (IMA) Implicit Multimodal Alignment: On the Generalization of Frozen LLMs to Multimodal Inputs☆23Oct 15, 2024Updated last year
- Official code for SA-Solver: Stochastic Adams Solver for Fast Sampling of Diffusion Models (NeurIPS 2023)☆14Mar 4, 2024Updated 2 years ago
- Curated LLM (ICML 2024)☆14Oct 23, 2024Updated last year
- Example Code for the Conditional Action Trees Paper☆12May 24, 2021Updated 5 years ago
- [ICLR 2025] Official codebase for the ICLR 2025 paper "Multimodal Situational Safety"☆35Jun 23, 2025Updated last year
- ☆73Jun 18, 2026Updated 2 weeks ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆34Sep 19, 2025Updated 9 months ago
- Skill-Inject: Measuring Agent Vulnerability to Skill File Attacks☆84May 7, 2026Updated last month
- ☆10Feb 22, 2023Updated 3 years ago
- ☆25Jun 13, 2024Updated 2 years ago
- Source code for "Continuous Regularized Wasserstein Barycenters" [NeurIPS 2020].☆16Nov 4, 2020Updated 5 years ago
- Unofficial implementation of Variational Diffusion Models in PyTorch (Lightning)☆12Aug 31, 2023Updated 2 years ago
- ☆15Dec 3, 2024Updated last year
- 利用Airsim做无人机编队仿真,持续更新中。☆32Mar 26, 2021Updated 5 years ago
- Code for "Semantic Perturbations with Normalizing Flows for Improved Generalization"☆11Jul 13, 2021Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- NeurIPS'23: Energy Discrepancies: A Score-Independent Loss for Energy-Based Models☆18Oct 22, 2024Updated last year
- ☆17Mar 2, 2023Updated 3 years ago
- This repo contains the code for the paper "Understanding and Mitigating Hallucinations in Large Vision-Language Models via Modular Attrib…☆39Jul 14, 2025Updated 11 months ago
- Code for the ACL 2023 paper: "Rethinking the Role of Scale for In-Context Learning: An Interpretability-based Case Study at 66 Billion Sc…☆35Sep 16, 2023Updated 2 years ago
- This is a repository for DKI group concerning the LLM-related papers alongside with code.☆40May 20, 2026Updated last month
- The implementation of "An Imitative Reinforcement Learning Framework for Pursuit-Lock-Launch Missions"☆35Oct 29, 2025Updated 8 months ago
- [ACL 2024] Learning to Edit: Aligning LLMs with Knowledge Editing☆37Aug 19, 2024Updated last year