sastpg/CoVo

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/sastpg/CoVo)

sastpg / CoVo

Consistent Paths Lead to Truth: Self-Rewarding Reinforcement Learning for LLM Reasoning

☆25

Alternatives and similar repositories for CoVo

Users that are interested in CoVo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

sastpg / RFTT
View on GitHub
RFTT: Reasoning with Reinforced Functional Token Tuning
☆29Feb 12, 2026Updated 5 months ago
wwangwitsel / ConfDiff
View on GitHub
[NeurIPS'23] Binary Classification with Confidence Difference
☆10May 13, 2024Updated 2 years ago
princeton-pli / STAT
View on GitHub
Skill-Targeted Adaptive Training
☆24Mar 12, 2026Updated 4 months ago
T-Lab-CUHKSZ / G2RPO-A
View on GitHub
[ACL 2026] G2RPO-A: Guided Group Relative Policy Optimization with Adaptive Guidance
☆16May 20, 2026Updated 2 months ago
ml-lab-htw / llm-trees
View on GitHub
Official repo: “Oh LLM, I’m Asking Thee, Please Give Me a Decision Tree”: Zero-Shot Decision Tree Induction and Embedding with Large Lang…
☆16Jul 17, 2026Updated last week
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
DripNowhy / Sherlock
View on GitHub
[NeurIPS 2025] Official Implementation of paper "Sherlock: Self-Correcting Reasoning in Vision-Language Models"
☆31Jun 4, 2026Updated last month
Zanette-Labs / speed-rl
View on GitHub
☆18Feb 2, 2026Updated 5 months ago
thinkwee / NOVER
View on GitHub
[EMNLP-2025] R1-Zero on ANY TASK
☆32Nov 9, 2025Updated 8 months ago
tmlr-group / TriMem
View on GitHub
[arXiv:2605.19952] "Rethinking How to Remember: Beyond Atomic Facts in Lifelong LLM Agent Memory"
☆16May 20, 2026Updated 2 months ago
amayuelas / multi-agent-attack
View on GitHub
MutliAgent Attack
☆15Oct 3, 2024Updated last year
yifeiwang77 / Self-Correction
View on GitHub
☆20Nov 3, 2024Updated last year
MasterVito / SwS
View on GitHub
Official Repo for SwS: A Weakness-driven Problem Synthesis Framework in RL for LLM Reasoning
☆42Nov 11, 2025Updated 8 months ago
YujunZhou / EVOL-RL
View on GitHub
Code for Evolving Language Models without Labels: Majority Drives Selection, Novelty Promotes Variation (EVOL-RL).
☆51Mar 31, 2026Updated 3 months ago
ucla-mobility / TurboTrain
View on GitHub
[ICCV 2025] TurboTrain: Towards Efficient and Balanced Multi-Task Learning for Multi-Agent Perception and Prediction.
☆17Jan 31, 2026Updated 5 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
yf-he / EvoTest
View on GitHub
EvoTest: Evolutionary Test-Time Learning for Self-Improving Agentic Systems (ICLR'26)
☆24Nov 3, 2025Updated 8 months ago
wenquanlu / huginn-latent-cot
View on GitHub
[COLM 2025: 1st Workshop on the Application of LLM Explainability to Reasoning and Planning] Latent Chain-of-Thought? Decoding the Depth-…
☆20Oct 4, 2025Updated 9 months ago
ars22 / e3
View on GitHub
☆20Sep 16, 2025Updated 10 months ago
jiquan123 / TIER
View on GitHub
TIER: Text-Image Encoder-based Regression for AIGC Image Quality Assessment
☆10Mar 1, 2025Updated last year
QingyangZhang / Label-Free-RLVR
View on GitHub
☆311Jul 6, 2025Updated last year
QingyangZhang / EMPO
View on GitHub
[NeurIPS25 Spotlight] EMPO, A Fully Unsupervised RLVR Method
☆103Nov 24, 2025Updated 8 months ago
Leey21 / A-Data-Centric-Study
View on GitHub
☆18Mar 2, 2026Updated 4 months ago
TianhongDai / metaworld-sac
View on GitHub
☆12Aug 28, 2020Updated 5 years ago
xuyige / SoftCoT
View on GitHub
ACL'2025: SoftCoT: Soft Chain-of-Thought for Efficient Reasoning with LLMs. and preprint: SoftCoT++: Test-Time Scaling with Soft Chain-of…
☆92May 30, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
seamoke / DPH-RL
View on GitHub
This is the official implementation of paper "The Choice of Divergence: A Neglected Key to Mitigating Diversity Collapse in Reinforcement…
☆20Feb 10, 2026Updated 5 months ago
zenghy96 / Reliable-Source-Approximation
View on GitHub
Reliable Source Approximation: Source-Free Domain Adaptation for Vestibular Schwannoma MRI Segmentation
☆11Dec 28, 2024Updated last year
zz1358m / ATP-Latent-master
View on GitHub
☆17Feb 4, 2026Updated 5 months ago
flowersteam / EAGER
View on GitHub
☆10Oct 11, 2022Updated 3 years ago
WEIRDLabUW / dispo
View on GitHub
Distributional Successor Features Enable Zero-Shot Policy Optimization
☆15Apr 11, 2025Updated last year
kevinliang888 / Machine-Bullshit
View on GitHub
Machine Bullshit: Characterizing the Emergent Disregard for Truth in Large Language Models
☆27Sep 14, 2025Updated 10 months ago
M-3LAB / Look-Inside-for-More
View on GitHub
This is the Reproducible Realisation of the AAAI25 paper "Look Inside for More: Internal Spatial Modality Perception for 3D Anomaly Detec…
☆16Oct 5, 2025Updated 9 months ago
priyankjaini / discFlowMH
View on GitHub
Pytorch code for Sampling in Combinatorial Spaces with SurVAE Flow Augmented MCMC
☆11Mar 1, 2021Updated 5 years ago
ha0ransun / Path-Auxiliary-Sampler
View on GitHub
☆10Feb 22, 2023Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
augustwester / gflownet
View on GitHub
A PyTorch implementation of a Generative Flow Network (GFlowNet) proposed by Bengio et al. (2021)
☆45Sep 7, 2023Updated 2 years ago
Kyyle2114 / Convolutional-Adapter-for-Segment-Anything
View on GitHub
CAD - Memory Efficient Convolutional Adapter for Segment Anything
☆12Oct 4, 2024Updated last year
MangoKiller / SimOAR_OAR
View on GitHub
☆11Nov 8, 2023Updated 2 years ago
ec2604 / ContraBAR
View on GitHub
☆13May 21, 2023Updated 3 years ago
Dakingrai / neuron-analysis-cot-arithmetic-reasoning
View on GitHub
☆14Feb 24, 2025Updated last year
poteminr / gigasmol
View on GitHub
💀 gigasmol: a lightweight wrapper for gigachat api model for seamless use with smolagents.
☆15Oct 23, 2025Updated 9 months ago
ido90 / RobustMetaRL
View on GitHub
A variant of Varibad that is robust to difficult tasks
☆11Aug 30, 2023Updated 2 years ago