mansicer/self-verification

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/mansicer/self-verification)

mansicer / self-verification

☆18

Alternatives and similar repositories for self-verification

Users that are interested in self-verification are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

liyc-ai / RL-pytorch
View on GitHub
A beginner-friendly repository on Deep Reinforcement Learning (RL), written in PyTorch.
☆27Mar 19, 2026Updated 4 months ago
KAIST-Visual-AI-Group / Psi-Sampler
View on GitHub
[NeurIPS 2025, Spotlight] Official code for Initial Particle Sampling for SMC-Based Inference-Time Reward Alignment in Score-Based Genera…
☆18Feb 3, 2026Updated 5 months ago
XueruiSu / Trust-Region-Preference-Approximation
View on GitHub
Trust Region Preference Approximation: A simple and stable reinforcement learning algorithm for LLM reasoning
☆15Jun 28, 2025Updated last year
polixir / NeoRL2
View on GitHub
☆20Oct 27, 2025Updated 8 months ago
LAMDA-RL / PRDC
View on GitHub
Author's PyTorch implementation of ICML'23 paper "Policy Regularization with Dataset Constraint for Offline Reinforcement Learning" for D…
☆18Nov 8, 2024Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
DrZero0 / MACC
View on GitHub
The implementation of IJCAI'22 paper "Multi-Agent Concentrative Coordination with Decentralized Task Representation".
☆18May 1, 2022Updated 4 years ago
RUCBM / LaSeR
View on GitHub
[ICLR 2026] Official repository for the paper "LaSeR: Reinforcement Learning with Last-Token Self-Rewarding"
☆36Oct 28, 2025Updated 8 months ago
hkust-nlp / mstar
View on GitHub
[ICML 2025] M-STAR (Multimodal Self-Evolving TrAining for Reasoning) Project. Diving into Self-Evolving Training for Multimodal Reasoning
☆75Jul 13, 2025Updated last year
lamda-bbo / madac
View on GitHub
Official implementation of NeurIPS22 paper “Multi-agent Dynamic Algorithm Configuration”
☆26Mar 6, 2023Updated 3 years ago
x35f / unstable_baselines
View on GitHub
Re-implementations of SOTA RL algorithms.
☆137Sep 7, 2023Updated 2 years ago
OpenCausaLab / ARise
View on GitHub
☆26Jul 26, 2025Updated 11 months ago
TheRoadQaQ / ReLIFT
View on GitHub
Official Repository of "Learning what reinforcement learning can't"
☆84Dec 30, 2025Updated 6 months ago
typoverflow / UtilsRL
View on GitHub
A python module designed for agile RL algorithm developing.
☆26Jul 11, 2024Updated 2 years ago
QingyangZhang / TEMPO
View on GitHub
Scaling Test-time Training for LLM Reasoning
☆27Apr 14, 2026Updated 3 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
KAIST-Visual-AI-Group / BezierFlow
View on GitHub
[ICLR 2026] Official code for BézierFlow: Learning Bézier Stochastic Interpolant Schedulers for Few-Step Generation
☆21Apr 13, 2026Updated 3 months ago
CMU-AIRe / POPE
View on GitHub
☆27Jan 31, 2026Updated 5 months ago
DripNowhy / Sherlock
View on GitHub
[NeurIPS 2025] Official Implementation of paper "Sherlock: Self-Correcting Reasoning in Vision-Language Models"
☆31Jun 4, 2026Updated last month
Keely-Ai / F2D2
View on GitHub
Joint Distillation for Fast Likelihood Evaluation and Sampling in Flow-based Models
☆22Mar 5, 2026Updated 4 months ago
weitongseu / PCL
View on GitHub
☆10Jul 11, 2022Updated 4 years ago
chwoong / LiRE
View on GitHub
Listwise Reward Estimation for Offline Preference-based Reinforcement Learning (ICML 2024)
☆18Jun 18, 2024Updated 2 years ago
czp16 / Bridge-LLM-reasoning
View on GitHub
Behavior Injection: Preparing Language Models for Reinforcement Learning (NeurIPS 2025)
☆17Jul 1, 2025Updated last year
julienroyd / coordination-marl
View on GitHub
Code to reproduce experiments from:
☆10Dec 11, 2020Updated 5 years ago
aaronserianni / attention-iou
View on GitHub
[CVPR'25] Attention IoU: Examining Biases in CelebA using Attention Maps
☆13Mar 26, 2025Updated last year
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
zhangxy-2019 / critique-GRPO
View on GitHub
[ICML 2026 Spotlight] Critique-GRPO: Advancing LLM Reasoning with Natural Language and Numerical Feedback
☆70Jun 3, 2026Updated last month
daveredrum / 3d-captioning
View on GitHub
Generate descriptions automatically for 3D shapes in ShapeNet via cross-modal joint embedding
☆15Jan 4, 2019Updated 7 years ago
mhsung / libigl-renderer
View on GitHub
☆19Mar 14, 2023Updated 3 years ago
kywch / brax-trainer
View on GitHub
Brax + Pufferlib + CARBS for gpu-accelerated robotics RL
☆12Jun 12, 2025Updated last year
zongqianwu / ST-COT
View on GitHub
(ICML 2025) Rethinking Chain-of-Thought from the Perspective of Self-Training
☆13Feb 15, 2025Updated last year
FanmingL / SmartLogger
View on GitHub
☆12May 14, 2024Updated 2 years ago
apexrl / EBIL-torch
View on GitHub
Pytorch Implementation of AAMAS 2021 paper <Energy-Based Imitation Learning>
☆12Oct 8, 2021Updated 4 years ago
orvindemsy / EA-wLTL
View on GitHub
combining Euclidean Alignment (EA) and weighted LTL to classify MI-based EEG
☆11Jun 13, 2024Updated 2 years ago
Moreland-cas / AKM
View on GitHub
[RA-L`2026] Active Kinematic Modelling for Precise Manipulation of Unseen Articulated Objects
☆15Jan 9, 2026Updated 6 months ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
MJ-Jang / BECEL
View on GitHub
☆10Jan 28, 2024Updated 2 years ago
ClawGym / ClawGym-Agents
View on GitHub
☆33Jun 30, 2026Updated 3 weeks ago
liuyuchen-cz / F-OAL
View on GitHub
This is the source code of F-OAL: Forward-only Online Analytic Learning with Fast Training and Low Memory Footprint in Class Incremental …
☆11Oct 19, 2024Updated last year
cheolhong0916 / contrastive-probing
View on GitHub
☆15Jun 19, 2026Updated last month
TIGER-AI-Lab / VL-Rethinker
View on GitHub
The official code of "VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning" [NeurIPS25]
☆189Jun 5, 2025Updated last year
wwhenxuan / S2Generator
View on GitHub
A series-symbol (S2) dual-modality data generation mechanism, enabling the unrestricted creation of high-quality time series data paired …
☆19Jun 25, 2026Updated 3 weeks ago
AI9Stars / AutoReproduce
View on GitHub
☆39Apr 10, 2026Updated 3 months ago