Pavankunchala / Reinforcement-learning-with-verifable-rewards-LearningsLinks

RLVR Testing and Training

☆23

Alternatives and similar repositories for Reinforcement-learning-with-verifable-rewards-Learnings

Users that are interested in Reinforcement-learning-with-verifable-rewards-Learnings are comparing it to the libraries listed below

Sorting:

VizuaraAI / truly-open-gpt-oss
A truly open version of gpt-oss which shows the entire pre-training from scratch
☆79Updated 3 months ago
asappresearch / josh-llm-simulation-training
☆31Updated 9 months ago
ideaweaver-ai / DeepSeek-Children-Stories-15M-model
☆109Updated 6 months ago
SakanaAI / natural_niches
The code repository of the paper: Competition and Attraction Improve Model Fusion
☆167Updated 3 months ago
reka-ai / rekaquant
☆62Updated 5 months ago
ritabratamaiti / AnyModal
AnyModal is a Flexible Multimodal Language Model Framework for PyTorch
☆103Updated 11 months ago
EmpathYang / TinyHelen
Code for paper https://arxiv.org/abs/2501.00522
☆13Updated 7 months ago
ALucek / GRPO-Training
An overview of GRPO & DeepSeek-R1 Training with Open Source GRPO Model Fine Tuning
☆37Updated 7 months ago
ideaweaver-ai / Tiny-Children-Stories-30M-model
☆122Updated 6 months ago
bradhilton / temporal-clue
Clue inspired puzzles for testing LLM deduction abilities
☆45Updated 8 months ago
OpenPipe / rl-experiments
OpenPipe Reinforcement Learning Experiments
☆32Updated 9 months ago
janhq / ReZero
☆159Updated 8 months ago
BKHMSI / mixture-of-cognitive-reasoners
Mixture of Cognitive Reasoners: Modular Reasoning with Brain-Like Specialization
☆36Updated last month
JakeFurtaw / Chat-RAG
Advanced Coding AI Assistant that uses a Gradio interface to stream coding related responses. ChatRAG supports local and API inference an…
☆23Updated 7 months ago
projektjoe / GPT-OSS
From-scratch implementation of OpenAI's GPT-OSS model in Python. No Torch, No GPUs.
☆107Updated last month
thad0ctor / unsloth-5090-multiple
unsloth-5090-multiple
☆60Updated 6 months ago
onurbaran / stream-rag-agent
Streaming Retrieval-Augmented Generation (RAG) agent in Go. It consumes real-time data from Kafka topics, processes it in configurable wi…
☆25Updated 6 months ago
ysharma3501 / MiraTTS
A high quality and fast TTS repository
☆111Updated this week
sunnweiwei / PPP-Agent
☆92Updated last month
Antoine-Villiere / JacQues
JacQues is a Dash-based interactive web application that facilitates real-time chat and document management.
☆22Updated last year
prateekvellala / retrieval-experiments
Exploring retrieval systems for language models
☆14Updated 8 months ago
willccbb / trl
Train transformer language models with reinforcement learning.
☆19Updated 9 months ago
nishchaljs / MobiRAG
☆29Updated 7 months ago
bernatsampera / event-deep-research
AI Agent that researches the lives of historical figures and extracts events into structured JSON timelines using LangGraph multi-agent o…
☆215Updated 2 months ago
MehulG / memX
A real-time shared memory layer for multi-agent LLM systems.
☆50Updated 5 months ago
EduardTalianu / EntropixLab
entropix style sampling + GUI
☆27Updated last year
codelion / ellora
Enhancing LLMs with LoRA
☆193Updated 2 months ago
ArturTanona / grpo_unsloth_docker
☆57Updated 10 months ago
Forest-Person / smolResearcher
Use smol agents to do research and then update csv coumns with its findings.
☆41Updated 10 months ago
kubernetes-bad / reward-composer
Lego for GRPO
☆30Updated 6 months ago