RLVR Testing and Training
☆23Aug 28, 2025Updated 9 months ago
Alternatives and similar repositories for Reinforcement-learning-with-verifable-rewards-Learnings
Users that are interested in Reinforcement-learning-with-verifable-rewards-Learnings are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Prompt-driven automation platform - Transform natural language into executable workflows☆34Jul 13, 2025Updated 11 months ago
- generate informative knowledge graph from text using open source models , ollama☆23Sep 1, 2025Updated 9 months ago
- Demo of building and intergraition MCP Server☆20Apr 9, 2025Updated last year
- A Simple, Explainable Vision Language Model for detecting manifacturing defects into products☆15Sep 23, 2025Updated 8 months ago
- ☆14May 25, 2023Updated 3 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- AI in A Box☆27Jun 1, 2026Updated 2 weeks ago
- A Survey Analyzing Generalization in Deep Reinforcement Learning☆37Oct 31, 2024Updated last year
- Reinforcement learning algorithm implementation☆10Oct 31, 2021Updated 4 years ago
- ☆12Oct 19, 2020Updated 5 years ago
- An open source deep learning library for Unity.☆17Jun 11, 2026Updated last week
- C++ version of pyannote audio overlapped speech detection pipeline☆13Feb 14, 2024Updated 2 years ago
- Official code for "A General Learning Framework for Open Ad Hoc Teamwork Using Graph-based Policy Learning"☆17Mar 1, 2023Updated 3 years ago
- SiDeGame - Simplified Defusal Game☆13Apr 17, 2025Updated last year
- Open source code combining implementations of Upside Down Reinforcement Learning and Reward Conditioned Policies☆20Mar 10, 2021Updated 5 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- An agent for playing Atari games running on a Teensy microcontroller☆15Nov 11, 2022Updated 3 years ago
- LLM-based Multi-dimensional Debate Judge with Iterative Chronological Analysis☆20Oct 1, 2025Updated 8 months ago
- Research that compiles.☆85Apr 19, 2026Updated last month
- Synthetic Data Generator for Machine Learning Pipelines☆33Sep 2, 2025Updated 9 months ago
- ☆13Mar 25, 2025Updated last year
- Modelling heterogeneous distributions with an Uncountable Mixture of Asymmetric Laplacians☆20Oct 27, 2019Updated 6 years ago
- Desktop application for instant AI-powered text transformation. Translate, correct, summarize, and change the tone of any text, anywhere,…☆35Dec 29, 2025Updated 5 months ago
- Auto Causal Inference Assistant for Banking using LangGraph and MCP☆23Jun 28, 2025Updated 11 months ago
- a tool to parse source code into a knowledge graph☆34Dec 21, 2025Updated 5 months ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆54Dec 5, 2025Updated 6 months ago
- Create topological graph for image segments.☆23Sep 28, 2024Updated last year
- Mini RL Lab☆17Jun 17, 2024Updated 2 years ago
- ☆31Dec 12, 2025Updated 6 months ago
- Graph convolutional memory☆17May 26, 2022Updated 4 years ago
- Fulloch - The Fully Local Home Voice Assistant☆98Updated this week
- Lego for GRPO☆30May 27, 2025Updated last year
- Production-grade agent orchestration for Claude Code - 11 agents, 46 MCP tools, SQLite+FTS5, drift detection, consensus checkpoints☆51Jun 8, 2026Updated last week
- Implementation of RL-100, Performant Robotic Manipulation with Real-World Reinforcement Learning☆64Nov 26, 2025Updated 6 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- A training framework for large-scale language models based on Megatron-Core, the COOM Training Framework is designed to efficiently handl…☆27Nov 14, 2025Updated 7 months ago
- utilizing RL and GNN for trajectory planning(co-work)☆14Jul 28, 2023Updated 2 years ago
- A deep learning agent for The Legend of Zelda (nes)☆28Apr 18, 2026Updated 2 months ago
- Force-directed graph as a React Three Fiber component☆47Jul 3, 2025Updated 11 months ago
- A Vietnamese Text-to-Speech library that provides high-quality speech synthesis with voice cloning capabilities☆105Jul 14, 2025Updated 11 months ago
- ProfitsBot V0 are a set of LLM experiments training open source langage models with loras for financial applications☆19May 27, 2023Updated 3 years ago
- Un-LOCC: Universal Lossy Optical Context Compression for Vision-Based Language Models Achieve nearly 3x token compression at over 93% ret…☆71Feb 3, 2026Updated 4 months ago