RLVR Testing and Training
☆23Aug 28, 2025Updated 7 months ago
Alternatives and similar repositories for Reinforcement-learning-with-verifable-rewards-Learnings
Users that are interested in Reinforcement-learning-with-verifable-rewards-Learnings are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Prompt-driven automation platform - Transform natural language into executable workflows☆32Jul 13, 2025Updated 9 months ago
- generate informative knowledge graph from text using open source models , ollama☆23Sep 1, 2025Updated 7 months ago
- A Rust embedded-hal HAL for all MCUs in the PSoC6 family☆11Dec 20, 2019Updated 6 years ago
- Demo of building and intergraition MCP Server☆21Apr 9, 2025Updated last year
- ☆16Mar 21, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A Simple, Explainable Vision Language Model for detecting manifacturing defects into products☆14Sep 23, 2025Updated 6 months ago
- jonnew's personal CAD library☆15Apr 7, 2026Updated last week
- ☆13May 25, 2023Updated 2 years ago
- AI in A Box☆25Feb 23, 2026Updated last month
- A Survey Analyzing Generalization in Deep Reinforcement Learning☆37Oct 31, 2024Updated last year
- A Pretty simple utility built with Vue.js to check if a domain is live or down.☆19Apr 24, 2018Updated 7 years ago
- ACL style for Typst☆22Jan 27, 2026Updated 2 months ago
- Reinforcement learning algorithm implementation☆10Oct 31, 2021Updated 4 years ago
- ☆12Oct 19, 2020Updated 5 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- An open source deep learning library for Unity.☆17Mar 15, 2026Updated last month
- C++ version of pyannote audio overlapped speech detection pipeline☆13Feb 14, 2024Updated 2 years ago
- Official code for "A General Learning Framework for Open Ad Hoc Teamwork Using Graph-based Policy Learning"☆17Mar 1, 2023Updated 3 years ago
- Out-of-the-box tool that cross compiles a rust tauri project for raspberry pi target.☆23Jan 15, 2026Updated 3 months ago
- Laravel-based autonomous AI agent platform with cyclic thinking, persistent memory and real-time code execution☆38Updated this week
- SiDeGame - Simplified Defusal Game☆13Apr 17, 2025Updated last year
- iTunes inside a docker container using wine☆18Nov 14, 2014Updated 11 years ago
- Open source code combining implementations of Upside Down Reinforcement Learning and Reward Conditioned Policies☆19Mar 10, 2021Updated 5 years ago
- An agent for playing Atari games running on a Teensy microcontroller☆15Nov 11, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆16Sep 28, 2023Updated 2 years ago
- Fulloch - The Fully Local Home Voice Assistant☆51Feb 10, 2026Updated 2 months ago
- LLM-based Multi-dimensional Debate Judge with Iterative Chronological Analysis☆19Oct 1, 2025Updated 6 months ago
- Error correction back-end for speaker diarization☆18Sep 26, 2023Updated 2 years ago
- Research that compiles.☆78Updated this week
- Synthetic Data Generator for Machine Learning Pipelines☆33Sep 2, 2025Updated 7 months ago
- Raspberry Pi HUD/dashboard framework☆19Sep 15, 2023Updated 2 years ago
- ☆11Mar 25, 2025Updated last year
- Modelling heterogeneous distributions with an Uncountable Mixture of Asymmetric Laplacians☆20Oct 27, 2019Updated 6 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆17Sep 25, 2025Updated 6 months ago
- Desktop application for instant AI-powered text transformation. Translate, correct, summarize, and change the tone of any text, anywhere,…☆30Dec 29, 2025Updated 3 months ago
- Auto Causal Inference Assistant for Banking using LangGraph and MCP☆23Jun 28, 2025Updated 9 months ago
- 4 bits quantization of LLaMa using GPTQ☆12Jun 2, 2023Updated 2 years ago
- Meta-analysis toolbox for basic research applications. Developed in MATLAB R2016b.☆13Apr 21, 2019Updated 6 years ago
- a tool to parse source code into a knowledge graph☆33Dec 21, 2025Updated 3 months ago
- Host LLM via text-generation-inference☆16Dec 5, 2023Updated 2 years ago