Al-th / grpo_experiment
Experiment on reimplementation of GRPO RL
☆17 Updated 11 months ago
Alternatives and similar repositories for grpo_experiment
Users who are interested in grpo_experiment are comparing it to the libraries listed below
- Framework for specifying and proving properties—such as robustness, fairness, and interpretability—of machine learning models using Lean … ☆75 Updated 5 months ago
- A star for organising blocks and playing with transformers. ☆23 Updated last year
- A tiny autograd engine with a Jax-like API ☆74 Updated 6 months ago
- fast combinations calculation in jax ☆39 Updated last year
- Proof of thought : LLM-based reasoning using Z3 theorem proving with multiple backend support (SMT2 and JSON DSL) ☆363 Updated 3 months ago
- A Full Transcript of the Lighthill Debate on AI from 1973, with Introductory Remarks ☆33 Updated last year
- time to learn mlx ☆42 Updated 4 months ago
- A MCP server for symbolic manipulation of mathematical expressions ☆51 Updated 7 months ago
- Copies of prolog solvers for use from python ☆19 Updated last year
- Rewriting Principia Mathematica in Lean ☆136 Updated 2 weeks ago
- A probabilistic approximate DNF counter ☆39 Updated last month
- ☆69 Updated 3 months ago
- A library for building software agents using behavior trees and language models. ☆90 Updated 11 months ago
- A playground to make it easy to try crazy things ☆33 Updated last month
- Tensor library & inference framework for machine learning ☆117 Updated 3 months ago
- Code for the Fractured Entangled Representation Hypothesis position paper! ☆221 Updated 2 months ago
- a categorical deep learning compiler ☆207 Updated 3 months ago
- Designing bridge trusses with Pytorch autograd ☆61 Updated last year
- Advanced Python Function Debugging with MCP Integration. ☆58 Updated 7 months ago
- LLM verified with Monte Carlo Tree Search ☆284 Updated 9 months ago
- A package for defining deep learning models using categorical algebraic expressions. ☆61 Updated last year
- Implement recursion using English as the programming language and an LLM as the runtime. ☆239 Updated 2 years ago
- ☆91 Updated this week
- Pytorch script hot swap: Change code without unloading your LLM from VRAM ☆125 Updated 9 months ago
- Grow virtual creatures in static and physics simulated environments. ☆53 Updated last year
- Automatically extract executable programs from pruned mechanistic circuits, extending OpenAI's Sparse Circuits ☆60 Updated 2 months ago
- A library for reproducible deep learning. ☆83 Updated 2 months ago
- R.L. methods and techniques. ☆199 Updated this week
- ☆36 Updated 4 months ago
- Model Context Protocol (MCP) server for constraint optimization and solving ☆148 Updated 4 months ago