Al-th / grpo_experimentLinks
Experiment on reimplementation of GRPO RL
☆12Updated 7 months ago
Alternatives and similar repositories for grpo_experiment
Users that are interested in grpo_experiment are comparing it to the libraries listed below
Sorting:
- Framework for specifying and proving properties—such as robustness, fairness, and interpretability—of machine learning models using Lean …☆66Updated last month
- A star for organising blocks and playing with transformers.☆23Updated last year
- Rewriting Principia Mathematica in Lean☆134Updated last week
- Copies of prolog solvers for use from python☆18Updated last year
- Advanced Python Function Debugging with MCP Integration.☆57Updated 3 months ago
- ☆67Updated 3 months ago
- Tensor library & inference framework for machine learning☆110Updated this week
- PILF: A IPWT-inspired bionic continual learning experiment focus on mitigate catastrophic forgetting with Surprise-gated Mixture of Exper…☆36Updated 2 months ago
- A playground to make it easy to try crazy things☆33Updated 3 months ago
- A Full Transcript of the Lighthill Debate on AI from 1973, with Introductory Remarks☆32Updated last year
- Full Automation of Goal-driven LLM Dialog Threads with And-Or Recursors and Refiner Oracles☆45Updated 3 weeks ago
- Hierarchical topic segmentation of meeting transcripts using embeddings and divisive clustering.☆53Updated last year
- A tiny autograd engine with a Jax-like API☆74Updated 2 months ago
- A GPU Accelerated Binary Vector Store☆47Updated 7 months ago
- Implement recursion using English as the programming language and an LLM as the runtime.☆238Updated 2 years ago
- A MCP server for symbolic manipulation of mathematical expressions☆40Updated 2 months ago
- ☆27Updated last year
- LLM plugin for pulling content from Hacker News☆118Updated 4 months ago
- Prototyping a question and answer bot over PDFs☆39Updated last year
- An experimental transformer stack and symbolic computation engine built entirely from first principles in pure Python.☆37Updated 5 months ago
- A comprehensive suite of tools, built to liberate science by making the creation, evaluation, and dissemination of research more transpar…☆211Updated last month
- A library for building software agents using behavior trees and language models.☆87Updated 7 months ago
- Physical AI Assistant that illuminates your life☆165Updated last month
- Analyzing hacker news in real-time with Bytewax and Proton☆38Updated last year
- ☆71Updated last year
- Tools for LLM agents.☆62Updated 9 months ago
- Build data processing and data analysis pipelines that leverage the power of LLMs 🧠☆203Updated last week
- Visualize text embeddings☆40Updated 2 years ago
- Sequential Logic☆112Updated last month
- Autoregressive transformers in APL☆106Updated 3 weeks ago