Experiment on reimplementation of GRPO RL
☆17Feb 7, 2025Updated last year
Alternatives and similar repositories for grpo_experiment
Users that are interested in grpo_experiment are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A star for organising blocks and playing with transformers.☆23Apr 28, 2024Updated last year
- AI-first Customer 360 Framework with Chatbot☆19Aug 26, 2025Updated 6 months ago
- LiteLLM model integration for Pydantic AI framework - access 100+ LLM providers through a unified interface☆20Nov 19, 2025Updated 4 months ago
- Opinionated tool to typeset theorems, lemmas and such☆32Feb 4, 2026Updated last month
- 🍔 Chen’s Private Cuisine Menu☆10Jan 4, 2026Updated 2 months ago
- Glob Include Directive for Jade☆10Dec 20, 2015Updated 10 years ago
- A collection of notebooks aiding the understanding of machine-learning papers.☆10Apr 5, 2021Updated 4 years ago
- Lie Algebras using Sympy and backend powered by Rust's pyO3 and ndarray☆12Dec 12, 2023Updated 2 years ago
- ☆11Dec 9, 2025Updated 3 months ago
- 🍔 A clean and minimal food menu template.☆17Apr 4, 2024Updated last year
- ☆21May 18, 2023Updated 2 years ago
- ☆14Jun 24, 2022Updated 3 years ago
- Simplified meal ordering app for local restaurants built with AngularJS and a LoopBack backend.☆12Nov 15, 2013Updated 12 years ago
- Typst Math within Anki flashcards.☆16Feb 4, 2025Updated last year
- Waffer-thin FlaskGPT on Vercel.☆12Jun 1, 2023Updated 2 years ago
- An implementation of Deepmind's MuZero algorithm.☆16Aug 23, 2021Updated 4 years ago
- Using OCR to convert images of formulas into Typst code.☆17Jul 27, 2025Updated 7 months ago
- Making use of SVG in iOS and macOS apps☆28Jul 23, 2020Updated 5 years ago
- ☆23Feb 3, 2026Updated last month
- simulation of RGB and depth cameras☆10Apr 1, 2022Updated 3 years ago
- PyTorch-based radio-interferometric imaging reconstruction package with scalable Bayesian uncertainty quantification relying on data-driv…☆12Feb 17, 2025Updated last year
- This repository contains the entire pipline (including data preprocessing, training, testing, evaluation and visualization) for the Shear…☆10Dec 3, 2019Updated 6 years ago
- ☆43Apr 29, 2025Updated 10 months ago
- An example of graph embeddings for wikipedia page recommendations☆11Aug 26, 2021Updated 4 years ago
- ☆12Jan 27, 2025Updated last year
- Applescripts for controlling Spotify☆23Oct 20, 2016Updated 9 years ago
- The companion code for the paper "Variational inference via Wasserstein gradient flows (W-VI) M. Lambert, S. Chewi, F. Bach, S. Bonnabel…☆14Feb 1, 2023Updated 3 years ago
- using Data and Typeable to get a direct reflection system for free, when we're implementing a toy language in Haskell☆15Feb 21, 2020Updated 6 years ago
- ☆12Jun 27, 2023Updated 2 years ago
- A micro ORM that gives developers control over the SQL executed while also providing an easy way to do basic CRUD operations on entities.☆11Jul 22, 2018Updated 7 years ago
- MIG Welder Controller☆10May 20, 2015Updated 10 years ago
- A simple, light-weight RSS parser for browser. Parse strings, URLs and get a JS object back☆10Jul 5, 2017Updated 8 years ago
- ☆13May 12, 2025Updated 10 months ago
- Nitrozen Design (Alpha) for Vue by Fynd☆18Feb 26, 2026Updated 3 weeks ago
- Straightforward and functional theorem/proof environments in Typst.☆17Feb 24, 2025Updated last year
- CuteRest is a REST client tool dedicated for JSON☆11Dec 12, 2023Updated 2 years ago
- A JavaScript implementation of SOM, a minimal Smalltalk for teaching and research.☆17Feb 7, 2024Updated 2 years ago
- Final project for COS 521: Using Hokusai algorithm to approximate frequency counts of hashtags in twitter data stream.☆12Jan 13, 2015Updated 11 years ago
- mysql library binding for D programming language☆18Jul 27, 2019Updated 6 years ago