Experiment on reimplementation of GRPO RL
☆17 · Feb 7, 2025 · Updated last year
Alternatives and similar repositories for grpo_experiment
Users that are interested in grpo_experiment are comparing it to the libraries listed below.
- Helper methods for Pandas Series and DataFrames to calculate numerically derivative and integral ☆11 · Jun 7, 2019 · Updated 6 years ago
- A star for organising blocks and playing with transformers. ☆23 · Apr 28, 2024 · Updated 2 years ago
- AI-first Customer 360 Framework with Chatbot ☆19 · Aug 26, 2025 · Updated 8 months ago
- A Low-Overhead tool for Floating-Point Exception Detection in NVIDIA GPUs ☆13 · Dec 17, 2024 · Updated last year
- VexFS is a Linux kernel-native file system with built-in vector search and semantic memory. Designed for AI agents, RAG, and LLM workload… ☆24 · Oct 19, 2025 · Updated 6 months ago
- LiteLLM model integration for Pydantic AI framework - access 100+ LLM providers through a unified interface ☆22 · Apr 15, 2026 · Updated 2 weeks ago
- micrograd in rust ☆16 · Oct 6, 2024 · Updated last year
- Speech-to-text typing for Linux/Wayland using Whisper. ☆38 · Apr 21, 2026 · Updated 2 weeks ago
- Glob Include Directive for Jade ☆10 · Dec 20, 2015 · Updated 10 years ago
- A digital Lunetta synth based on the 40106 inverter. ☆11 · Nov 9, 2021 · Updated 4 years ago
- 3D geoms for plotnine (grammar of graphics in Python) ☆13 · Aug 5, 2022 · Updated 3 years ago
- Procgen2: A community maintained fork of procgen ☆12 · Aug 25, 2022 · Updated 3 years ago
- Toy distributed PostgreSQL by implementing SQL over KV ☆11 · Jan 14, 2026 · Updated 3 months ago
- AirGradient Open Source Map ☆26 · Apr 1, 2026 · Updated last month
- Playground to practice "Designing Data-Intensive Applications" concepts ☆13 · Jan 31, 2023 · Updated 3 years ago
- Code-base for the paper Spectral Normalisation for Deep Reinforcement Learning: An Optimisation Perspective. ☆11 · Jun 26, 2021 · Updated 4 years ago
- USB 2.0 PC↔FPGA link using FT2232H Sync-FIFO — full Verilog core ☆38 · Apr 12, 2026 · Updated 3 weeks ago
- Simplified meal ordering app for local restaurants built with AngularJS and a LoopBack backend. ☆12 · Nov 15, 2013 · Updated 12 years ago
- ☆10 · Jun 13, 2022 · Updated 3 years ago
- Wafer-thin FlaskGPT on Vercel. ☆12 · Jun 1, 2023 · Updated 2 years ago
- An implementation of Deepmind's MuZero algorithm. ☆16 · Aug 23, 2021 · Updated 4 years ago
- PySOM - The Simple Object Machine Smalltalk implemented in Python ☆19 · Aug 19, 2025 · Updated 8 months ago
- brew autosuggestions for unknown commands on macos ☆27 · Sep 7, 2025 · Updated 7 months ago
- Higher Order SVD implementation in PyTorch ☆13 · Nov 14, 2022 · Updated 3 years ago
- This repository contains the entire pipeline (including data preprocessing, training, testing, evaluation and visualization) for the Shear… ☆10 · Dec 3, 2019 · Updated 6 years ago
- Implementation of the Monte-Carlo CTW AIXI approximation as described by Joel Veness et al. ☆12 · Jan 14, 2017 · Updated 9 years ago
- An example of graph embeddings for wikipedia page recommendations ☆11 · Aug 26, 2021 · Updated 4 years ago
- ☆12 · Jan 27, 2025 · Updated last year
- The SQLite of Semantic Search ☆31 · Sep 25, 2025 · Updated 7 months ago
- using Data and Typeable to get a direct reflection system for free, when we're implementing a toy language in Haskell ☆15 · Feb 21, 2020 · Updated 6 years ago
- ☆14 · Apr 20, 2026 · Updated 2 weeks ago
- Python scripts to process EVS (Event-based vision sensor) data ☆10 · Jan 30, 2024 · Updated 2 years ago
- Logistic Regression from Scratch - NumPy implementation with L1 and L2, cross-validation, Grid-Search, and sklearn benchmarks. Complete … ☆26 · Oct 22, 2025 · Updated 6 months ago
- Code release for our SIGGRAPH 2022 paper "Diffeomorphic Neural Surface Parameterization for 3D and Reflectance Acquisition" ☆18 · Dec 5, 2022 · Updated 3 years ago
- yet another model checker ☆23 · Apr 21, 2026 · Updated 2 weeks ago
- MIG Welder Controller ☆10 · May 20, 2015 · Updated 10 years ago
- 📱 RUNIC tamper detection demo - designed to serve as a parallel for understanding more complex tamper detection and integrity systems su… ☆16 · Apr 13, 2024 · Updated 2 years ago
- A micro ORM that gives developers control over the SQL executed while also providing an easy way to do basic CRUD operations on entities. ☆11 · Jul 22, 2018 · Updated 7 years ago
- A JavaScript implementation of SOM, a minimal Smalltalk for teaching and research. ☆17 · Feb 7, 2024 · Updated 2 years ago