Optimizes language models through guidance tokens in reasoning chains, based on DeepSeekRL-Extended. Aims for memory-efficient training on consumer GPUs (24 GB VRAM).
☆29 · Feb 23, 2025 · Updated last year
Alternatives and similar repositories for Guide-GRPO
Users who are interested in Guide-GRPO are comparing it to the libraries listed below.
- Implementation for "DeltaPhi: Learning Physical Trajectory Residual for PDE Solving" ☆13 · Jun 17, 2024 · Updated last year
- Global-to-Local Modeling for Video-based 3D Human Pose and Shape Estimation ☆59 · Jun 21, 2023 · Updated 2 years ago
- Future version of the AnyBody Managed Model Repository with a full thoracic spine model. ☆18 · Updated this week
- ☆25 · Apr 5, 2024 · Updated last year
- Code for SEEG: Semantic Energized Co-speech Gesture Generation ☆33 · Dec 3, 2022 · Updated 3 years ago
- Structured TRIZ prompt engineering for LLMs in an open, portable XML format – MIT licensed. ☆16 · Nov 11, 2025 · Updated 3 months ago
- AuraMatrix is a personality analysis web app that uses an LLM for evaluation. I made this for Gyanotsav-2025 to show different ways to ut… ☆11 · Dec 22, 2025 · Updated 2 months ago
- Implementation of the PCA algorithm using a Gram-Schmidt modification of NIPALS ☆10 · Jun 13, 2015 · Updated 10 years ago
- CoachLint is your AI coding coach. It guides you through errors instead of just solving them for you. ☆23 · Nov 20, 2025 · Updated 3 months ago
- VibEx (vx) is a developer-friendly CLI tool that streamlines the process of working with AI coding assistants. It helps developers prepar… ☆28 · May 17, 2025 · Updated 9 months ago
- Open library of musculoskeletal models and examples ready to be used with the AnyBody Modelling System. ☆30 · Updated this week
- ☆10 · Aug 22, 2017 · Updated 8 years ago
- Similar to the 2D Base Model, 3D Base Model is a bridge between images and 3D data. ☆25 · Updated this week