cnsdqd-dyb / Guide-GRPO
Aims for memory-efficient training (24 GB VRAM) on consumer GPUs. Optimizes language models through guidance tokens in reasoning chains; based on DeepSeekRL-Extended.
☆29 · Feb 23, 2025 · Updated 11 months ago
Alternatives and similar repositories for Guide-GRPO
Users who are interested in Guide-GRPO are comparing it to the repositories listed below.
- Implementation for "DeltaPhi: Learning Physical Trajectory Residual for PDE Solving" ☆13 · Jun 17, 2024 · Updated last year
- [TIP 2023] Co-Learning Meets Stitch-Up for Noisy Multi-label Visual Recognition. ☆13 · Aug 19, 2023 · Updated 2 years ago
- Evaluation code for "Efficient Emotional Adaptation for Audio-Driven Talking-Head Generation" ☆18 · Mar 10, 2024 · Updated last year
- Official repository of DoraemonGPT: Toward Understanding Dynamic Scenes with Large Language Models ☆89 · Feb 6, 2026 · Updated last week
- Global-to-Local Modeling for Video-based 3D Human Pose and Shape Estimation ☆59 · Jun 21, 2023 · Updated 2 years ago
- Musculoskeletal Analysis extension for 3D Slicer. Currently has cortical, cancellous, and bone density analysis. ☆12 · May 2, 2024 · Updated last year
- A simple lightweight Model Context Protocol (MCP) server integration framework ☆17 · Jan 23, 2026 · Updated 3 weeks ago
- ☆25 · Apr 5, 2024 · Updated last year
- Structured TRIZ prompt engineering for LLMs in an open, portable XML format – MIT licensed. ☆13 · Nov 11, 2025 · Updated 3 months ago
- AuraMatrix is a personality analysis website that uses an LLM for evaluation. I made this for Gyanotsav-2025 to show different ways to ut… ☆11 · Dec 22, 2025 · Updated last month
- (ICML 2024) Improve Context Understanding in Multimodal Large Language Models via Multimodal Composition Learning ☆28 · Sep 27, 2024 · Updated last year
- Code for A Simple Episodic Linear Probe Improves Visual Recognition in the Wild ☆33 · Feb 1, 2023 · Updated 3 years ago
- VibEx (vx) is a developer-friendly CLI tool that streamlines the process of working with AI coding assistants. It helps developers prepar… ☆28 · May 17, 2025 · Updated 8 months ago
- CoachLint is your AI coding coach. It guides you through errors instead of just solving them for you. ☆23 · Nov 20, 2025 · Updated 2 months ago
- MAGELLAN: Metacognitive predictions of learning progress guide autotelic LLM agents in large goal spaces ☆10 · Mar 24, 2025 · Updated 10 months ago
- An open source deep research clone. AI Agent (Local LLM or Gemini) that reasons large amounts of web data extracted with SwiftSoup. ☆13 · Feb 10, 2025 · Updated last year
- A powerful AI prompt engineering tool that transforms simple instructions into detailed, context-rich prompts using Google's Gemini Pro t… ☆15 · Aug 28, 2025 · Updated 5 months ago
- IBM watsonx Code Assistant for Red Hat Ansible Lightspeed demystifies the process of Ansible Playbook creation through generative AI-powe… ☆19 · Sep 18, 2025 · Updated 4 months ago
- SYSTEM PROMPT TRANSPARENCY FOR ALL ☆11 · May 22, 2025 · Updated 8 months ago
- Access to AI for free for anyone inside your Visual Studio. This is a Visual Studio extension. ☆19 · Dec 29, 2025 · Updated last month
- Houdini procedural tool for creating rivers ☆11 · Apr 6, 2023 · Updated 2 years ago
- ☆14 · Apr 4, 2025 · Updated 10 months ago
- ☆13 · Dec 12, 2022 · Updated 3 years ago
- "Open-source toolkit (Python Library, Registry API, CLI) for secure, decentralized AI agent interoperability using A2A/MCP." ☆14 · May 10, 2025 · Updated 9 months ago
- Shakey OS Mobile AI Framework for React Native allowing people to build React Native apps for IOS and Android with AI tooling and wallet … ☆28 · Feb 3, 2025 · Updated last year
- 📱 A template for your next React Native project: Expo, TypeScript, ReStyle, Husky, react-navigation, react-query, react-hook-form, zusta… ☆16 · Dec 15, 2025 · Updated last month
- React Native, Right Now (rn-rn) ☆18 · Sep 2, 2025 · Updated 5 months ago
- Emphasizes AI-based projects for various companies. ☆15 · Apr 1, 2025 · Updated 10 months ago
- an open database of human head models and companion optode locations for realistic Monte Carlo photon simulations ☆14 · Nov 19, 2025 · Updated 2 months ago
- AutonomousSphere is an agentic collaboration server. Agents talk, act, and use tools like teammates. Federated servers form an internet o… ☆16 · May 13, 2025 · Updated 9 months ago
- A Discord bot to retrieve Shopify Orders and Statistics ☆10 · Dec 9, 2025 · Updated 2 months ago
- 💀 gigasmol: a lightweight wrapper for gigachat api model for seamless use with smolagents. ☆15 · Oct 23, 2025 · Updated 3 months ago
- Spark projects. Learning book "Machine Learning with Spark" ☆10 · Jun 3, 2017 · Updated 8 years ago
- AI Tasks. An LLM-integrated agent orchestration tool for Liferay. ☆14 · May 16, 2025 · Updated 8 months ago
- ☆14 · Jun 19, 2024 · Updated last year
- [CVPR2024] CapHuman: Capture Your Moments in Parallel Universes ☆100 · Nov 20, 2024 · Updated last year
- ☆47 · Mar 25, 2025 · Updated 10 months ago
- ☆25 · Jul 28, 2025 · Updated 6 months ago
- ☆13 · Jun 29, 2024 · Updated last year