Guide-GRPO aims for memory-efficient training (24GB VRAM) on consumer GPUs. It optimizes language models through guidance tokens in reasoning chains and is based on DeepSeekRL-Extended.
☆28 · Feb 23, 2025 · Updated last year
Alternatives and similar repositories for Guide-GRPO
Users that are interested in Guide-GRPO are comparing it to the libraries listed below.
- Implementation for "DeltaPhi: Learning Physical Trajectory Residual for PDE Solving" ☆13 · Jun 17, 2024 · Updated last year
- (VillagerAgent ACL 2024) A Graph based Minecraft multi agents framework ☆92 · Jun 5, 2026 · Updated last week
- Implementation of the paper Knowledge-Enhanced Dual-stream Zero-shot Composed Image Retrieval (CVPR 2024) ☆20 · Nov 4, 2024 · Updated last year
- ☆71 · Aug 6, 2025 · Updated 10 months ago
- Official repository of DoraemonGPT: Toward Understanding Dynamic Scenes with Large Language Models ☆90 · Feb 6, 2026 · Updated 4 months ago
- [AAAI 2026] The Avengers: A Simple Recipe for Uniting Smaller Language Models to Challenge Proprietary Giants ☆45 · Dec 11, 2025 · Updated 6 months ago
- The official code for [ACM MM 2022] 'In-N-Out Generative Learning for Dense Unsupervised Video Segmentation'. ☆20 · Feb 22, 2023 · Updated 3 years ago
- The collections of MOE (Mixture Of Expert) papers, code and tools, etc. ☆12 · Mar 15, 2024 · Updated 2 years ago
- Global-to-Local Modeling for Video-based 3D Human Pose and Shape Estimation ☆59 · Jun 21, 2023 · Updated 2 years ago
- ☆27 · Apr 5, 2024 · Updated 2 years ago
- JPEG-LM: LLMs as Image Generators with Canonical Codec Representations ☆15 · Sep 29, 2024 · Updated last year
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler] ☆41 · Jan 4, 2024 · Updated 2 years ago
- (ICML 2024) Improve Context Understanding in Multimodal Large Language Models via Multimodal Composition Learning ☆28 · Sep 27, 2024 · Updated last year
- This repo contains code for the paper "Psychologically-informed chain-of-thought prompts for metaphor understanding in large language mod… ☆14 · Apr 28, 2023 · Updated 3 years ago
- ☆37 · Feb 17, 2026 · Updated 3 months ago
- Multi-Figurative Language Generation (COLING 2022) ☆12 · Jan 30, 2023 · Updated 3 years ago
- Official repository for ALT (ALignment with Textual feedback). ☆10 · Jul 25, 2024 · Updated last year
- Code for our ACL'23 paper on how to identify metaphor mappings with the help of GPT-3 ☆11 · May 21, 2025 · Updated last year
- High-performance ASR tool using Faster Whisper, supporting custom models, multi-language transcription, and real-time processing feedback… ☆10 · Sep 17, 2025 · Updated 8 months ago
- A UDP-based online auction house program with a client and a server. Language: python3; UI: pyqt5; multithreaded. ☆11 · Mar 27, 2020 · Updated 6 years ago
- Multi-Domain Multi-Scale Diffusion Model for Low-Light Image Enhancement (AAAI'24) ☆45 · Mar 1, 2025 · Updated last year
- Code repo for "S3R-Net: A Single-Stage Approach to Self-Supervised Shadow Removal" (NTIRE workshop @ CVPR 2024) ☆11 · Jun 15, 2024 · Updated 2 years ago
- [CVPR 2024] KEPP: Why Not Use Your Textbook? Knowledge-Enhanced Procedure Planning of Instructional Videos ☆12 · Sep 24, 2024 · Updated last year
- Resources for our paper: "Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training" ☆172 · Oct 20, 2025 · Updated 7 months ago
- An Android WebView with full screen video ☆10 · Aug 17, 2017 · Updated 8 years ago
- Code for A Simple Episodic Linear Probe Improves Visual Recognition in the Wild ☆34 · Feb 1, 2023 · Updated 3 years ago
- [IEEE TIP] Official implementation for the work "BadCM: Invisible Backdoor Attack against Cross-Modal Learning". ☆14 · Aug 30, 2024 · Updated last year
- ☆48 · Mar 25, 2025 · Updated last year
- Code and data for TACL paper It’s not Rocket Science: Interpreting Figurative Language in Narratives ☆15 · Sep 4, 2023 · Updated 2 years ago
- ☆16 · Mar 13, 2023 · Updated 3 years ago
- Code and datasets for the paper "Can Pre-trained Language Models Interpret Similes as Smart as Human?" (ACL 2022) ☆14 · Jan 4, 2023 · Updated 3 years ago
- ☆16 · Mar 22, 2023 · Updated 3 years ago
- DMAOT ranked 1st in the VOTS 2023 challenge. ☆17 · Dec 21, 2023 · Updated 2 years ago
- Open library of musculoskeletal models and examples ready to be used with the AnyBody Modelling System. ☆33 · Updated this week
- ☆47 · Mar 24, 2024 · Updated 2 years ago
- A danmaku (bullet-comment) message push system implemented in Go ☆13 · Aug 1, 2022 · Updated 3 years ago
- [NeurIPS 2021] Open Rule Induction ☆19 · May 22, 2022 · Updated 4 years ago
- An example showing how to use OpenCvSharp to process images from a find-the-difference game. ☆13 · Nov 10, 2017 · Updated 8 years ago
- Metaphor dataset: literal versus non-literal uses of words ☆14 · Nov 8, 2015 · Updated 10 years ago