Aims for memory-efficient training (24GB VRAM) on consumer GPUs. Optimizing language models through guidance tokens in reasoning chains, based on DeepSeekRL-Extended.
☆28Feb 23, 2025Updated last year
Alternatives and similar repositories for Guide-GRPO
Users that are interested in Guide-GRPO are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation for "DeltaPhi: Learning Physical Trajectory Residual for PDE Solving"☆13Jun 17, 2024Updated last year
- Code for "Holistic Physics Solver: Learning PDEs in a Unified Spectral-Physical Space"☆24Mar 25, 2026Updated 2 months ago
- [ICLR 2025] VideoGrain: This repo is the official implementation of "VideoGrain: Modulating Space-Time Attention for Multi-Grained Video …☆158Mar 24, 2025Updated last year
- [ICLR 2024] Test-Time RL with CLIP Feedback for Vision-Language Models.☆102Oct 20, 2025Updated 7 months ago
- Self-supervised Point Cloud Representation Learning via Separating Mixed Shapes☆21May 23, 2023Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [ICCV23] Bird’s-Eye-View Scene Graph for Vision-Language Navigation☆124Apr 12, 2024Updated 2 years ago
- ☆27Apr 5, 2024Updated 2 years ago
- MediaPipeを用いたハンドジェスチャーによる簡単なマウス操作を行うプログラムです。☆12Mar 17, 2021Updated 5 years ago
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];☆41Jan 4, 2024Updated 2 years ago
- This repo contains code for the paper "Psychologically-informed chain-of-thought prompts for metaphor understanding in large language mod…☆14Apr 28, 2023Updated 3 years ago
- (AAAI2024) Controllable 3D Face Generation with Conditional Style Code Diffusion☆40Apr 17, 2024Updated 2 years ago
- ☆33Feb 17, 2026Updated 3 months ago
- Multi-Figurative Language Generation (COLING 2022)☆12Jan 30, 2023Updated 3 years ago
- Code for our ACL'23 paper on how to identify metaphor mappings with the help of GPT-3☆11May 21, 2025Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- High-performance ASR tool using Faster Whisper, supporting custom models, multi-language transcription, and real-time processing feedback…☆10Sep 17, 2025Updated 8 months ago
- Graph Masked Autoencoders☆27Aug 28, 2022Updated 3 years ago
- ☆41Sep 9, 2025Updated 8 months ago
- C recursive descent parser based on Ian Piumarta's peg(1)☆20Feb 4, 2014Updated 12 years ago
- Multi-Domain Multi-Scale Diffusion Model for Low-Light Image Enhancement (AAAI'24)☆45Mar 1, 2025Updated last year
- Annotations and code for the EMNLP 2018 paper 'Weeding out Conventionalized Metaphors: A Corpus of Novel Metaphor Annotations'☆10Feb 20, 2023Updated 3 years ago
- Resources for our paper: "Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training"☆170Oct 20, 2025Updated 7 months ago
- ICME2022 Special Session “Beyond Accuracy: Responsible, Responsive, and Robust Multimedia Retrieval ”☆12Jun 3, 2024Updated last year
- Code for A Simple Episodic Linear Probe Improves Visual Recognition in the Wild☆34Feb 1, 2023Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆19Aug 1, 2025Updated 9 months ago
- Turn remote MCP servers into local command workflows.☆59Feb 28, 2026Updated 2 months ago
- Multi-Task instruction-tuned LLaMA☆14May 5, 2023Updated 3 years ago
- [IEEE TIP] Offical implementation for the work "BadCM: Invisible Backdoor Attack against Cross-Modal Learning".☆14Aug 30, 2024Updated last year
- ☆25Jul 17, 2025Updated 10 months ago
- ☆48Mar 25, 2025Updated last year
- Code and data for TACL paper It’s not Rocket Science: Interpreting Figurative Language in Narratives☆15Sep 4, 2023Updated 2 years ago
- Code and datasets for the paper "Can Pre-trained Language Models Interpret Similes as Smart as Human?" (ACL 2022)☆14Jan 4, 2023Updated 3 years ago
- ☆21Mar 16, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆14Jul 31, 2022Updated 3 years ago
- AC No Code 是偷懒者最好的在OJ中写代码AC的方式: Write nothing; submit nowhere.☆10May 18, 2020Updated 6 years ago
- Open library of musculoskeletal models and examples ready to be used with the AnyBody Modelling System.☆32May 20, 2026Updated last week
- Evaluating Alternatives to SFM Point Cloud Initialization for Gaussian Splatting☆13Jul 8, 2024Updated last year
- Insert Customizable Content into your Autodesk® Fusion 360® Designs and change the parameters to change the customizable parts.☆13Nov 15, 2021Updated 4 years ago
- Svelte context API patched with stricted types☆12Oct 22, 2021Updated 4 years ago
- go实现的弹幕消息推送系统☆13Aug 1, 2022Updated 3 years ago