wangqinsi1/GAINRL

wangqinsi1 / GAINRL

[NeurIPS Spotlight 2025] Angles Don’t Lie: Unlocking Training-Efficient RL Through the Model’s Own Signals.

☆83

Alternatives and similar repositories for GAINRL

Users who are interested in GAINRL are comparing it to the libraries listed below.

T2S-Bench / T2S-Bench
This is the official implementation of T2S-Bench & Structure-of-Thought: Benchmarking and Prompting Comprehensive Text-to-Structure Reasonin…
☆24 · Mar 5, 2026 · Updated 4 months ago
wangqinsi1 / 2025-ICML-CoreMatching
[ICML 2025] CoreMatching: Co-adaptive Sparse Inference Framework for Comprehensive Acceleration of Vision Language Model
☆16 · May 27, 2025 · Updated last year
Yuzhe-Fu / FlashFPS
[DAC 2026] FlashFPS
☆15 · Jun 1, 2026 · Updated last month
wangqinsi1 / CoreInfer
This is the official Python version of CoreInfer: Accelerating Large Language Model Inference with Semantics-Inspired Adaptive Sparse Act…
☆18 · Oct 25, 2024 · Updated last year
wangqinsi1 / Vision-Zero
[ICLR 2026] Vision-Zero: Scalable VLM Self-Improvement via Strategic Gamified Self-Play.
☆136 · Feb 6, 2026 · Updated 5 months ago
Yuzhe-Fu / FractalCloud
[HPCA 2026] FractalCloud: A Fractal-Inspired Architecture for Efficient Large-Scale Point Cloud Processing
☆22 · Apr 21, 2026 · Updated 3 months ago
Ting-Justin-Jiang / ZEUS
[ACM MM 2026] ⚡ZEUS accelerates your diffuser. Any modality. Any model. Any scheduler. https://yixiao-wang-stats.github.io/zeus/
☆20 · Jun 2, 2026 · Updated last month
Zishan-Shao / FlashSVD
[AAAI 2026] Official implementation of "FlashSVD: Memory-Efficient Inference with Streaming for Low-Rank Models". If you find this reposi…
☆17 · May 1, 2026 · Updated 2 months ago
wangqinsi1 / MathNAS
[NeurIPS 2023] MathNAS: If Blocks Have a Role in Mathematical Architecture Design.
☆37 · Apr 10, 2024 · Updated 2 years ago
dubcyfor3 / Focus
[HPCA 2026 Best Paper Candidate] Official implementation of "Focus: A Streaming Concentration Architecture for Efficient Vision-Language …
☆59 · Feb 8, 2026 · Updated 5 months ago
seanscott1991 / Duke_SQL4DQA
☆16 · Nov 5, 2025 · Updated 8 months ago
seamoke / DPH-RL
This is the official implementation of the paper "The Choice of Divergence: A Neglected Key to Mitigating Diversity Collapse in Reinforcement…
☆20 · Feb 10, 2026 · Updated 5 months ago
gsarridis / FLAC
Fairness-Aware Representation Learning by Suppressing Attribute-Class Associations
☆13 · Mar 19, 2026 · Updated 4 months ago
mathpn / llm-docsmith
Generate Python docstrings automatically with LLM and syntax trees
☆20 · Jun 13, 2025 · Updated last year
tianyi-lab / Moltbook_Socialization
Does Socialization Emerge in AI Agent Society? A Case Study of Moltbook
☆18 · Feb 17, 2026 · Updated 5 months ago
svg-project / Quant-VideoGen
[ICML 2026] Auto-Regressive Long Video Generation via 2-Bit KV-Cache Quantization
☆60 · Updated this week
ZBox1005 / CoT-UQ
[ACL 2025] "CoT-UQ: Improving Response-wise Uncertainty Quantification in LLMs with Chain-of-Thought"
☆17 · Apr 3, 2025 · Updated last year
eliyastein / llm-zsh-plugin
Zsh completion plugin for the LLM CLI tool by Simon Willison
☆21 · May 28, 2025 · Updated last year
ELM-Research / ECG-Language-Models
A research-oriented training and evaluation framework for ECG-Language Models (ELMs)
☆16 · Updated this week
Leey21 / A-Data-Centric-Study
☆18 · Mar 2, 2026 · Updated 4 months ago
guten-tag-100 / Real-Time-TTS-AI
☆15 · Jun 11, 2025 · Updated last year
Labman42 / JetEngine
A lightweight Inference Engine built for block diffusion models
☆47 · Apr 12, 2026 · Updated 3 months ago
xyzsam / mallacc
Mallacc: Accelerating Memory Allocation
☆13 · Jan 2, 2018 · Updated 8 years ago
zhexinli / Q-ViT-DeiT
DeiT implementation for Q-ViT
☆26 · Apr 21, 2025 · Updated last year
SAI-Lab-NYU / QSVD
This repository provides the official implementation of QSVD, a method for efficient low-rank approximation that unifies Query-Key-Value …
☆28 · May 16, 2026 · Updated 2 months ago
mkantwala / DeepSeek-R1-TrainingSuite
Advanced implementation of DeepSeek-R1 featuring Group Relative Policy Optimization (GRPO) for mathematical reasoning AI. Integrates safe…
☆13 · Jan 29, 2025 · Updated last year
changyi7231 / NFE
A PyTorch implementation of Knowledge Graph Embedding by Normalizing Flows.
☆10 · Nov 22, 2022 · Updated 3 years ago
mcmahon-lab / ONN-QAT-SQL
Scripts for training neural networks resistant to photon shot noise with quantization-aware training, together with the code for simulati…
☆21 · Jan 31, 2022 · Updated 4 years ago
sherlockchou86 / face_properties_based_vggface
Age, gender and race estimation based on VGGFace using Tensorflow 2.0
☆15 · Apr 30, 2020 · Updated 6 years ago
zjunlp / xKG
Executable Knowledge Graphs for Replicating AI Research
☆16 · Jul 9, 2026 · Updated 2 weeks ago
heswithme / claude-usage-analyzer
Analyze Claude AI usage logs and calculate costs
☆19 · Jun 15, 2025 · Updated last year
ylsung / rsq
Code for "RSQ: Learning from Important Tokens Leads to Better Quantized LLMs"
☆23 · Mar 25, 2026 · Updated 4 months ago
limenlp / verl
AdaRFT: Efficient Reinforcement Finetuning via Adaptive Curriculum Learning
☆56 · Jun 13, 2025 · Updated last year
mandyyyyii / east
☆19 · Aug 4, 2025 · Updated 11 months ago
anitarau / SurgBenchKit
Repo for our work "Systematic Evaluation of Large Vision-Language Models for Surgical Artificial Intelligence"
☆21 · Jun 2, 2025 · Updated last year
enyac-group / UniQL
UniQL official repository (ICLR 2026)
☆17 · Jan 27, 2026 · Updated 6 months ago
kaichen / claco
claco is a CLI tool for boosting Claude Code productivity - manage hooks, slash commands, and inspect session history
☆34 · Feb 11, 2026 · Updated 5 months ago
StarDewXXX / AdaR1
The official repository of NeurIPS'25 paper "Ada-R1: From Long-Cot to Hybrid-CoT via Bi-Level Adaptive Reasoning Optimization"
☆24 · May 6, 2026 · Updated 2 months ago
syr-cn / ReMemR1
Look Back to Reason Forward: Revisitable Memory for Long-Context LLM Agents
☆43 · Apr 13, 2026 · Updated 3 months ago