wangqinsi1 / GAINRLView external linksLinks
This is the official Python version of Angles Don’t Lie: Unlocking Training-Efficient RL Through the Model’s Own Signals.
☆81Sep 26, 2025Updated 4 months ago
Alternatives and similar repositories for GAINRL
Users that are interested in GAINRL are comparing it to the libraries listed below
Sorting:
- A simple visual test-time scaling method for GUI agent grounding☆20Dec 7, 2025Updated 2 months ago
- [ICLR2026] "Co-rewarding: Stable Self-supervised RL for Eliciting Reasoning in Large Language Models"☆30Feb 4, 2026Updated last week
- Preview Code for Continuum Paper☆35Jan 26, 2026Updated 2 weeks ago
- [ICML2024 Spotlight] Fine-Tuning Pre-trained Large Language Models Sparsely☆24Jun 26, 2024Updated last year
- ☆34Dec 9, 2025Updated 2 months ago
- Code for the paper Boosting Accuracy and Robustness of Student Models via Adaptive Adversarial Distillation (CVPR 2023).☆34May 26, 2023Updated 2 years ago
- MemRec☆36Jan 16, 2026Updated 3 weeks ago
- ☆11Sep 27, 2022Updated 3 years ago
- Build an AI bot in Discord to serve user's personalized reports on what's up in tech☆28Sep 14, 2025Updated 5 months ago
- Repository of IPBench☆19Jan 4, 2026Updated last month
- ☆35Mar 12, 2025Updated 11 months ago
- ☆11Apr 28, 2024Updated last year
- ☆11Jul 17, 2023Updated 2 years ago
- ☆11Mar 31, 2022Updated 3 years ago
- Entropy-Driven GRPO with Guided Error Correction for Advantage Diversity☆22Aug 28, 2025Updated 5 months ago
- [NeurIPS 2025] Official code for "Tropical Attention: Neural Algorithmic Reasoning for Combinatorial Algorithms"☆23Oct 23, 2025Updated 3 months ago
- EduAction is an educational content generation application powered by GenAI developed during the Encode Club AI Hackathon London 2024.☆12Mar 24, 2024Updated last year
- GBM implementation on Legate☆14Jan 28, 2026Updated 2 weeks ago
- OSWorld-Human: Benchmarking the Efficiency of Computer-Use Agents☆21Jan 6, 2026Updated last month
- Embodied-Planner-R1: Unleashing Embodied Task Planning Ability in LLMs via Reinforcement Learning☆24Jan 5, 2026Updated last month
- The first EEG foundation model explicitly tailored for the motor imagery (MI) paradigm.☆91Feb 2, 2026Updated last week
- ☆15Jul 26, 2022Updated 3 years ago
- HippoMM: Hippocampal-inspired Multimodal Memory☆15May 22, 2025Updated 8 months ago
- [NeurIPS 2025@FoRLM] R1-Compress: Long Chain-of-Thought Compression via Chunk Compression and Search☆17Jan 24, 2026Updated 3 weeks ago
- Long Context Research☆26Jan 26, 2026Updated 2 weeks ago
- Resolves AWS secretmanager secrets from variables that give the secret ARNs and exposes them as plain environment variables☆13Aug 13, 2024Updated last year
- ☆15May 26, 2025Updated 8 months ago
- Serial monitor in rust☆14Jul 24, 2024Updated last year
- ☆12May 23, 2024Updated last year
- [ICML 2023] Protecting Language Generation Models via Invisible Watermarking☆13Sep 8, 2023Updated 2 years ago
- The source code and the data for ACL 2022 paper "Show Me More Details: Discovering Hierarchies of Procedures from Semi-structured Web Dat…☆14Apr 21, 2023Updated 2 years ago
- Fast search index for SPLADE sparse retrieval models implemented in Python using Numpy and Numba☆34Oct 16, 2025Updated 3 months ago
- EoRA: Fine-tuning-free Compensation for Compressed LLM with Eigenspace Low-Rank Approximation☆27Jul 30, 2025Updated 6 months ago
- ☆15Jan 12, 2026Updated last month
- 🎉 TrustJudge is accepted to ICLR 2026!☆38Sep 27, 2025Updated 4 months ago
- 包括轮廓自动识别筛选,区域分割,以及传统的缺陷检测算法☆13Jul 16, 2022Updated 3 years ago
- ☆13Mar 2, 2025Updated 11 months ago
- An LLM inference engine, written in C++☆18Feb 5, 2026Updated last week
- This is the implementation of the 4th place solution (yu4u's part) for RSNA 2024 Lumbar Spine Degenerative Classification at Kaggle.☆10Oct 11, 2024Updated last year