Official Repository of "Learning what reinforcement learning can't"
☆79Dec 30, 2025Updated 2 months ago
Alternatives and similar repositories for ReLIFT
Users that are interested in ReLIFT are comparing it to the libraries listed below
Sorting:
- Official Repository of "Learning to Reason under Off-Policy Guidance"☆418Oct 4, 2025Updated 5 months ago
- ☆17Dec 23, 2025Updated 2 months ago
- Reproduced the DFT method without using Verl. https://arxiv.org/abs/2508.05629☆21Oct 14, 2025Updated 4 months ago
- The first unofficial implementation of CLIP4Caption: CLIP for Video Caption (ACMMM 2021)☆15Jan 2, 2023Updated 3 years ago
- Co-Reinforcement Learning for Unified Multimodal Understanding and Generation☆39Jul 22, 2025Updated 7 months ago
- ☆19Nov 13, 2023Updated 2 years ago
- REverse-Engineered Reasoning for Open-Ended Generation☆93Sep 10, 2025Updated 5 months ago
- Official Repo for SwS: A Weakness-driven Problem Synthesis Framework in RL for LLM Reasoning☆42Nov 11, 2025Updated 3 months ago
- [NeurIPS'25] Official Implementation of RISE (Reinforcing Reasoning with Self-Verification)☆31Aug 8, 2025Updated 6 months ago
- Code and data for "Does Spatial Cognition Emerge in Frontier Models?"☆27Apr 18, 2025Updated 10 months ago
- Trinity-RFT is a general-purpose, flexible and scalable framework designed for reinforcement fine-tuning (RFT) of large language models (…☆533Updated this week
- ☆27Dec 29, 2023Updated 2 years ago
- Agent Skill Induction: "Inducing Programmatic Skills for Agentic Tasks"☆38Apr 24, 2025Updated 10 months ago
- ☆43Mar 15, 2025Updated 11 months ago
- B-Spline Density Estimation Library - nonparametric density estimation using B-Spline density estimator from univariate sample.☆16Aug 22, 2021Updated 4 years ago
- ☆13Aug 5, 2024Updated last year
- Your efficient and accurate answer verification system for RL training.☆41Jun 23, 2025Updated 8 months ago
- Introduction about AWESOME_ENTROPY+LRM_PAPERS☆30Dec 16, 2025Updated 2 months ago
- ☆34Oct 22, 2025Updated 4 months ago
- Starter SDK for full-stack EVM applications, built for TreeHacks 2025 Web3 Workshop☆13Feb 14, 2025Updated last year
- This is an official implementation of the Reward rAnked Fine-Tuning Algorithm (RAFT), also known as iterative best-of-n fine-tuning or re…☆39Sep 22, 2024Updated last year
- ☆72Jun 10, 2025Updated 8 months ago
- ☆88Jun 7, 2024Updated last year
- [ICML 2025] Official Implementation of GLIDER☆72Oct 9, 2025Updated 4 months ago
- [ICLR 2026] Thinking on the Fly: Test-Time Reasoning Enhancement via Latent Thought Policy Optimization☆18Feb 14, 2026Updated 2 weeks ago
- Official implementation of the paper "Pretraining Language Models to Ponder in Continuous Space"☆25Jul 21, 2025Updated 7 months ago
- Training and testing code from our CVPR 2023 paper "Are Deep Neural Networks SMARTer than Second Graders?"☆11Aug 10, 2023Updated 2 years ago
- Source Code for Captionomaly: A Deep Learning Toolbox for Anomaly Captioning in Surveillance Videos☆13Jun 26, 2023Updated 2 years ago
- Evaluation Pipeline for medical tasks.☆12Feb 13, 2026Updated 2 weeks ago
- Some of my practices on Algorithms : ) 这个仓库保存着我在 LeetCode、剑指Offer 上的一些解答,代码中保留了必要的注释。不一定是最优的解答,但力保代码简洁易懂。后续还会整合其他题库,如若发现什么错误,希望你能告诉我或帮助我…☆11Dec 3, 2024Updated last year
- ☆20Aug 22, 2025Updated 6 months ago
- ProxyExplainer for Graph Neural Networks☆15Oct 24, 2024Updated last year
- A Python library for building modular, reproducible simulation pipelines in minutes☆32Aug 22, 2025Updated 6 months ago
- ☆10Oct 11, 2022Updated 3 years ago
- This is the code of a agentic rag method with dynamic workflow.☆12Jan 22, 2026Updated last month
- ☆11Nov 8, 2023Updated 2 years ago
- Understanding R1-Zero-Like Training: A Critical Perspective☆1,219Aug 27, 2025Updated 6 months ago
- PyTorch Implementation of "ZstGAN: An Adversarial Approach for Unsupervised Zero-Shot Image-to-Image Translation"☆42Jun 1, 2019Updated 6 years ago
- Third place of 2021 IEEE GRSS Data Fusion Contest: Track MSD☆10Mar 31, 2021Updated 4 years ago