Official repository for the paper "Learning beyond Teacher: Generalized On-Policy Distillation with Reward Extrapolation"
☆38Feb 21, 2026Updated 2 weeks ago
Alternatives and similar repositories for G-OPD
Users that are interested in G-OPD are comparing it to the libraries listed below
Sorting:
- A Comprehensive Benchmark of Imbalanced Graph Learning (Accepted by ICLR 2025 Spotlight)☆11Apr 17, 2025Updated 10 months ago
- Official implementation of FRAPPE: Infusing World Modeling into Generalist Policies via Multiple Future Representation Alignment☆28Feb 24, 2026Updated last week
- ☆14Updated this week
- [CVPR 2026] Official repo for "VideoSSR: Video Self-Supervised Reinforcement Learning"☆33Nov 11, 2025Updated 3 months ago
- ☆13Jan 14, 2026Updated last month
- Official implementation for “HarmonyGuard: Toward Safety and Utility in Web Agents via Adaptive Policy Enhancement and Dual-Objective Opt…☆25Jan 10, 2026Updated last month
- Entropy-Driven GRPO with Guided Error Correction for Advantage Diversity☆22Aug 28, 2025Updated 6 months ago
- Generate machine learning models fully automatically to clasiffiy any images using SERP data☆12Aug 25, 2022Updated 3 years ago
- [CVPR 2026] FocusUI: Efficient UI Grounding via Position-Preserving Visual Token Selection☆25Feb 10, 2026Updated 3 weeks ago
- 4-player chess engine☆11Feb 20, 2024Updated 2 years ago
- Code for paper "Reasoning Like an Economist: Post-Training on Economic Problems Induces Strategic Generalization in LLMs"☆12Jun 11, 2025Updated 8 months ago
- EgoToM is an egocentric theory-of-mind benchmark built on Ego4D videos, containing multi-choice questions that evaluate multimodal large …☆13Apr 1, 2025Updated 11 months ago
- ☆31Sep 19, 2025Updated 5 months ago
- ☆32Feb 13, 2026Updated 3 weeks ago
- Plug-and-Play Benchmarking of Reinforcement Learning Algorithms for Large-Scale Flow Control☆37Feb 22, 2026Updated last week
- 在 Mirai Console 中使用MCL管理包和其他高级功能☆10Nov 13, 2022Updated 3 years ago
- Explanation Optimization☆13Oct 16, 2020Updated 5 years ago
- ☆10Oct 20, 2023Updated 2 years ago
- All-in-One Safety Evaluation Framwork☆42Updated this week
- Exploiting Inter-sample and Inter-feature Relations in Dataset Distillation (CVPR24)☆11Jun 16, 2024Updated last year
- [ICML 2025] Closed-Loop Long-Horizon Robotic Planning via Equilibrium Sequence Modeling☆12May 5, 2025Updated 10 months ago
- 针对常见的BAT公司中的大数据面试和笔试问题,列出解决思路,并使用python来实现☆11Aug 17, 2015Updated 10 years ago
- An Efficient Dataset Condensation Plugin and Its Application to Continual Learning. NeurIPS, 2023.☆12Nov 29, 2023Updated 2 years ago
- Code and data for NAACL 2025 paper "IHEval: Evaluating Language Models on Following the Instruction Hierarchy"☆17Feb 25, 2025Updated last year
- Aligning Agentic World Models via Knowledgeable Experience Learning☆31Jan 25, 2026Updated last month
- This project demonstrates a real-time delivery location tracking system similar to Zomato/Swiggy, built using Spring Boot and Apache Kafk…☆28Dec 4, 2025Updated 3 months ago
- 一台海外Linux服务器,一行代码, 便能实现翻墙。好用的话求个Star。☆10Dec 20, 2018Updated 7 years ago
- ViGiL3D: A Linguistically Diverse Dataset for 3D Visual Grounding☆17Aug 8, 2025Updated 7 months ago
- Beyond Myopia: Learning from Positive and Unlabeled Data through Holistic Predictive Trends [NeurIPS 2023]☆10Jan 28, 2024Updated 2 years ago
- daVinci-Agency: Unlocking Long-Horizon Agency Data-Efficiently☆33Feb 4, 2026Updated last month
- [NeurIPS 2025] Beyond Masked and Unmasked: Discrete Diffusion Models via Partial Masking☆23Oct 22, 2025Updated 4 months ago
- ☆12Jul 16, 2024Updated last year
- ☆36Feb 12, 2026Updated 3 weeks ago
- [ICLR 2025] Linear Combination of Saved Checkpoints Makes Consistency and Diffusion Models Better☆16Feb 15, 2025Updated last year
- collab-dev - Collaboration Metrics for Code Reviews☆23May 12, 2025Updated 9 months ago
- Official code repo for the paper "MemGUI-Bench: Benchmarking Memory of Mobile GUI Agents in Dynamic Environments"☆25Updated this week
- Attention Is All You Need (https://arxiv.org/abs/1706.03762)☆10Apr 26, 2018Updated 7 years ago
- Contextual Vision Transformers for Robust Representation Learning☆15Oct 19, 2023Updated 2 years ago
- [CVPR 2025] An Implementation of the paper "Pre-Instruction Data Selection for Visual Instruction Tuning"☆17Jun 9, 2025Updated 8 months ago