Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe
☆76Apr 16, 2026Updated this week
Alternatives and similar repositories for OPD
Users that are interested in OPD are comparing it to the libraries listed below.
- Summary of courses taken during undergraduate studies at ShanghaiTech University and master's studies at Tsinghua University☆49Feb 14, 2026Updated 2 months ago
- ☆34Jun 9, 2024Updated last year
- [TMLR 2025] Efficient Diffusion Models: A Survey☆180Dec 8, 2025Updated 4 months ago
- [RSS 2025] PartInstruct: Part-level Instruction Following for Fine-grained Robot Manipulation☆17Mar 4, 2026Updated last month
- Project explores collaboration capabilities of VDN and IQL agents on a custom MARL Food Collector environment☆11Apr 6, 2022Updated 4 years ago
- ☆68Dec 15, 2025Updated 4 months ago
- A PyTorch-Lightning based deep learning framework.☆11Dec 2, 2025Updated 4 months ago
- Official implementation of [CVPR 2025] RePerformer: Immersive Human-centric Volumetric Videos from Playback to Photoreal Reperformance☆25Sep 9, 2025Updated 7 months ago
- ShanghaiTech CS110 Computer Architecture, Spring 2023.☆10Updated this week
- ☆18Nov 21, 2024Updated last year
- ShanghaiTech SI140A Probability & Statistics for EECS, Spring 2023, Spring 2024.☆24Feb 15, 2026Updated 2 months ago
- Dataset Quantization with Active Learning based Adaptive Sampling [ECCV 2024]☆10Jul 9, 2024Updated last year
- ☆123Updated this week
- A Sober Look at Language Model Reasoning☆94Nov 18, 2025Updated 5 months ago
- 🕵 Code for our EMNLP 2025 Main paper: "FlashAdventure: A Benchmark for GUI Agents Solving Full Story Arcs in Diverse Adventure Games"☆25Dec 14, 2025Updated 4 months ago
- ☆18May 31, 2025Updated 10 months ago
- ShanghaiTech CS101 Algorithm and Data Structures, Fall 2022, Fall 2024.☆13Feb 15, 2026Updated 2 months ago
- PyTorch Implementation of Residual Multiplicative Filter Networks, NeurIPS 2022☆22Nov 17, 2022Updated 3 years ago
- ☆22Apr 22, 2025Updated 11 months ago
- ☆21Feb 24, 2025Updated last year
- Large Language Models(LLMs) of Code☆20Apr 23, 2023Updated 2 years ago
- ☆21Jul 22, 2025Updated 8 months ago
- E_G_M_C_T_S☆15Nov 30, 2022Updated 3 years ago
- A Systematic Analysis and Discussion of Claude Code for Designing Today's and Future AI Agent Systems☆182Updated this week
- Learning Harmonic Molecular Representations on Riemannian Manifold, ICLR, 2023☆25Mar 23, 2023Updated 3 years ago
- LONGAGENT: Scaling Language Models to 128k Context through Multi-Agent Collaboration☆11Mar 11, 2024Updated 2 years ago
- Awesome Reinforcement Learning from Human Feedback, the secret behind ChatGPT XD☆23Dec 13, 2022Updated 3 years ago
- Prioritize Alignment in Dataset Distillation☆21Dec 3, 2024Updated last year
- ☆13Apr 22, 2024Updated last year
- ☆26Jun 25, 2025Updated 9 months ago
- iTechX: Open source, edX-style, free course sharing platform.☆92Dec 7, 2025Updated 4 months ago
- OPSTL: Self-supervised Skeleton-based Action Recognition in Occluded Environments☆14Oct 25, 2023Updated 2 years ago
- Research on the Construction and Application of Paraphrase Parallel Corpus☆11Oct 26, 2020Updated 5 years ago
- (CVPR 2025) Official implementation of DELT: A Simple Diversity-driven EarlyLate Training for Dataset Distillation which outperforms SOTA…☆27Aug 23, 2025Updated 7 months ago
- ☆25May 31, 2024Updated last year
- A Mechanistic‑Interpretability study that finds the structural dynamics of Large Language Models under fine‑tuning.☆16May 30, 2025Updated 10 months ago
- [SIGGRAPH Asia 2024] Streaming Volumetric Video SIBR Viewer for V3: Viewing Volumetric Videos on Mobiles via Streamable 2D Dynamic Gaussi…☆31Nov 27, 2024Updated last year
- Decompresses the data in the <时光印记> software☆17Sep 24, 2021Updated 4 years ago
- The dataset of the paper "HiFi4G: High-Fidelity Human Performance Rendering via Compact Gaussian Splatting".☆41Jan 12, 2026Updated 3 months ago