Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe
☆263May 3, 2026Updated this week
Alternatives and similar repositories for OPD
Users that are interested in OPD are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆157Apr 9, 2026Updated last month
- ☆35Jun 9, 2024Updated last year
- A user-friendly & efficient knowledge distillation framework for LLMs, supporting off-policy, on-policy (OPD), cross-tokenizer, multimoda…☆119Apr 30, 2026Updated last week
- Tuning-Free Image Editing with Fidelity and Editability via Unified Latent Diffusion Model☆13Dec 29, 2024Updated last year
- [TMLR 2025] Efficient Diffusion Models: A Survey☆182Dec 8, 2025Updated 5 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- [RSS 2025] PartInstruct: Part-level Instruction Following for Fine-grained Robot Manipulation☆17Mar 4, 2026Updated 2 months ago
- A Sober Look at Language Model Reasoning☆94Nov 18, 2025Updated 5 months ago
- survery of small language models☆18Jul 23, 2024Updated last year
- A PyTorch-Lightning based deep learning framework.☆11Apr 15, 2026Updated 3 weeks ago
- Official code of "UniVid: Unifying Vision Tasks with Pre-trained Video Generation Models" WACV2026☆37Nov 24, 2025Updated 5 months ago
- [Paper][EMNLP 2025] Enrich-on-Graph: Query-Graph Alignment for Complex Reasoning with LLM Enriching☆34Feb 8, 2026Updated 3 months ago
- Code for EMNLP2023 paper "MolCA: Molecular Graph-Language Modeling with Cross-Modal Projector and Uni-Modal Adapter".☆12Dec 27, 2023Updated 2 years ago
- ☆33May 27, 2025Updated 11 months ago
- ShanghaiTech SI140A Probability & Statistics for EECS, Spring 2023, Spring 2024.☆24May 1, 2026Updated last week
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- [EMNLP 2025 Findings] Familiarity-aware Evidence Compression for Retrieval Augmented Generation☆15Aug 20, 2025Updated 8 months ago
- Dataset Quantization with Active Learning based Adaptive Sampling [ECCV 2024]☆10Jul 9, 2024Updated last year
- Non-Autoregressive Math Word Problem Solver with Unified Tree Structure☆12Jan 13, 2024Updated 2 years ago
- Look Back to Reason Forward: Revisitable Memory for Long-Context LLM Agents☆35Apr 13, 2026Updated 3 weeks ago
- 🕵 Code for our EMNLP 2025 Main paper: "FlashAdventure: A Benchmark for GUI Agents Solving Full Story Arcs in Diverse Adventure Games"☆25Apr 26, 2026Updated last week
- Claw-Eval is an evaluation harness for evaluating LLM as agents. All tasks verified by humans.☆522Updated this week
- ☆27Apr 14, 2025Updated last year
- Officail Implementation for "Unified Diffusion-Based Rigid and Non-Rigid Editing with Text and Image Guidance"☆19Jan 25, 2024Updated 2 years ago
- Why Low-Precision Transformer Training Fails: An Analysis on Flash Attention☆65Apr 7, 2026Updated last month
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆27Sep 11, 2024Updated last year
- The OlymMATH dataset☆24Jun 1, 2025Updated 11 months ago
- Source code of "Training Free Graph Neural Networks for Graph Matching"☆12Jul 9, 2022Updated 3 years ago
- Large Language Models(LLMs) of Code☆20Apr 23, 2023Updated 3 years ago
- ☆22Jul 22, 2025Updated 9 months ago
- KeepGPU is a simple CLI app that keeps your GPUs running.☆34Mar 9, 2026Updated 2 months ago
- ☆62Apr 16, 2026Updated 3 weeks ago
- ☆10Jan 19, 2022Updated 4 years ago
- Official repository for the paper "Learning beyond Teacher: Generalized On-Policy Distillation with Reward Extrapolation"☆140Mar 18, 2026Updated last month
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆18Apr 10, 2025Updated last year
- Learning Harmonic Molecular Representations on Riemannian Manifold, ICLR, 2023☆25Mar 23, 2023Updated 3 years ago
- LONGAGENT: Scaling Language Models to 128k Context through Multi-Agent Collaboration☆11Mar 11, 2024Updated 2 years ago
- ☆35Dec 29, 2025Updated 4 months ago
- code for Preprint paper at Arxiv: MoT: Pre-thinking and Recalling Enable ChatGPT to Self-Improve with Memory-of-Thoughts☆24Nov 29, 2023Updated 2 years ago
- This repository provides the dataset introduced by our WSSTG paper☆13Jul 21, 2019Updated 6 years ago
- Learning from Next-Frame Prediction: Autoregressive Video Modeling Encodes Effective Representations☆22Dec 24, 2025Updated 4 months ago