[Arxiv 2025] In-Video Instructions: Visual Signals as Generative Control
☆45Nov 25, 2025Updated 5 months ago
Alternatives and similar repositories for In-Video-Instructions
Users that are interested in In-Video-Instructions are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- DMax: Aggressive Parallel Decoding for dLLMs☆110Apr 20, 2026Updated last week
- Vico: Compositional Video Generation as Flow Equalization☆59Nov 15, 2024Updated last year
- [Interspeech 2024] LiteFocus is a tool designed to accelerate diffusion-based TTA model, now implemented with the base model AudioLDM2.☆34Mar 11, 2025Updated last year
- Code for CVPR 2024 Oral "Neural Lineage"☆17Jun 18, 2024Updated last year
- Vision Bridge Transformer at Scale☆142Dec 1, 2025Updated 5 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- [NeurIPS 2025] Thinkless: LLM Learns When to Think☆258Sep 26, 2025Updated 7 months ago
- ☆17Dec 11, 2024Updated last year
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning☆92Feb 14, 2025Updated last year
- DreamGaussian with 2D-GS☆12Oct 10, 2024Updated last year
- ☆34Dec 29, 2025Updated 4 months ago
- ☆13Nov 25, 2021Updated 4 years ago
- ☆109Nov 27, 2024Updated last year
- [NeurIPS 2024] Learning-to-Cache: Accelerating Diffusion Transformer via Layer Caching☆119Jul 15, 2024Updated last year
- [NeurIPS 2025] VeriThinker: Learning to Verify Makes Reasoning Model Efficient