Helios: Real Real-Time Long Video Generation Model
☆487Mar 5, 2026Updated this week
Alternatives and similar repositories for Helios
Users that are interested in Helios are comparing it to the libraries listed below
Sorting:
- iFSQ & LlamaGen-REPA☆93Jan 27, 2026Updated last month
- OpenVE-3M: A Large-Scale High-Quality Dataset for Instruction-Guided Video Editing☆38Jan 9, 2026Updated last month
- Official codebase for "Causal Forcing: Autoregressive Diffusion Distillation Done Right for High-Quality Real-Time Interactive Video Gene…☆407Updated this week
- 【COLING 2025🔥】Code for the paper "Is Parameter Collision Hindering Continual Learning in LLMs?".☆38Dec 5, 2024Updated last year
- [CVPR 2026] Reward Forcing: Efficient Streaming Video Generation with Rewarded Distribution Matching Distillation☆284Dec 15, 2025Updated 2 months ago
- StreamDiffusion, Live Stream APP☆364Updated this week
- ☆213Feb 11, 2025Updated last year
- Infinite-Forcing: Towards Infinite-Long Video Generation☆137Nov 13, 2025Updated 3 months ago
- Official repository for “PixelGen: Pixel Diffusion Beats Latent Diffusion with Perceptual Loss”☆205Feb 3, 2026Updated last month
- Official Implementation of VideoDPO☆160Jun 1, 2025Updated 9 months ago
- A tool for running and customizing real-time, interactive generative AI pipelines and models☆248Updated this week
- Official Repo for Self-Forcing++ High Quality Long Video Generation☆241Oct 13, 2025Updated 4 months ago
- GPT as a Monte Carlo Language Tree: A Probabilistic Perspective☆45Jan 18, 2025Updated last year
- Code for the paper "AsFT: Anchoring Safety During LLM Fune-Tuning Within Narrow Safety Basin".☆36Jul 10, 2025Updated 7 months ago
- [CVPR 2026🔥] Enhancing Spatial Understanding in Image Generation via Reward Modeling☆51Updated this week
- Scaling Interactive World Models to 1000-Frame Horizons via Pose-Free Hierarchical Memory☆122Feb 9, 2026Updated 3 weeks ago
- ICML2025☆63Aug 28, 2025Updated 6 months ago
- [CVPR 2025🔥] Enhancing Video VAE by Wavelet-Driven Energy Flow for Latent Video Diffusion Model☆198May 11, 2025Updated 9 months ago
- GPT-IMAGE-EDIT-1.5M: A Million-Scale, GPT-Generated Image Dataset☆245Aug 15, 2025Updated 6 months ago
- Official Implementation of ReCo: Region-Constraint In-Context Generation for Instructional Video Editing☆144Feb 26, 2026Updated last week
- [NeurIPS 2025 D&B🔥] OpenS2V-Nexus: A Detailed Benchmark and Million-Scale Dataset for Subject-to-Video Generation☆199Jan 7, 2026Updated last month
- Official implementation of the paper "Koala-36M: A Large-scale Video Dataset Improving Consistency between Fine-grained Conditions and Vi…☆237Mar 19, 2025Updated 11 months ago
- 4-steps distilled version of Wan2.2-TI2V-5B☆144Jan 26, 2026Updated last month
- [NeurIPS 2023] Official PyTorch implementation for the paper "CRoSS: Diffusion Model Makes Controllable, Robust and Secure Image Steganog…☆11Sep 28, 2023Updated 2 years ago
- Code and website for Self-Flow: Self-Supervised Flow Matching for Scalable Multi-Modal Synthesis☆244Updated this week
- ☆88Feb 14, 2026Updated 2 weeks ago
- ☆44Jan 19, 2026Updated last month
- On Path to Multimodal Generalist: General-Level and General-Bench☆18Jul 11, 2025Updated 7 months ago
- [ICLR 2025] Distilled Decoding 1: One-step Sampling of Image Auto-regressive Models with Flow Matching☆20Apr 21, 2025Updated 10 months ago
- A unified inference and post-training framework for accelerated video generation.☆3,111Updated this week
- [CVPR 2026] OmniTransfer: All-in-one Framework for Spatio-temporal Video Transfer☆224Feb 21, 2026Updated last week
- 📖 This is a repository for organizing papers, codes, and other resources related to personalized video generation and editing.☆62Dec 9, 2025Updated 2 months ago
- [NeurIPS 2024 Spotlight] The official implement of research paper "MotionBooth: Motion-Aware Customized Text-to-Video Generation"☆138Oct 8, 2024Updated last year
- 🚀 [ICLR 2026] SenseFlow: Scaling Distribution Matching for Flow-based Text-to-Image Distillation☆62Jan 13, 2026Updated last month
- HY-World 1.5: A Systematic Framework for Interactive World Modeling with Real-Time Latency and Geometric Consistency☆1,175Jan 13, 2026Updated last month
- [CVPR 2025] Official code of "DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Long…☆322Mar 30, 2025Updated 11 months ago
- WISE: A World Knowledge-Informed Semantic Evaluation for Text-to-Image Generation☆186Nov 6, 2025Updated 4 months ago
- The first open-domain closed-loop revisited benchmark for evaluating memory consistency and action control in world models.☆44Feb 10, 2026Updated 3 weeks ago
- ECCV2024_Parrot Captions Teach CLIP to Spot Text☆66Sep 6, 2024Updated last year