[CVPR 2025] Official implementation of ByTheWay: Boost Your Text-to-Video Generation Model to Higher Quality in a Training-free Way
☆47Oct 10, 2025Updated 4 months ago
Alternatives and similar repositories for ByTheWay
Users that are interested in ByTheWay are comparing it to the libraries listed below
Sorting:
- official code for "BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning"☆37Jan 21, 2025Updated last year
- [NeurIPS 2025] Official implementation of HiFlow: Training-free High-Resolution Image Generation with Flow-Aligned Guidance☆86Sep 18, 2025Updated 5 months ago
- (ICLR 2026)Official repository of 'ScaleCap: Inference-Time Scalable Image Captioning via Dual-Modality Debiasing’☆59Jan 26, 2026Updated last month
- [CVPR 2026] An official implementation of "Think Visually, Reason Textually: Vision-Language Synergy in ARC"☆37Nov 26, 2025Updated 3 months ago
- From a student from F2103801 in Shanghai Jiao Tong University☆16Oct 3, 2024Updated last year
- [CVPR 2026] Official release of "Spatial-SSRL: Enhancing Spatial Understanding via Self-Supervised Reinforcement Learning"☆112Feb 25, 2026Updated last week
- We introduce OpenStory++, a large-scale open-domain dataset focusing on enabling MLLMs to perform storytelling generation tasks.☆16Aug 30, 2024Updated last year
- A custom node extension for ComfyUI that integrates Google's Veo 2 text-to-video generation capabilities.☆32Apr 12, 2025Updated 10 months ago
- [ICCV 2025] Light-A-Video: Training-free Video Relighting via Progressive Light Fusion☆505Oct 25, 2025Updated 4 months ago
- Jupyter notebooks for PuLID face transfer with Flux.1 dev. Able to run on Google Colab Free Tier☆18Dec 18, 2024Updated last year
- [ICCV 2025] MM-IFEngine: Towards Multimodal Instruction Following☆117Feb 13, 2026Updated 2 weeks ago
- ☆16Apr 23, 2024Updated last year
- ☆20Jun 26, 2024Updated last year
- [ICLR 2025] Official implementation of MotionClone: Training-Free Motion Cloning for Controllable Video Generation☆515Jun 17, 2025Updated 8 months ago
- [ICLR 2026] An official implementation of "STAR-Bench: Probing Deep Spatio-Temporal Reasoning as Audio 4D Intelligence"☆40Jan 17, 2026Updated last month
- CODA: Coordinating the Cerebrum and Cerebellum for a Dual-Brain Computer Use Agent with Decoupled Reinforcement Learning☆35Aug 28, 2025Updated 6 months ago
- Official implementation of "VSTAR: Generative Temporal Nursing for Longer Dynamic Video Synthesis"☆20Jan 26, 2025Updated last year
- ☆36Oct 12, 2024Updated last year
- A repo for generating random NFTs with metadata 100% on chain!☆37Mar 8, 2024Updated last year
- RelightVid: Temporal-Consistent Diffusion Model for Video Relighting☆102Apr 2, 2025Updated 11 months ago
- ☆19Jul 11, 2024Updated last year
- Utils for kolors☆17May 5, 2025Updated 9 months ago
- Txt2Img | Img2Img | + Multiple LoRAs, All in one jupyter notebook for Flux.1 dev/schnell. Able to run on Google Colab Free Tier☆21Dec 3, 2024Updated last year
- ☆24Feb 21, 2025Updated last year
- ☆25Mar 30, 2025Updated 11 months ago
- [ICLR 2026] Official implementation of DiCache: Let Diffusion Model Determine Its Own Cache☆55Jan 26, 2026Updated last month
- Advanced drum machine for ComfyUI featuring a 64-step sequencer, custom sample support, and retro hardware aesthetics.☆20Jan 19, 2026Updated last month