minisora-DiT, a DiT reproduction based on XTuner from the open source community MiniSora
☆40Mar 25, 2024Updated 2 years ago
Alternatives and similar repositories for MiniSora-DiT
Users that are interested in MiniSora-DiT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- MiniSora: A community aims to explore the implementation path and future development direction of Sora.☆1,285Feb 18, 2025Updated last year
- ☆44Oct 29, 2025Updated 5 months ago
- Crawler and cleaner of data for novelai embedding's training☆21May 22, 2025Updated 10 months ago
- ☆16Dec 15, 2021Updated 4 years ago
- Generic classification model☆10Apr 2, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- MICCAI 2022 MELA Challenge: Mediastinal Lesion Analysis (3D Detection)☆11Jun 30, 2022Updated 3 years ago
- Karras et al. (2022) diffusion models for PyTorch☆17Oct 5, 2023Updated 2 years ago
- wav2lip in a Vector Quantized (VQ) space☆27Jun 20, 2023Updated 2 years ago
- imagetokenizer is a python package, helps you encoder visuals and generate visuals token ids from codebook, supports both image and video…☆40Jun 22, 2024Updated last year
- PyTorch Sphinx Theme☆35Jan 3, 2024Updated 2 years ago
- ☆45Jan 12, 2024Updated 2 years ago
- Videoshop: Localized Semantic Video Editing with Noise-Extrapolated Diffusion Inversion☆45Aug 1, 2024Updated last year
- InternEvo is a high-performance training system for giant models.☆38Jan 17, 2024Updated 2 years ago
- ☆36Sep 21, 2025Updated 6 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆15Apr 14, 2024Updated 2 years ago
- Curated list of recent visual autoregressive (VAR) modeling works☆30Mar 17, 2025Updated last year
- A modified version of origin Magic Animate (https://showlab.github.io/magicanimate/)☆20Feb 27, 2024Updated 2 years ago
- LLaVA combines with Magvit Image tokenizer, training MLLM without an Vision Encoder. Unifying image understanding and generation.☆39Jun 20, 2024Updated last year
- 👆Pytorch implementation of "Ctrl-V: Higher Fidelity Video Generation with Bounding-Box Controlled Object Motion"☆33Jul 28, 2025Updated 8 months ago
- ☆11Oct 17, 2023Updated 2 years ago
- [TMLR 2025] Latte: Latent Diffusion Transformer for Video Generation.☆1,931Oct 30, 2025Updated 5 months ago
- CSS-in-JS performance tests☆10Jan 4, 2017Updated 9 years ago
- ☆29May 13, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- 🔥stable, simple, state-of-the-art VQVAE toolkit & cookbook☆105Jun 23, 2024Updated last year
- A Modern Configuration/Registry System designed for deeplearning, with some utils.☆18Dec 23, 2025Updated 3 months ago
- [CVPR 2024] Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers☆684Oct 25, 2024Updated last year
- ☆49Jul 12, 2023Updated 2 years ago
- [ICCV2023] Spatio-temporal Prompting Network for Robust Video Feature Extraction☆10Aug 17, 2023Updated 2 years ago
- 📝The official repository of "Rethinking Cross-Generator Image Forgery Detection through DINOv3"☆22Dec 2, 2025Updated 4 months ago
- A Python library for controlling AlphaDog robotic dogs.☆12Jun 19, 2025Updated 9 months ago
- Official codes for the paper "GARDO: Reinforcing Diffusion Models without Reward Hacking"☆56Feb 2, 2026Updated 2 months ago
- [IJCV 2024] LaVie: High-Quality Video Generation with Cascaded Latent Diffusion Models☆951Nov 13, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆26Jun 10, 2025Updated 10 months ago
- VQVAE for video prediction☆31Apr 22, 2022Updated 3 years ago
- ☆53Sep 29, 2025Updated 6 months ago
- Useful utilities for huggingface☆25Dec 26, 2025Updated 3 months ago
- [CVPR2024] Generative Region-Language Pretraining for Open-Ended Object Detection☆194Mar 29, 2025Updated last year
- MegEngine build with cu11x☆17Mar 13, 2023Updated 3 years ago
- JPEG-LM: LLMs as Image Generators with Canonical Codec Representations☆15Sep 29, 2024Updated last year