georgia-tech-db / eva-decord
An efficient video loader for deep learning with smart shuffling that's super easy to digest
☆44 Updated last year
Alternatives and similar repositories for eva-decord
Users who are interested in eva-decord are comparing it to the libraries listed below.
- FRP Fork ☆171 Updated 3 months ago
- Faster generation with text-to-image diffusion models. ☆222 Updated 3 weeks ago
- End-to-end recipes for optimizing diffusion models with torchao and diffusers (inference and FP8 training). ☆368 Updated last month
- Train high-quality text-to-image diffusion models in a data & compute efficient manner ☆501 Updated 3 months ago
- PyTorch media decoding and encoding ☆634 Updated this week
- faster parallel inference of mochi-1 video generation model ☆124 Updated 5 months ago
- Huggingface-compatible SDXL Unet implementation that is readily hackable ☆424 Updated last year
- ☆486 Updated 3 months ago
- Making Flux go brrr on GPUs. ☆116 Updated last week
- ☆434 Updated last year
- ⚡ Flash Diffusion ⚡: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation (AAAI 2025 Oral) ☆605 Updated 4 months ago
- Data release for the ImageInWords (IIW) paper. ☆217 Updated 8 months ago
- Easily create large video dataset from video urls ☆617 Updated 11 months ago
- Beyond Language Models: Byte Models are Digital World Simulators ☆325 Updated last year
- LLaVA-Interactive-Demo ☆375 Updated last year
- Projects based on SigLIP (Zhai et al., 2023) and Hugging Face transformers integration 🤗 ☆260 Updated 5 months ago
- ☆193 Updated last year
- Official code for "RB-Modulation: Training-Free Personalization of Diffusion Models using Stochastic Optimal Control" ☆394 Updated 4 months ago
- SCEPTER is an open-source framework used for training, fine-tuning, and inference with generative models. ☆530 Updated 3 months ago
- SSD-1B, an open-source text-to-image model, outperforming previous versions by being 50% smaller and 60% faster than SDXL. ☆177 Updated last year
- MuLan: Adapting Multilingual Diffusion Models for 110+ Languages (adds multilingual support to any diffusion model without additional training) ☆137 Updated 6 months ago
- [ICML 2025] Official PyTorch implementation of LongVU ☆391 Updated 2 months ago
- Recaption large (Web)Datasets with vllm and save the artifacts. ☆52 Updated 8 months ago
- ONNX Runtime prebuilt wheels for Apple Silicon (M1 / M2 / M3 / ARM64) ☆216 Updated last year
- [ICLR 2025] Official Implementation of Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image… ☆322 Updated 2 weeks ago
- Official implementation of Würstchen: Efficient Pretraining of Text-to-Image Models ☆549 Updated last year
- Implementation of Lumiere, SOTA text-to-video generation from Google Deepmind, in Pytorch ☆278 Updated last year
- ☆104 Updated 2 weeks ago
- A Distributed Attention Towards Linear Scalability for Ultra-Long Context, Heterogeneous Data Training ☆426 Updated this week
- ☆54 Updated 2 years ago