georgia-tech-db / eva-decord
An efficient video loader for deep learning with smart shuffling that's super easy to digest
☆42 Updated last year
Alternatives and similar repositories for eva-decord
Users who are interested in eva-decord are comparing it to the libraries listed below.
- Recaption large (Web)Datasets with vllm and save the artifacts. ☆52 Updated 7 months ago
- FRP Fork ☆169 Updated 2 months ago
- ☆54 Updated 2 years ago
- Simple large-scale training of stable diffusion with multi-node support. ☆133 Updated 2 years ago
- ☆77 Updated 9 months ago
- faster parallel inference of mochi-1 video generation model ☆121 Updated 4 months ago
- Data release for the ImageInWords (IIW) paper. ☆215 Updated 7 months ago
- End-to-end recipes for optimizing diffusion models with torchao and diffusers (inference and FP8 training). ☆361 Updated 3 weeks ago
- Faster generation with text-to-image diffusion models. ☆215 Updated 8 months ago
- MuLan: Adapting Multilingual Diffusion Models for 110+ Languages (adds multilingual support to any diffusion model without additional training) ☆135 Updated 5 months ago
- Image Prompter for Gradio ☆92 Updated last year
- Focused on fast experimentation and simplicity ☆75 Updated 6 months ago
- ONNX Runtime prebuilt wheels for Apple Silicon (M1 / M2 / M3 / ARM64) ☆215 Updated 11 months ago
- This repository implements the idea of "caption upsampling" from DALL-E 3 with Zephyr-7B and gathers results with SDXL. ☆152 Updated last year
- Open-source and reproducible benchmarks for Speaker Diarization ☆27 Updated this week
- Official PyTorch implementation of TokenSet. ☆121 Updated 3 months ago
- A third-party component library based on Gradio. ☆108 Updated last week
- Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets. ☆157 Updated last year
- ☆101 Updated 5 months ago
- SSD-1B, an open-source text-to-image model, outperforming previous versions by being 50% smaller and 60% faster than SDXL. ☆177 Updated last year
- A Gradio component that can be used to annotate images with bounding boxes. ☆55 Updated this week
- [NeurIPS 2024] SlimSAM: 0.1% Data Makes Segment Anything Slim ☆332 Updated 4 months ago
- Official code for "RB-Modulation: Training-Free Personalization of Diffusion Models using Stochastic Optimal Control" ☆388 Updated 3 months ago
- Code repository for T2V-Turbo and T2V-Turbo-v2 ☆302 Updated 4 months ago
- ☆63 Updated 9 months ago
- Easily create large video dataset from video urls ☆616 Updated 10 months ago
- InteractiveVideo: User-Centric Controllable Video Generation with Synergistic Multimodal Instructions ☆129 Updated last year
- Python bindings for ggml ☆141 Updated 9 months ago
- Scaling Vision Pre-Training to 4K Resolution ☆186 Updated 3 weeks ago
- ☆58 Updated last year