wilson1yan / VideoGPT-PaperLinks
☆18Updated 4 years ago
Alternatives and similar repositories for VideoGPT-Paper
Users that are interested in VideoGPT-Paper are comparing it to the libraries listed below
Sorting:
- ElasticTok: Adaptive Tokenization for Image and Video☆83Updated last year
- Code release for NeurIPS 2023 paper SlotDiffusion: Object-centric Learning with Diffusion Models☆92Updated last year
- ☆123Updated 9 months ago
- Dataset splits and evaluation code for the paper "Benchmark for Compositional Text-to-Image Synthesis" (NeurIPS 2021)☆46Updated 3 years ago
- VQVAE for video prediction☆29Updated 3 years ago
- Adaptive Length Image Tokenization via Recurrent Allocation | How many tokens is an image worth ?☆137Updated 9 months ago
- Official Release of NeurIPS 2023 Spotlight paper "Object-Centric Slot Diffusion"☆71Updated last year
- ☆37Updated 9 months ago
- ☆73Updated 3 years ago
- [ICCV 2023] Unsupervised Compositional Concepts Discovery with Text-to-Image Generative Models☆85Updated 2 years ago
- ☆49Updated 2 years ago
- Single-pass Adaptive Image Tokenization for Minimum Program Search | What's the Kolmogorov Complexity of an Image?☆42Updated 3 months ago
- Official Implementation of Nabla-GFlowNet (ICLR 2025)☆26Updated 6 months ago
- ☆76Updated last year
- ☆48Updated 8 months ago
- Codebase of Truncated Consistency Models (ICLR 2025)☆30Updated 9 months ago
- Minimal multi-gpu implementation of EDM2: "Analyzing and Improving the Training Dynamics of Diffusion Models"☆39Updated last year
- ☆71Updated 9 months ago
- Official implementation of "Parameter-Efficient Orthogonal Finetuning via Butterfly Factorization"☆81Updated last year
- ☆39Updated 3 years ago
- Official implementation of the paper The Hidden Language of Diffusion Models☆74Updated last year
- A Video Tokenizer Evaluation Dataset☆138Updated 10 months ago
- [ICML 2024] Compositional Image Decomposition with Diffusion Models☆51Updated last year
- Collaborative Score Distillation for Consistent Visual Synthesis (NeurIPS 2023)☆119Updated 2 years ago
- IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks☆57Updated last year
- Code for the paper "GenHowTo: Learning to Generate Actions and State Transformations from Instructional Videos" published at CVPR 2024☆51Updated last year
- Research code for paper "Frido: Feature Pyramid Diffusion for Complex Scene Image Synthesis"☆115Updated last year
- ORES: Open-vocabulary Responsible Visual Synthesis☆13Updated last year
- Code for the paper "If at First You Don't Succeed, Try, Try Again: Faithful Diffusion-based Text-to-Image Generation by Selection"☆27Updated 2 years ago
- ☆17Updated 2 years ago