wilson1yan / VideoGPT-PaperLinks
☆18Updated 4 years ago
Alternatives and similar repositories for VideoGPT-Paper
Users that are interested in VideoGPT-Paper are comparing it to the libraries listed below
Sorting:
- ☆121Updated 6 months ago
- ElasticTok: Adaptive Tokenization for Image and Video☆75Updated 9 months ago
- Code release for NeurIPS 2023 paper SlotDiffusion: Object-centric Learning with Diffusion Models☆90Updated last year
- VQVAE for video prediction☆27Updated 3 years ago
- ☆38Updated 6 months ago
- Dataset splits and evaluation code for the paper "Benchmark for Compositional Text-to-Image Synthesis" (NeurIPS 2021)☆47Updated 3 years ago
- Minimal multi-gpu implementation of EDM2: "Analyzing and Improving the Training Dynamics of Diffusion Models"☆34Updated last year
- Official Release of NeurIPS 2023 Spotlight paper "Object-Centric Slot Diffusion"☆67Updated last year
- [ICCV 2023] Unsupervised Compositional Concepts Discovery with Text-to-Image Generative Models☆85Updated last year
- Adaptive Length Image Tokenization via Recurrent Allocation | How many tokens is an image worth ?☆127Updated 6 months ago
- Code for the paper "If at First You Don't Succeed, Try, Try Again: Faithful Diffusion-based Text-to-Image Generation by Selection"☆27Updated 2 years ago
- ☆73Updated 3 years ago
- ☆50Updated last year
- Single-pass Adaptive Image Tokenization for Minimum Program Search | What's the Kolmogorov Complexity of an Image?☆39Updated last month
- This is the official source code for SLATE. We provide the code for the model, the training code, and a dataset loader for the 3D Shapes …☆86Updated 2 years ago
- [ICML 2024] Compositional Image Decomposition with Diffusion Models☆50Updated last year
- Collaborative Score Distillation for Consistent Visual Synthesis (NeurIPS 2023)☆120Updated last year
- Codebase of Truncated Consistency Models (ICLR 2025)☆29Updated 7 months ago
- ☆77Updated last year
- ☆10Updated 2 years ago
- ☆48Updated 5 months ago
- ☆69Updated 7 months ago
- ☆41Updated last year
- Official repo for the TMLR paper "Discffusion: Discriminative Diffusion Models as Few-shot Vision and Language Learners"☆30Updated last year
- Official Implementation of Nabla-GFlowNet (ICLR 2025)☆25Updated 3 months ago
- IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks☆57Updated 11 months ago
- ☆16Updated 2 years ago
- CycleReward is a reward model trained on cycle consistency preferences to measure image-text alignment.☆38Updated 3 weeks ago
- ☆39Updated 3 years ago
- A Video Tokenizer Evaluation Dataset☆130Updated 7 months ago