wilson1yan / VideoGPT-Paper
☆18Updated 4 years ago
Alternatives and similar repositories for VideoGPT-Paper
Users who are interested in VideoGPT-Paper are comparing it to the repositories listed below
- ElasticTok: Adaptive Tokenization for Image and Video☆87Updated last year
- Code release for NeurIPS 2023 paper SlotDiffusion: Object-centric Learning with Diffusion Models☆93Updated 2 years ago
- ☆130Updated 11 months ago
- ☆73Updated 3 years ago
- VQVAE for video prediction☆31Updated 3 years ago
- IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks☆58Updated last year
- Official Release of NeurIPS 2023 Spotlight paper "Object-Centric Slot Diffusion"☆72Updated last year
- ☆38Updated 11 months ago
- Dataset splits and evaluation code for the paper "Benchmark for Compositional Text-to-Image Synthesis" (NeurIPS 2021)☆46Updated 3 years ago
- Adaptive Length Image Tokenization via Recurrent Allocation | How many tokens is an image worth?☆142Updated 11 months ago
- Official implementation of "Parameter-Efficient Orthogonal Finetuning via Butterfly Factorization"☆81Updated last year
- Official Code for Neural Systematic Binder☆34Updated 2 years ago
- Slot-TTA shows that test-time adaptation using slot-centric models can improve image segmentation on out-of-distribution examples.☆26Updated 2 years ago
- [ICML 2024] Compositional Image Decomposition with Diffusion Models☆52Updated last year
- ☆53Updated 2 years ago
- ☆40Updated 3 years ago
- [ICCV 2023] Unsupervised Compositional Concepts Discovery with Text-to-Image Generative Models☆85Updated 2 years ago
- This is the official source code for SLATE. We provide the code for the model, the training code, and a dataset loader for the 3D Shapes …☆88Updated 3 years ago
- Single-pass Adaptive Image Tokenization for Minimum Program Search | What's the Kolmogorov Complexity of an Image?☆42Updated 5 months ago
- (CVPR 2023) Seeing a Rose in Five Thousand Ways☆119Updated 2 years ago
- Official Implementation of Nabla-GFlowNet (ICLR 2025)☆28Updated 8 months ago
- A Video Tokenizer Evaluation Dataset☆147Updated last year
- Collaborative Score Distillation for Consistent Visual Synthesis (NeurIPS 2023)☆121Updated 2 years ago
- Official code for Slot-Transformer for Videos (STEVE)☆50Updated 3 years ago
- Official implementation of the paper The Hidden Language of Diffusion Models☆78Updated last year
- HD-EPIC Python script to download the entire dataset or parts of it☆15Updated 3 months ago
- Code for the paper "GenHowTo: Learning to Generate Actions and State Transformations from Instructional Videos" published at CVPR 2024☆52Updated last year
- ☆17Updated 2 years ago
- ☆72Updated 11 months ago
- Code for paper "Super-CLEVR: A Virtual Benchmark to Diagnose Domain Robustness in Visual Reasoning"☆44Updated 2 years ago