Reproduction of the first step in the text-to-video model Phenaki. Code and model weights for the Transformer-based autoencoder for videos called CViViT.
☆29Aug 4, 2023Updated 2 years ago
Alternatives and similar repositories for phenaki-cvivit
Users that are interested in phenaki-cvivit are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- SFT+RL boosts multimodal reasoning☆48Jun 27, 2025Updated 10 months ago
- Implementation of Phenaki Video, which uses Mask GIT to produce text guided videos of up to 2 minutes in length, in Pytorch☆793Jul 29, 2024Updated last year
- Third-party toolkit for Rope3D dataset☆13Jun 13, 2022Updated 3 years ago
- ☆15Aug 9, 2021Updated 4 years ago
- Implementation of MagViT2 Tokenizer in Pytorch☆660Jan 12, 2025Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- OnlyFlow: Optical Flow based Motion Conditioning for Video Diffusion Models☆20Feb 20, 2025Updated last year
- ☆25Jan 12, 2026Updated 3 months ago
- This repo consist of some experimental results on bdd100k datasets using different object detection algorithms(Faster-RCNN, FCOS, ATSS)☆11Jun 27, 2020Updated 5 years ago
- Code Guided Neural Style Transfer for Shape Stylization.☆11Jan 12, 2026Updated 3 months ago
- Official implementation of "OmniX: From Unified Panoramic Generation and Perception to Graphics-Ready 3D Scenes".☆95Mar 31, 2026Updated last month
- ☆131Feb 22, 2025Updated last year
- code for the paper Imitation Learning from Observation with Automatic Discount Scheduling☆13Mar 27, 2024Updated 2 years ago
- Pytorch implementation of deep fill v2 (original by Jiayu et al.)☆10Jun 26, 2019Updated 6 years ago
- SwiftUI-inspired layout library for PixiJS☆23Apr 21, 2024Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Unofficial implement of "Pix2seq: A Language Modeling Framework for Object Detection" on mmdetection☆34Apr 18, 2022Updated 4 years ago
- This is a toolbox repository to help evaluate various methods that perform image matching from a pair of images.☆12Jul 5, 2023Updated 2 years ago
- Main code of Dolphins dataset☆16Dec 29, 2022Updated 3 years ago
- python implementation of the paper 'Fast Range Image-Based Segmentation of Sparse 3D Laser Scans for Online Operation'☆12Jan 4, 2021Updated 5 years ago
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis☆86Jul 16, 2024Updated last year
- ☆12Oct 12, 2020Updated 5 years ago
- [NAACL 2024] Part-based, explainable and editable fine-grained image classifier that allows users to define a species in text☆14Sep 19, 2025Updated 7 months ago
- Style Transfer by Deep Learning, overview and TensorFlow implementations (UNDER CONSTRUCTION)☆14Jul 25, 2017Updated 8 years ago
- Explorations into adversarial losses on top of autoregressive loss for language modeling☆41Dec 21, 2025Updated 4 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- SEED-Voken: A Series of Powerful Visual Tokenizers☆1,003Nov 25, 2025Updated 5 months ago
- RAST 1.0: Restorable Arbitrary Style Transfer via Multi-restoration☆13Jun 18, 2024Updated last year
- PyTorch Implementation of MobileDet (https://arxiv.org/abs/2004.14525v3) backbones.☆11Feb 12, 2024Updated 2 years ago
- ☆88Jan 4, 2024Updated 2 years ago
- Annotated Tutorial for PerAct☆19Sep 11, 2023Updated 2 years ago
- A partial implementation of Generative Infinite Vocabulary Transformer (GIVT) from Google Deepmind, in PyTorch.☆21Mar 28, 2024Updated 2 years ago
- ☆42Jun 6, 2025Updated 10 months ago
- Official Pytorch code for "AesUST: Towards Aesthetic-Enhanced Universal Style Transfer" (ACM MM 2022)☆15Dec 31, 2022Updated 3 years ago
- ☆18Aug 29, 2023Updated 2 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- This is the official implementation of paper "Evaluate and Improve the Quality of Neural Style Transfer" (CVIU 2021))☆11Feb 14, 2022Updated 4 years ago
- ☆10Jan 20, 2021Updated 5 years ago
- Unofficial Pytorch Implementation of "A Simple Framework for Contrastive Learning of Visual Representations"☆10Mar 11, 2020Updated 6 years ago
- Toolkit for VIPER benchmark☆15Aug 11, 2020Updated 5 years ago
- Multi-temporal Scene dataset for Scene Change Detection.☆15Apr 14, 2021Updated 5 years ago
- [NeurIPS 2024] Data exporter for SS3DM: Benchmarking Street-View Surface Reconstruction with a Synthetic 3D Mesh Dataset☆16Nov 8, 2024Updated last year
- Official JAX implementation of MAGVIT: Masked Generative Video Transformer☆997Jan 17, 2024Updated 2 years ago