Reproduction of the first step in the text-to-video model Phenaki. Code and model weights for the Transformer-based autoencoder for videos called CViViT.
☆29 · Aug 4, 2023 · Updated 2 years ago
Alternatives and similar repositories for phenaki-cvivit
Users who are interested in phenaki-cvivit are comparing it to the repositories listed below.
- SFT+RL boosts multimodal reasoning ☆48 · Jun 27, 2025 · Updated last year
- Large Language-and-Vision Assistant for BioMedicine, built towards multimodal GPT-4 level capabilities. ☆10 · Nov 29, 2023 · Updated 2 years ago
- Fine-tuning script for SDXL adapted from waifu-diffusion trainer ☆11 · Aug 21, 2023 · Updated 2 years ago
- [ICLR 2026] [NeurIPS 2025] ViPRA: Video Prediction for Robot Actions ☆45 · Jan 27, 2026 · Updated 5 months ago
- Implementation of Phenaki Video, which uses Mask GIT to produce text guided videos of up to 2 minutes in length, in Pytorch ☆792 · Jul 29, 2024 · Updated last year
- Third-party toolkit for Rope3D dataset ☆13 · Jun 13, 2022 · Updated 4 years ago
- Implementation of MagViT2 Tokenizer in Pytorch ☆666 · Jan 12, 2025 · Updated last year
- ☆28 · Jan 12, 2026 · Updated 5 months ago
- FeatureNeRF: Learning Generalizable NeRFs by Distilling Foundation Models, ICCV 2023 ☆13 · Jul 13, 2024 · Updated last year
- This repo consists of some experimental results on the bdd100k dataset using different object detection algorithms (Faster-RCNN, FCOS, ATSS) ☆11 · Jun 27, 2020 · Updated 6 years ago
- Code Guided Neural Style Transfer for Shape Stylization. ☆11 · Jan 12, 2026 · Updated 5 months ago
- Lidar line downsampling for the KITTI dataset, reducing the number of lidar lines from 64 to 32, 16, 8, etc. ☆13 · Jun 3, 2020 · Updated 6 years ago
- Code for the paper Imitation Learning from Observation with Automatic Discount Scheduling ☆13 · Mar 27, 2024 · Updated 2 years ago
- Pytorch implementation of deep fill v2 (original by Jiayu et al.) ☆10 · Jun 26, 2019 · Updated 7 years ago
- Unofficial implementation of "Pix2seq: A Language Modeling Framework for Object Detection" on mmdetection ☆34 · Apr 18, 2022 · Updated 4 years ago
- EgoToM is an egocentric theory-of-mind benchmark built on Ego4D videos, containing multi-choice questions that evaluate multimodal large … ☆16 · Apr 1, 2025 · Updated last year
- Main code of Dolphins dataset ☆16 · Dec 29, 2022 · Updated 3 years ago
- Python implementation of the paper 'Fast Range Image-Based Segmentation of Sparse 3D Laser Scans for Online Operation' ☆13 · Jan 4, 2021 · Updated 5 years ago
- Implementing the paper ☆15 · Nov 5, 2016 · Updated 9 years ago
- [NAACL 2024] Part-based, explainable and editable fine-grained image classifier that allows users to define a species in text ☆14 · Sep 19, 2025 · Updated 9 months ago
- Style Transfer by Deep Learning, overview and TensorFlow implementations (UNDER CONSTRUCTION) ☆14 · Jul 25, 2017 · Updated 8 years ago
- SEED-Voken: A Series of Powerful Visual Tokenizers ☆1,012 · Nov 25, 2025 · Updated 7 months ago
- RAST 1.0: Restorable Arbitrary Style Transfer via Multi-restoration ☆13 · Jun 18, 2024 · Updated 2 years ago
- Scalable Semi-Supervised Learning by Efficient Anchor Graph Regularization ☆13 · Jun 20, 2018 · Updated 8 years ago
- KWS demo based on CTC prefix beam search. ☆19 · Oct 21, 2023 · Updated 2 years ago
- ☆88 · Jan 4, 2024 · Updated 2 years ago
- Google MobileNets Implementation using Tensorflow ☆18 · Jun 6, 2017 · Updated 9 years ago
- ALFASVMLib - A Matlab library for adversarial label flip attacks against SVMs ☆13 · Jun 19, 2015 · Updated 11 years ago
- A partial implementation of Generative Infinite Vocabulary Transformer (GIVT) from Google Deepmind, in PyTorch. ☆21 · Mar 28, 2024 · Updated 2 years ago
- ☆43 · Jun 6, 2025 · Updated last year
- Official Pytorch code for "AesUST: Towards Aesthetic-Enhanced Universal Style Transfer" (ACM MM 2022) ☆15 · Dec 31, 2022 · Updated 3 years ago
- ☆21 · Aug 29, 2023 · Updated 2 years ago
- This is the official implementation of the paper "Evaluate and Improve the Quality of Neural Style Transfer" (CVIU 2021) ☆11 · Feb 14, 2022 · Updated 4 years ago
- A PyTorch re-implementation of Weakly Supervised Facial Action Unit Recognition through Adversarial Training ☆10 · Apr 23, 2019 · Updated 7 years ago
- This project explores the different techniques (both scalable and non scalable) for Graph based semi supervised learning. Recent techniqu… ☆14 · May 28, 2016 · Updated 10 years ago
- Toolkit for VIPER benchmark ☆16 · Aug 11, 2020 · Updated 5 years ago
- Multi-temporal Scene dataset for Scene Change Detection. ☆15 · Apr 14, 2021 · Updated 5 years ago
- Official JAX implementation of MAGVIT: Masked Generative Video Transformer ☆1,000 · Jan 17, 2024 · Updated 2 years ago
- A list of robotics related papers accepted by ICLR'25 ☆25 · Aug 28, 2025 · Updated 10 months ago