obvious-research/phenaki-cvivit

Reproduction of the first step in the text-to-video model Phenaki. Code and model weights for the Transformer-based autoencoder for videos called CViViT.

☆29

Alternatives and similar repositories for phenaki-cvivit

Users who are interested in phenaki-cvivit are comparing it to the repositories listed below.

franciszchen / SurgBox
☆16 · Dec 14, 2024 · Updated last year
waltonfuture / RL-with-Cold-Start
SFT+RL boosts multimodal reasoning
☆47 · Jun 27, 2025 · Updated last year
Kartik17 / Pedestrian-Region-Proposal-using-Lidar
Region Proposal generation on images using clustering in Pointcloud - Currently only for Pedestrians
☆11 · Jul 13, 2020 · Updated 6 years ago
lucidrains / magvit2-pytorch
Implementation of MagViT2 Tokenizer in Pytorch
☆668 · Jan 12, 2025 · Updated last year
abhinavtripathi95 / feature-tools
This repository contains tools for visualization of keypoint matches over two images (ORB, SIFT, LIFT, SuperPoint, D2-Net).
☆13 · Jul 23, 2019 · Updated 6 years ago
jianglongye / featurenerf
FeatureNeRF: Learning Generalizable NeRFs by Distilling Foundation Models, ICCV 2023
☆13 · Jul 13, 2024 · Updated 2 years ago
TengFeiHan0 / Object-Detection.pytorch
This repo consists of some experimental results on the bdd100k dataset using different object detection algorithms (Faster-RCNN, FCOS, ATSS)
☆11 · Jun 27, 2020 · Updated 6 years ago
thu-nics / FlashEval
☆14 · Aug 9, 2024 · Updated last year
chengtianle1997 / Lidar_Line_Downsample
Lidar line downsampling for the KITTI dataset: reduces the number of lidar lines from 64 to 32, 16, 8, etc.
☆13 · Jun 3, 2020 · Updated 6 years ago
arthurhero / deep_fill_2_pytorch
Pytorch implementation of deep fill v2 (original by Jiayu et al.)
☆10 · Jun 26, 2019 · Updated 7 years ago
Sharpiless / Pix2seq-mmdetection
Unofficial implementation of "Pix2seq: A Language Modeling Framework for Object Detection" on mmdetection
☆34 · Apr 18, 2022 · Updated 4 years ago
wilson1yan / teco
☆132 · Feb 22, 2025 · Updated last year
dwjshift / IL_ADS
Code for the paper Imitation Learning from Observation with Automatic Discount Scheduling
☆13 · Mar 27, 2024 · Updated 2 years ago
fusiming3 / MARS
Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis
☆86 · Jul 16, 2024 · Updated 2 years ago
explosion5 / Dolphins
Main code of Dolphins dataset
☆16 · Dec 29, 2022 · Updated 3 years ago
TruongKhang / image-matching-toolbox
This is a toolbox repository to help evaluate various methods that perform image matching from a pair of images.
☆12 · Jul 5, 2023 · Updated 3 years ago
liulai / reconet-torch
☆12 · Oct 12, 2020 · Updated 5 years ago
TencentARC / SEED-Voken
SEED-Voken: A Series of Powerful Visual Tokenizers
☆1,017 · Nov 25, 2025 · Updated 7 months ago
YingnanMa / RAST
RAST 1.0: Restorable Arbitrary Style Transfer via Multi-restoration
☆13 · Jun 18, 2024 · Updated 2 years ago
wisebobo / doc_ocr_by_template
This is an OCR program designed for travel documents. It currently supports 23 types of documents with pre-defined templates. You can add what…
☆10 · Nov 22, 2022 · Updated 3 years ago
JGuillaumin / style-transfer-workshop
Style Transfer by Deep Learning, overview and TensorFlow implementations (UNDER CONSTRUCTION)
☆14 · Jul 25, 2017 · Updated 8 years ago
huggingface / amused
☆89 · Jan 4, 2024 · Updated 2 years ago
sunjieee / MobileNets-Tensorflow
Google MobileNets Implementation using Tensorflow
☆18 · Jun 6, 2017 · Updated 9 years ago
elyxlz / givt-pytorch
A partial implementation of Generative Infinite Vocabulary Transformer (GIVT) from Google Deepmind, in PyTorch.
☆21 · Mar 28, 2024 · Updated 2 years ago
G-U-N / Awesome-Pixel-Flow
☆38 · Dec 25, 2025 · Updated 6 months ago
donydchen / ran_replicate
A PyTorch re-implementation of Weakly Supervised Facial Action Unit Recognition through Adversarial Training
☆10 · Apr 23, 2019 · Updated 7 years ago
peract / peract_colab
Annotated Tutorial for PerAct
☆19 · Sep 11, 2023 · Updated 2 years ago
zhechen / Deformable-DETR-REGO
☆41 · Sep 21, 2023 · Updated 2 years ago
TsungWeiTsai / SimCLR
Unofficial Pytorch Implementation of "A Simple Framework for Contrastive Learning of Visual Representations"
☆10 · Mar 11, 2020 · Updated 6 years ago
facebookresearch / EgoToM
EgoToM is an egocentric theory-of-mind benchmark built on Ego4D videos, containing multi-choice questions that evaluate multimodal large …
☆16 · Apr 1, 2025 · Updated last year
EndyWon / StyleEval
This is the official implementation of the paper "Evaluate and Improve the Quality of Neural Style Transfer" (CVIU 2021)
☆11 · Feb 14, 2022 · Updated 4 years ago
DaertML / context_distillation
Framework to achieve context distillation in LLMs
☆15 · Nov 24, 2023 · Updated 2 years ago
rulixiang / MtS-WH-Dataset
Multi-temporal Scene dataset for Scene Change Detection.
☆15 · Apr 14, 2021 · Updated 5 years ago
THU-LYJ-Lab / SS3DM-Exporter
[NeurIPS 2024] Data exporter for SS3DM: Benchmarking Street-View Surface Reconstruction with a Synthetic 3D Mesh Dataset
☆16 · Nov 8, 2024 · Updated last year
google-research / magvit
Official JAX implementation of MAGVIT: Masked Generative Video Transformer
☆1,002 · Jan 17, 2024 · Updated 2 years ago
srrichter / viper
Toolkit for VIPER benchmark
☆16 · Aug 11, 2020 · Updated 5 years ago
layumi / To-Academic-Newcomers
☆10 · Jan 20, 2021 · Updated 5 years ago
hairuoliu1 / ICLR-2025-Robotics
A list of robotics related papers accepted by ICLR'25
☆25 · Aug 28, 2025 · Updated 10 months ago
lalithjets / SurgicalGPT
☆28 · Feb 7, 2024 · Updated 2 years ago