MCG-NJU/PDPP

[CVPR 2023 Highlight] PDPP: Projected Diffusion for Procedure Planning in Instructional Videos

☆34

Alternatives and similar repositories for PDPP

Users that are interested in PDPP are comparing it to the libraries listed below.

MCG-NJU / FlowBack
[AAAI 2026] Flowing Backwards: Improving Normalizing Flows via Reverse Representation Alignment
☆16 · Dec 9, 2025 · Updated 7 months ago
WiserZhou / MTID
Official PyTorch Implementation of Masked Temporal Interpolation Diffusion for Procedure Planning in Instructional Videos
☆11 · Jul 10, 2026 · Updated 2 weeks ago
Ravindu-Yasas-Nagasinghe / KEPP
[CVPR 2024] KEPP: Why Not Use Your Textbook? Knowledge-Enhanced Procedure Planning of Instructional Videos
☆12 · Sep 24, 2024 · Updated last year
MCG-NJU / JoMoLD
[ECCV 2022] Joint-Modal Label Denoising for Weakly-Supervised Audio-Visual Video Parsing
☆27 · Jul 15, 2022 · Updated 4 years ago
MCG-NJU / FlowDCN
[NeurIPS 2024] Exploring DCN-like Architectures for Fast Image Generation with Arbitrary Resolution
☆37 · Dec 23, 2024 · Updated last year
MCG-NJU / Video-DC
☆12 · Jul 30, 2025 · Updated 11 months ago
ant-research / LeviTor
[CVPR'25 Highlight] Official implementation for paper - LeviTor: 3D Trajectory Oriented Image-to-Video Synthesis
☆161 · Apr 15, 2025 · Updated last year
MCG-NJU / BasicTAD
BasicTAD: an Astounding RGB-Only Baseline for Temporal Action Detection
☆52 · Jun 10, 2023 · Updated 3 years ago
MCG-NJU / FreeRet
[ICML 2026] FreeRet: MLLMs as Training-Free Retrievers
☆22 · May 25, 2026 · Updated 2 months ago
MCG-NJU / VideoEval
VideoEval: Comprehensive Benchmark Suite for Low-Cost Evaluation of Video Foundation Model
☆15 · Jul 31, 2025 · Updated 11 months ago
MCG-NJU / TemporalPerceiver
[T-PAMI 2023] Temporal Perceiver: A General Architecture for Arbitrary Boundary Detection
☆39 · Aug 29, 2023 · Updated 2 years ago
MCG-NJU / DEQDet
[ICCV 2023] Deep Equilibrium Object Detection
☆28 · Jun 18, 2025 · Updated last year
MCG-NJU / STMixer
[CVPR 2023] STMixer: A One-Stage Sparse Action Detector
☆64 · May 18, 2023 · Updated 3 years ago
x4Cx58x54 / vistal
A visualization tool for temporal action localization (detection/segmentation).
☆13 · Mar 30, 2023 · Updated 3 years ago
facebookresearch / htstep
HT-Step is a large-scale article grounding dataset of temporal step annotations on how-to videos
☆26 · Mar 20, 2024 · Updated 2 years ago
MCG-NJU / MoG-VFI
Motion-Aware Generative Frame Interpolation
☆50 · Mar 11, 2025 · Updated last year
MCG-NJU / SPLAM
[ECCV 2024 Oral] SPLAM: Accelerating Image Generation with Sub-path Linear Approximation Model
☆24 · Nov 1, 2024 · Updated last year
srijandas07 / clip_baseline_LTA_Ego4d
Video + CLIP Baseline for Ego4D Long Term Action Anticipation Challenge (CVPR 2022)
☆15 · Jul 4, 2022 · Updated 4 years ago
MCG-NJU / DMM
DMM: Building a Versatile Image Generation Model via Distillation-Based Model Merging
☆47 · Apr 27, 2025 · Updated last year
Cogito2012 / PLID
[ECCV 2024] Prompting Language-Informed Distribution for Compositional Zero-Shot Learning
☆15 · Jan 4, 2025 · Updated last year
salesforce / paprika
Code for CVPR 2023 paper "Procedure-Aware Pretraining for Instructional Video Understanding"
☆50 · Jun 2, 2026 · Updated last month
MCG-NJU / Dynamic-MDETR
[TPAMI 2024] Dynamic MDETR: A Dynamic Multimodal Transformer Decoder for Visual Grounding
☆29 · Sep 11, 2024 · Updated last year
pritamqu / OOD-VSSL
[NeurIPS 2023 (Spotlight)] Uncovering the Hidden Dynamics of Video Self-supervised Learning under Distribution Shifts
☆13 · Jan 30, 2024 · Updated 2 years ago
pPetrichor / WorldCanvas
☆146 · Dec 19, 2025 · Updated 7 months ago
MCG-NJU / CoMAE
[AAAI 2023 Oral] CoMAE: Single Model Hybrid Pre-training on Small-Scale RGB-D Datasets
☆38 · Aug 20, 2024 · Updated last year
jutanke / social_diffusion
Re-implementation for ICCV23 "Social Diffusion: Long-term Multiple Human Motion Anticipation"
☆24 · Oct 3, 2023 · Updated 2 years ago
MCG-NJU / p-MoD
[ICCV 2025] p-MoD: Building Mixture-of-Depths MLLMs via Progressive Ratio Decay
☆44 · Jun 26, 2025 · Updated last year
MCG-NJU / HATReID-MOT
[ECCV 2026] History-Aware Transformation of ReID Features for Multiple Object Tracking
☆36 · Updated this week
Dawn-LX / OpenVoc-VidVRD
Official code for the ICLR 2023 paper Compositional Prompt Tuning with Motion Cues for Open-vocabulary Video Relation Detection
☆43 · Jun 4, 2024 · Updated 2 years ago
derkbreeze / LPT
Official implementation of the CVPR 2022 paper "Learning of Global Objective for Network Flow in Multi-Object Tracking"
☆17 · Dec 30, 2025 · Updated 6 months ago
MCG-NJU / VideoMAE-Action-Detection
[NeurIPS 2022 Spotlight] VideoMAE for Action Detection
☆70 · Feb 3, 2023 · Updated 3 years ago
MCG-NJU / CaReBench
A Fine-grained Benchmark for Video Captioning and Retrieval
☆30 · Jul 16, 2025 · Updated last year
half-potato / DCNv2
Deformable Convolutional Networks v2 with PyTorch
☆10 · Jul 29, 2020 · Updated 5 years ago
WANGSSSSSSS / GS2d_Triton
Gaussian Splatting 2D implemented in Triton
☆12 · Mar 19, 2024 · Updated 2 years ago
XiangzhuKong / CA-Dense-UNet
An unofficial code reproduction of Channel Attention Dense U-Net for Multichannel Speech Enhancement
☆13 · Jul 17, 2023 · Updated 3 years ago
olga-zats / GTDA
[ECCV 2024] Gated Temporal Action Anticipation for Stochastic Long-Term Anticipation
☆24 · May 29, 2025 · Updated last year
epic-kitchens / C1-Action-Recognition-TSN-TRN-TSM
EPIC-Kitchens-100 Action Recognition baselines: TSN, TRN, TSM
☆33 · Mar 15, 2022 · Updated 4 years ago
Finspire13 / DiffAct
Code for Diffusion Action Segmentation (ICCV 2023)
☆77 · Aug 16, 2023 · Updated 2 years ago
mingu6 / action_seg_ot
[CVPR 2024] Official code for the paper "Temporally Consistent Unbalanced Optimal Transport for Unsupervised Action Segmentation"
☆54 · Aug 22, 2024 · Updated last year