MCG-NJU / AMDView external linksLinks
[CVPR 2024] Asymmetric Masked Distillation for Pre-Training Small Foundation Models
☆18Jan 11, 2026Updated last month
Alternatives and similar repositories for AMD
Users that are interested in AMD are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2024] AWT: Transferring Vision-Language Models via Augmentation, Weighting, and Transportation☆115Oct 5, 2024Updated last year
- VideoEval: Comprehensive Benchmark Suite for Low-Cost Evaluation of Video Foundation Model☆14Jul 31, 2025Updated 6 months ago
- [T-PAMI 2023] Temporal Perceiver: A General Architecture for Arbitrary Boundary Detection☆38Aug 29, 2023Updated 2 years ago
- [CVPR 2024] SportsHHI: A Dataset for Human-Human Interaction Detection in Sports Videos☆17May 21, 2024Updated last year
- [ICCV 2023] MGMAE: Motion Guided Masking for Video Masked Autoencoding☆26Oct 16, 2023Updated 2 years ago
- Target Transformed Regression for Accurate Tracking☆21Dec 5, 2021Updated 4 years ago
- A Fine-grained Benchmark for Video Captioning and Retrieval☆26Jul 16, 2025Updated 6 months ago
- [CVPR 2021] CGA-Net: Category Guided Aggregation for Point Cloud Semantic Segmentation☆24Jan 30, 2022Updated 4 years ago
- [TPAMI 2024] Dynamic MDETR: A Dynamic Multimodal Transformer Decoder for Visual Grounding☆29Sep 11, 2024Updated last year
- [AAAI 2023 Oral] CoMAE: Single Model Hybrid Pre-training on Small-Scale RGB-D Datasets☆38Aug 20, 2024Updated last year
- VideoNSA: Native Sparse Attention Scales Video Understanding☆78Nov 16, 2025Updated 2 months ago
- [ICCV 2025] p-MoD: Building Mixture-of-Depths MLLMs via Progressive Ratio Decay☆43Jun 26, 2025Updated 7 months ago
- AIRS-2025赛道二:「星际矿脉」火星矿物高光谱分类挑战赛☆12May 7, 2025Updated 9 months ago
- ☆22Jul 18, 2025Updated 6 months ago
- ☆10Jan 25, 2026Updated 2 weeks ago
- ☆12Sep 11, 2021Updated 4 years ago
- A large-scale training and benchmarking framework for rPPG.☆10Nov 26, 2024Updated last year
- [NeurIPS 2022] PointTAD: Multi-Label Temporal Action Detection with Learnable Query Points☆46Nov 24, 2023Updated 2 years ago
- Gaussian Splating 2d implemented in triton☆11Mar 19, 2024Updated last year
- Open Set Video HOI detection from Action-centric Chain-of-Look Prompting, ICCV2023☆12Oct 3, 2023Updated 2 years ago
- [ICML2023] Instant Soup Cheap Pruning Ensembles in A Single Pass Can Draw Lottery Tickets from Large Models. Ajay Jaiswal, Shiwei Liu, Ti…☆11Nov 28, 2023Updated 2 years ago
- [TNNLS 2022] Official pytorch implementation of "Tackling the Challenges in Scene Graph Generation with Local-to-Global Interactions"☆11Apr 19, 2022Updated 3 years ago
- A collection of papers tackling automatic fact-checking (particularly of AI-generated content)☆14Nov 3, 2023Updated 2 years ago
- TESGNN: 3D Temporal Equivariant Scene Graph Neural Networks (published at TMLR)☆14Nov 2, 2025Updated 3 months ago
- Code for TCSVT paper "Exploring Spatio-Temporal Graph Convolution for Video-based Human-Object Interaction Recognition"☆12Mar 30, 2023Updated 2 years ago
- This repository provides the code for the methods and experiments presented in our paper 'Dual-stream Class-adaptive Network for Semi-sup…☆11Feb 29, 2024Updated last year
- Code to reproduce experiments in Markovian Flow Matching: Accelerating MCMC with Continuous Normalizing Flows☆13May 23, 2024Updated last year
- [CVPR 2023] This repository includes the official implementation our paper "Masked Autoencoders Enable Efficient Knowledge Distillers"☆109Jul 24, 2023Updated 2 years ago
- [CVPR 2022] Task-specific Inconsistency Alignment for Domain Adaptive Object Detection☆40Jul 20, 2022Updated 3 years ago
- ☆11Dec 20, 2021Updated 4 years ago
- [CVPR 2022] Progressive Attention on Multi-Level Dense Difference Maps for Generic Event Boundary Detection☆55Mar 6, 2023Updated 2 years ago
- ☆12Jan 10, 2025Updated last year
- A DEMO for “Local Transformer With Spatial Partition Restore for Hyperspectral Image Classification (Xue et al., JSTARS, 2022)”☆16Apr 17, 2024Updated last year
- The OBMO module embedded in PatchNet☆10Feb 21, 2024Updated last year
- [AAAI2023] Revisiting the Spatial and Temporal Modeling for Few-shot Action Recognition (SloshNet)☆13Jan 10, 2024Updated 2 years ago
- [ACM MM 2023] PoSynDA: Multi-Hypothesis Pose Synthesis Domain Adaptation for Robust 3D Human Pose Estimation☆12Aug 28, 2023Updated 2 years ago
- This is Pytorch implementation of our paper "LF-ViT: Reducing Spatial Redundancy in Vision Transformer for Efficient Image Recognition".☆11Sep 23, 2024Updated last year
- Solving Physics Puzzles by Reasoning about Paths (NeurIPS 2020 workshop)☆14Jun 28, 2022Updated 3 years ago
- A semantic segmentation method for high resolution image☆12Jul 1, 2022Updated 3 years ago