MCG-NJU / CoMAE
[AAAI 2023 Oral] CoMAE: Single Model Hybrid Pre-training on Small-Scale RGB-D Datasets
☆38 · Aug 20, 2024 · Updated last year
Alternatives and similar repositories for CoMAE
Users who are interested in CoMAE are comparing it to the repositories listed below.
- [CVPR 2025] Tra-MoE: Learning Trajectory Prediction Model from Multiple Domains for Adaptive Policy Conditioning ☆55 · Apr 1, 2025 · Updated 10 months ago
- [ICCV 2023] MGMAE: Motion Guided Masking for Video Masked Autoencoding ☆26 · Oct 16, 2023 · Updated 2 years ago
- VideoEval: Comprehensive Benchmark Suite for Low-Cost Evaluation of Video Foundation Model ☆14 · Jul 31, 2025 · Updated 6 months ago
- [CVPR 2021] CGA-Net: Category Guided Aggregation for Point Cloud Semantic Segmentation ☆24 · Jan 30, 2022 · Updated 4 years ago
- [ECCV 2022] Joint-Modal Label Denoising for Weakly-Supervised Audio-Visual Video Parsing ☆27 · Jul 15, 2022 · Updated 3 years ago
- Unofficial implementation of PointNet and PointNet++ ☆10 · Oct 26, 2023 · Updated 2 years ago
- ☆15 · Oct 21, 2023 · Updated 2 years ago
- [NeurIPS 2024] Exploring DCN-like Architectures for Fast Image Generation with Arbitrary Resolution ☆34 · Dec 23, 2024 · Updated last year
- ☆13 · Jun 25, 2016 · Updated 9 years ago
- [TIP] APP-Net: Auxiliary-point-based Push and Pull Operations for Efficient Point Cloud Recognition ☆13 · May 15, 2023 · Updated 2 years ago
- [ICLR 2025] Think Then React: Towards Unconstrained Action-to-Reaction Motion Generation ☆19 · Mar 21, 2025 · Updated 10 months ago
- [T-PAMI 2023] Temporal Perceiver: A General Architecture for Arbitrary Boundary Detection ☆38 · Aug 29, 2023 · Updated 2 years ago
- [ECCV 2024] ZeroI2V: Zero-Cost Adaptation of Pre-trained Transformers from Image to Video ☆23 · Jul 29, 2024 · Updated last year
- [CVPR 2024] Sparse Global Matching for Video Frame Interpolation with Large Motion ☆78 · Jul 4, 2024 · Updated last year
- [ICML 2025] Differentiable Solver Search for Fast Diffusion Sampling ☆21 · Jul 7, 2025 · Updated 7 months ago
- [Knowledge-Based Systems] Exploring Attention Mechanism for Graph Similarity Learning ☆20 · Feb 22, 2024 · Updated last year
- [CVPR 2024] Asymmetric Masked Distillation for Pre-Training Small Foundation Models ☆18 · Jan 11, 2026 · Updated last month
- [CVPR 2024] SportsHHI: A Dataset for Human-Human Interaction Detection in Sports Videos ☆17 · May 21, 2024 · Updated last year
- ☆20 · Oct 8, 2024 · Updated last year
- [MMM 2025 Best Paper] RoLD: Robot Latent Diffusion for Multi-Task Policy Modeling ☆22 · Aug 4, 2024 · Updated last year
- [CVPR 2023] STMixer: A One-Stage Sparse Action Detector ☆63 · May 18, 2023 · Updated 2 years ago
- Toolbox to evaluate categorical pose and shape estimation methods ☆29 · Feb 23, 2024 · Updated last year
- Foreground-Action Consistency Network for Weakly Supervised Temporal Action Localization ☆27 · Jun 14, 2022 · Updated 3 years ago
- Official implementation of the paper "Complementary Random Masking for RGB-T Semantic Segmentation." ☆63 · Mar 16, 2024 · Updated last year
- A Real-Time Depth Sensor Simulator with GPU Acceleration ☆35 · Aug 7, 2025 · Updated 6 months ago
- [NeurIPS 2023] MixFormerV2: Efficient Fully Transformer Tracking ☆201 · Apr 20, 2024 · Updated last year
- [ICCV 2023] MeMOTR: Long-Term Memory-Augmented Transformer for Multi-Object Tracking ☆215 · Oct 15, 2025 · Updated 4 months ago
- [TPAMI 2024] Dynamic MDETR: A Dynamic Multimodal Transformer Decoder for Visual Grounding ☆29 · Sep 11, 2024 · Updated last year
- [CVPR 2023 Highlight] PDPP: Projected Diffusion for Procedure Planning in Instructional Videos ☆33 · Aug 30, 2023 · Updated 2 years ago
- Learning from Temporal Gradient for Semi-supervised Action Recognition (CVPR 2022) ☆30 · Dec 1, 2022 · Updated 3 years ago
- (AAAI 2022) Self-Supervised Pretraining for RGB-D Salient Object Detection ☆101 · Apr 8, 2022 · Updated 3 years ago
- [IEEE TCSVT 2025] Bidirectional-Modulation Frequency-Heterogeneous Network for Remote Sensing Image Dehazing ☆15 · Jul 27, 2025 · Updated 6 months ago
- ☆36 · Dec 31, 2024 · Updated last year
- Research at Washington University in St. Louis - Erik Wijmans ☆37 · Oct 10, 2018 · Updated 7 years ago
- Repository for the code assignment of the Deep Learning 1 course, Fall 2021 edition ☆10 · Oct 31, 2022 · Updated 3 years ago
- Multi-Agent LLM System for Digital Scam Protection ☆12 · Dec 19, 2024 · Updated last year
- A source code in C++ and OpenCV with a Qt UI for manual registration of two images, named reference and moving. Now, Affine and Homograp… ☆10 · Jun 6, 2021 · Updated 4 years ago
- ☆38 · Sep 18, 2022 · Updated 3 years ago
- Transfer PaddlePaddle's codes to TensorLayerX's codes ☆10 · Feb 10, 2023 · Updated 3 years ago