[AAAI 2023 Oral] CoMAE: Single Model Hybrid Pre-training on Small-Scale RGB-D Datasets
☆38Aug 20, 2024Updated last year
Alternatives and similar repositories for CoMAE
Users that are interested in CoMAE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [WACV 2025 Oral] Transferring Foundation Models for Generalizable Robotic Manipulation☆27Mar 28, 2025Updated last year
- [ICCV 2023] MGMAE: Motion Guided Masking for Video Masked Autoencoding☆26Oct 16, 2023Updated 2 years ago
- Data pre-processing and training code on Open-X-Embodiment with pytorch☆11Jan 20, 2025Updated last year
- VideoEval: Comprehensive Benchmark Suite for Low-Cost Evaluation of Video Foundation Model☆15Jul 31, 2025Updated 9 months ago
- [ECCV 2022] Joint-Modal Label Denoising for Weakly-Supervised Audio-Visual Video Parsing☆27Jul 15, 2022Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- [NeurIPS 2024] Exploring DCN-like Architectures for Fast Image Generation with Arbitrary Resolution☆35Dec 23, 2024Updated last year
- [ICML 2025] Differentiable Solver Search for Fast Diffusion Sampling☆21Jul 7, 2025Updated 10 months ago
- [ICLR 2025] Think Then React: Towards Unconstrained Action-to-Reaction Motion Generation☆20Mar 21, 2025Updated last year
- [CVPR 2021] CGA-Net: Category Guided Aggregation for Point Cloud Semantic Segmentation☆24Jan 30, 2022Updated 4 years ago
- ☆15Oct 21, 2023Updated 2 years ago
- [CVPR 2024] Sparse Global Matching for Video Frame Interpolation with Large Motion☆81Jul 4, 2024Updated last year
- [ECCV 2024] ZeroI2V: Zero-Cost Adaptation of Pre-trained Transformers from Image to Video☆23Jul 29, 2024Updated last year
- ☆45Dec 9, 2024Updated last year
- Learning to Annotate Part Segmentation with Gradient Matching (ICLR 2022)☆12Apr 26, 2022Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [ICCV 2023] Deep Equilibrium Object Detection☆28Jun 18, 2025Updated 11 months ago
- [TIP] APP-Net: Auxiliary-point-based Push and Pull Operations for Efficient Point Cloud Recognition☆13May 15, 2023Updated 3 years ago
- [CVPR 2023 Hightlight] PDPP: Projected Diffusion for Procedure Planning in Instructional Videos☆34Aug 30, 2023Updated 2 years ago
- [CVPR 2024] SportsHHI: A Dataset for Human-Human Interaction Detection in Sports Videos☆17May 21, 2024Updated 2 years ago
- [CVPR 2024] Asymmetric Masked Distillation for Pre-Training Small Foundation Models☆18Jan 11, 2026Updated 4 months ago
- ☆13Jun 25, 2016Updated 9 years ago
- [CVPR 2023] STMixer: A One-Stage Sparse Action Detector☆63May 18, 2023Updated 3 years ago
- ☆11May 28, 2024Updated 2 years ago
- Official implementation of the paper "Complementary Random Masking for RGB-T Semantic Segmentation."☆64Mar 16, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [NeurIPS 2023] MixFormerV2: Efficient Fully Transformer Tracking☆216Apr 20, 2024Updated 2 years ago
- [ICCV 2023] MeMOTR: Long-Term Memory-Augmented Transformer for Multi-Object Tracking☆232Oct 15, 2025Updated 7 months ago
- [Knowledge-Based Systems] Exploring Attention Mechanism for Graph Similarity Learning☆20Feb 22, 2024Updated 2 years ago
- [MMM 2025 Best Paper] RoLD: Robot Latent Diffusion for Multi-Task Policy Modeling☆24Aug 4, 2024Updated last year
- ☆20Oct 8, 2024Updated last year
- The official repo of our work "Pensieve: Retrospect-then-Compare mitigates Visual Hallucination"☆15May 4, 2024Updated 2 years ago
- Implementation of the paper Knowledge-Enhanced Dual-stream Zero-shot Composed Image Retrieval (CVPR 2024)☆20Nov 4, 2024Updated last year
- code for paper "Masked Frequency Modeling for Self-Supervised Visual Pre-Training" (https://arxiv.org/pdf/2206.07706.pdf)☆24Feb 3, 2023Updated 3 years ago
- Toolbox to evaluate categorical pose and shape estimation methods☆28Feb 23, 2024Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Awesome lists about all kinds of softwares. 记录使用过的一些比较好用的软件。☆19Oct 7, 2025Updated 7 months ago
- [ICCV 2023] SportsMOT: A Large Multi-Object Tracking Dataset in Multiple Sports Scenes☆213Jul 24, 2023Updated 2 years ago
- [CVPR 2023] LinK: Linear Kernel for LiDAR-based 3D Perception☆95Jul 27, 2024Updated last year
- Official Code for ECCV2022: Learning Semantic Correspondence with Sparse Annotations☆18Aug 22, 2022Updated 3 years ago
- ☆33Sep 25, 2024Updated last year
- [CVPR 2022] Task-specific Inconsistency Alignment for Domain Adaptive Object Detection☆40Jul 20, 2022Updated 3 years ago
- Foreground-Action Consistency Network for Weakly Supervised Temporal Action Localization☆27Jun 14, 2022Updated 3 years ago