yikaiw / TokenFusion
[CVPR 2022] Code release for "Multimodal Token Fusion for Vision Transformers"
☆183 · Jul 21, 2022 · Updated 3 years ago
Alternatives and similar repositories for TokenFusion
Users interested in TokenFusion are comparing it to the repositories listed below.
- [TPAMI 2023, NeurIPS 2020] Code release for "Deep Multimodal Fusion by Channel Exchanging" ☆312 · Jul 14, 2024 · Updated last year
- ☆410 · Sep 2, 2024 · Updated last year
- [CVPR 2023] RILS: Masked Visual Reconstruction in Language Semantic Space (https://arxiv.org/abs/2301.06958) ☆44 · Sep 5, 2023 · Updated 2 years ago
- Boosting 3D Object Detection via Object-Focused Image Fusion ☆59 · Sep 11, 2022 · Updated 3 years ago
- Use python3 to convert depth image into hha image ☆195 · Apr 12, 2024 · Updated last year
- [ECCV 2020] PyTorch Implementation of some RGBD Semantic Segmentation models. ☆324 · Aug 17, 2020 · Updated 5 years ago
- Paper List for In-context Learning 🌷 ☆20 · Jan 3, 2023 · Updated 3 years ago
- ShapeConv: Shape-aware Convolutional Layer for Indoor RGB-D Semantic Segmentation (ICCV 2021) ☆114 · Aug 30, 2021 · Updated 4 years ago
- MultiMAE: Multi-modal Multi-task Masked Autoencoders, ECCV 2022 ☆614 · Dec 13, 2022 · Updated 3 years ago
- [NeurIPS2022] Let Images Give You More: Point Cloud Cross-Modal Training for Shape Analysis ☆74 · Jan 30, 2023 · Updated 3 years ago
- ☆92 · Aug 18, 2024 · Updated last year
- Group-Free 3D Object Detection via Transformers ☆257 · Jun 2, 2021 · Updated 4 years ago
- [NeurIPS 2021] [T-PAMI] DynamicViT: Efficient Vision Transformers with Dynamic Token Sparsification ☆648 · Jul 11, 2023 · Updated 2 years ago
- Repository of DELIVER dataset and CMNeXt models (CVPR 2023) ☆200 · Aug 16, 2024 · Updated last year
- Cascade Graph Neural Networks for RGB-D Salient Object Detection (ECCV20) ☆49 · Jan 28, 2022 · Updated 4 years ago
- Feature_reconstruction_Network_for_RGB-D_Semantic_Segmentation ☆12 · Apr 28, 2023 · Updated 2 years ago
- [CVPR2023] This is an official implementation of paper "DETRs with Hybrid Matching". ☆14 · Sep 1, 2022 · Updated 3 years ago
- Official Implementation of DE-CondDETR and DELA-CondDETR in "Towards Data-Efficient Detection Transformers" ☆45 · Aug 25, 2022 · Updated 3 years ago
- Official PyTorch implementation of "Extract Free Dense Labels from CLIP" (ECCV 22 Oral) ☆470 · Sep 19, 2022 · Updated 3 years ago
- ☆19 · May 27, 2023 · Updated 2 years ago
- Source Code for paper "Infrared and Visible Image Fusion via Parallel Scene and Texture Learning". ☆17 · Aug 3, 2022 · Updated 3 years ago
- A PyTorch implementation of paper 'Self-supervised Depth Completion from Direct Visual-LiDAR Odometry in Autonomous Driving'. ☆18 · Sep 16, 2020 · Updated 5 years ago
- [CVPR 2022] DenseCLIP: Language-Guided Dense Prediction with Context-Aware Prompting ☆542 · Sep 15, 2023 · Updated 2 years ago
- [NeurIPS'22] An official PyTorch implementation of PTv2. ☆430 · Jun 4, 2023 · Updated 2 years ago
- A paper list of RGBD semantic segmentation (processing) ☆416 · Oct 7, 2023 · Updated 2 years ago
- SASA: Semantics-Augmented Set Abstraction for Point-based 3D Object Detection ☆95 · Feb 18, 2022 · Updated 3 years ago
- [ICLR 2023] Autoencoders as Cross-Modal Teachers: Can Pretrained 2D Image Transformers Help 3D Representation Learning? ☆103 · Jul 1, 2024 · Updated last year
- [ECCV2022] This is an official implementation of paper "RankSeg: Adaptive Pixel Classification with Image Category Ranking for Segmentati… ☆78 · Feb 12, 2023 · Updated 3 years ago
- [NeurIPS 2022 Spotlight] P2P: Tuning Pre-trained Image Models for Point Cloud Analysis with Point-to-Pixel Prompting ☆132 · Jul 26, 2023 · Updated 2 years ago
- ☆34 · Nov 12, 2023 · Updated 2 years ago
- ImVoteNet: Boosting 3D Object Detection in Point Clouds With Image Votes ☆134 · Nov 17, 2022 · Updated 3 years ago
- Omnivore: A Single Model for Many Visual Modalities ☆571 · Nov 12, 2022 · Updated 3 years ago
- Bi-directional Adapter for Multi-modal Tracking ☆96 · Mar 19, 2024 · Updated last year
- A Unified Efficient Pyramid Transformer for Semantic Segmentation, ICCVW 2021 ☆31 · Oct 11, 2021 · Updated 4 years ago
- MDRNet+:Mitigating Modality Discrepancies for RGB-T Semantic Segmentation (ABMDRNet extended version) ☆22 · Feb 9, 2023 · Updated 3 years ago
- ☆22 · May 30, 2023 · Updated 2 years ago
- Hierarchical Multi-modal Fusion Tracker for RGB-T tracking (CVPR2022) ☆51 · Mar 25, 2023 · Updated 2 years ago
- [ICCV'21] Pri3D: Can 3D Priors Help 2D Representation Learning? ☆150 · Dec 17, 2021 · Updated 4 years ago
- Per-Pixel Classification is Not All You Need for Semantic Segmentation (NeurIPS 2021, spotlight) ☆1,450 · Mar 11, 2022 · Updated 3 years ago