fukexue / POS-BERT
☆19Updated 5 months ago
Alternatives and similar repositories for POS-BERT:
Users that are interested in POS-BERT are comparing it to the libraries listed below
- [CVPR2022 Oral] 3DJCG: A Unified Framework for Joint Dense Captioning and Visual Grounding on 3D Point Clouds☆53Updated 2 years ago
- (AAAI2024) Point-PEFT: Parameter-Efficient Fine-Tuning for 3D Pre-trained Models☆52Updated 9 months ago
- [ICLR 2023] Autoencoders as Cross-Modal Teachers: Can Pretrained 2D Image Transformers Help 3D Representation Learning?☆102Updated 8 months ago
- This is the code related to "Context-aware Alignment and Mutual Masking for 3D-Language Pre-training" (CVPR 2023).☆27Updated last year
- [NeurIPS2022] Let Images Give You More: Point Cloud Cross-Modal Training for Shape Analysis☆73Updated 2 years ago
- [ICCV2021] 3DVG-Transformer: Relation Modeling for Visual Grounding on Point Clouds☆41Updated 2 years ago
- Official implementation for [3DV 2024] `Pix4Point: Image Pretrained Standard Transformers for 3D Point Cloud Understanding`☆47Updated 8 months ago
- [AAAI 2024-Oral] EPCL: Frozen CLIP Transformer is An Efficient Point Cloud Encoder☆29Updated 11 months ago
- [ICCV 2023] Take-A-Photo: 3D-to-2D Generative Pre-training of Point Cloud Models☆40Updated 7 months ago
- (ECCV 2022) DODA: Data-oriented Sim-to-Real Domain Adaptation for 3D Semantic Segmentation☆49Updated 2 years ago
- The code for the paper "Towards Compact 3D Representations via Point Feature Enhancement Masked Autoencoders" (AAAI'24).☆36Updated last year
- Official Implementation for "Mask-Attention-Free Transformer for 3D Instance Segmentation"☆63Updated last year
- ☆19Updated 10 months ago
- [AAAI 2024] The official implementation of the paper "3D-STMN: Dependency-Driven Superpoint-Text Matching Network for End-to-End 3D Refer…☆39Updated last year
- Multi-View Transformer for 3D Visual Grounding [CVPR 2022]☆73Updated 2 years ago
- Voint Cloud: Multi-View Point Cloud Representation for 3D Understanding (ICLR 2023)☆21Updated last year
- [ICML 2023] Contrast with Reconstruct: Contrastive 3D Representation Learning Guided by Generative Pretraining☆140Updated 7 months ago
- [ECCV 2022] Masked Discrimination for Self-Supervised Learning on Point Clouds☆94Updated 2 years ago
- [ECCV2022] D3Net: A Unified Speaker-Listener Architecture for 3D Dense Captioning and Visual Grounding☆43Updated 2 years ago
- [NeurIPS2023] Implementation of the paper: Explore In-Context Learning for 3D Point Cloud Understanding☆68Updated 3 months ago
- [NeurIPS 2022 Spotlight] P2P: Tuning Pre-trained Image Models for Point Cloud Analysis with Point-to-Pixel Prompting☆129Updated last year
- The offical implemention of JM3D.☆29Updated last year
- SAT: 2D Semantics Assisted Training for 3D Visual Grounding, ICCV 2021 (Oral)☆32Updated 3 years ago
- [NeurIPS'23] ConDaFormer: Disassembled Transformer with Local Structure Enhancement for 3D Point Cloud Understanding☆11Updated last year
- Code release for our NeurIPS 2023 paper "Uni3DETR: Unified 3D Detection Transformer", our ECCV 2024 paper "OV-Uni3DETR: Towards Unified O…☆95Updated 7 months ago
- [IJCAI 2022] Spatiality-guided Transformer for 3D Dense Captioning on Point Clouds (official pytorch implementation)☆20Updated 2 years ago
- [Preprint 2022] “Can We Solve 3D Vision Tasks Starting from A 2D Vision Transformer?” by Yi Wang, Zhiwen Fan, Tianlong Chen, Hehe Fan, Zh…☆61Updated 2 years ago
- [ECCV 2022] "Point Cloud Domain Adaptation via Masked Local 3D Structure Prediction" by Hanxue Liang, Hehe Fan, Zhiwen Fan, Yi Wang, Tian…☆18Updated last year
- code of [CVPR22] CodedVTR: Codebook-based Sparse Voxel Transformer with Geometric Guidance☆17Updated 2 years ago
- ☆51Updated last year