Ingrid725 / Loss-function-summary
☆34Updated 3 years ago
Related projects ⓘ
Alternatives and complementary repositories for Loss-function-summary
- Code for "Visual Spatial Description: Controlled Spatial-Oriented Image-to-Text Generation"☆26Updated 8 months ago
- [Preprint 2022] “Can We Solve 3D Vision Tasks Starting from A 2D Vision Transformer?” by Yi Wang, Zhiwen Fan, Tianlong Chen, Hehe Fan, Zh…☆61Updated last year
- [NeurIPS2023] Implementation of the paper: Explore In-Context Learning for 3D Point Cloud Understanding☆64Updated 6 months ago
- (AAAI2024) Point-PEFT: Parameter-Efficient Fine-Tuning for 3D Pre-trained Models☆46Updated 5 months ago
- Official implementation for [3DV 2024] `Pix4Point: Image Pretrained Standard Transformers for 3D Point Cloud Understanding`☆44Updated 4 months ago
- [CVPR 2024] The official implementation of paper "Sculpting Holistic 3D Representation in Contrastive Language-Image-3D Pre-training"☆27Updated 6 months ago
- ☆52Updated 11 months ago
- Code for "Chat-3D: Data-efficiently Tuning Large Language Model for Universal Dialogue of 3D Scenes"☆51Updated 7 months ago
- The offical implemention of JM3D.☆27Updated last year
- [CVPR 2024] Visual Programming for Zero-shot Open-Vocabulary 3D Visual Grounding☆42Updated 3 months ago
- Code Release of "3D Concept Grounding on Neural Fields (NeurIPS2022)"☆15Updated last year
- Official implementation of "Can Language Understand Depth?"☆76Updated 2 years ago
- ☆17Updated last month
- (CVPR 2023) MarS3D: A Plug-and-Play Motion-Aware Model for Semantic Segmentation on Multi-Scan 3D Point Clouds☆63Updated last year
- Moved to https://github.com/NUS-HPC-AI-Lab/InfoBatch☆6Updated 9 months ago
- Code for IterInpaint model, presented in Diagnostic Benchmark and Iterative Inpainting for Layout-Guided Image Generation (CVPR 2024 work…☆23Updated 3 months ago
- CVPR 2021 Oral https://arxiv.org/abs/2104.02243☆47Updated last year
- Repository of our paper 'Refer-it-in-RGBD' in CVPR 2021☆39Updated 5 months ago
- ETHSeg: An Amodel Instance Segmentation Network and a Real-world Dataset for X-Ray Waste Inspection (CVPR2022)☆14Updated last year
- [WACV'25] Calib3D: Calibrating Model Preferences for Reliable 3D Scene Understanding☆44Updated 7 months ago
- Voint Cloud: Multi-View Point Cloud Representation for 3D Understanding (ICLR 2023)☆21Updated last year
- [AAAI 2024] The official implementation of the paper "3D-STMN: Dependency-Driven Superpoint-Text Matching Network for End-to-End 3D Refer…☆35Updated 10 months ago
- [AAAI 2024-Oral] EPCL: Frozen CLIP Transformer is An Efficient Point Cloud Encoder☆29Updated 7 months ago
- (ECCV 2022) DODA: Data-oriented Sim-to-Real Domain Adaptation for 3D Semantic Segmentation☆46Updated 2 years ago
- ☆20Updated 2 years ago
- [ECCV 2022] AMixer: Adaptive Weight Mixing for Self-attention Free Vision Transformers☆28Updated 2 years ago
- a thin wrapper of chatgpt for improving paper writing.☆253Updated last year
- [ECCV24] Navigation Instruction Generation with BEV Perception and Large Language Models☆22Updated 3 months ago
- [ICLR 2023] Autoencoders as Cross-Modal Teachers: Can Pretrained 2D Image Transformers Help 3D Representation Learning?☆99Updated 4 months ago
- [ECCV2022] D3Net: A Unified Speaker-Listener Architecture for 3D Dense Captioning and Visual Grounding☆40Updated 2 years ago