ywyue / FiT3DLinks
[ECCV 2024] Improving 2D Feature Representations by 3D-Aware Fine-Tuning
☆307Updated 3 weeks ago
Alternatives and similar repositories for FiT3D
Users that are interested in FiT3D are comparing it to the libraries listed below
Sorting:
- [CVPR 2024] Probing the 3D Awareness of Visual Foundation Models☆343Updated last month
- [ECCV 2024] Pytorch code for our ECCV'24 paper NeRF-MAE: Masked AutoEncoders for Self-Supervised 3D Representation Learning for Neural Ra…☆104Updated 9 months ago
- [CVPR 2025] Uni4D: Unifying Visual Foundation Models for 4D Modeling from a Single Video☆218Updated 7 months ago
- [NeurIPS'24] Large Spatial Model: End-to-end Unposed Images to Semantic 3D☆225Updated 2 months ago
- [CVPR2024] SANeRF-HQ: Segment Anything for NeRF in High Quality.☆50Updated last year
- Official implementation of “4D LangSplat: 4D Language Gaussian Splatting via Multimodal Large Language Models” (CVPR 2025)☆175Updated 3 months ago
- [NeurIPS 2025] Official code for JAFAR: Jack up Any Feature at Any Resolution☆214Updated last month
- [ICCV 2025 Highlight] Geo4D: Leveraging Video Generators for Geometric 4D Scene Reconstruction☆396Updated 7 months ago
- [CVPR2025] Feat2GS: Probing Visual Foundation Models with Gaussian Splatting☆228Updated 5 months ago
- [ICCV 2025] A simple training-free approach adapting DUSt3R for dynamic scenes.☆498Updated 9 months ago
- Official implementation of DepthLM☆283Updated last week
- [ECCV2024] [3DV Nectar 2025] FlashSplat: 2D to 3D Gaussian Splatting Segmentation Solved Optimally☆243Updated last year
- [ICCV 2025] Zero-Shot Monocular Depth Completion with Guided Diffusion☆227Updated 2 months ago
- Pytorch Code for "LEGaussians: Language Embedded 3D Gaussians for Open-Vocabulary Scene Understanding"☆157Updated last year
- [ICLR 2025] Official Implementation of M3: 3D-Spatial Multimodal Memory☆196Updated 8 months ago
- [ICCV'25 oral] Official Code for "LoftUp: Learning a Coordinate-Based Feature Upsampler for Vision Foundation Models"☆244Updated 2 months ago
- [ECCV 2024] Efficient Large-Baseline Radiance Fields, a feed-forward 2DGS model☆313Updated last year
- Code for the paper "pix2gestalt: Amodal Segmentation by Synthesizing Wholes" (CVPR 2024)☆192Updated 6 months ago
- Official implementation of "Depth Any Panoramas: A Foundation Model for Panoramic Depth Estimation".☆168Updated 2 weeks ago
- Code for the paper: "ODIN: A Single Model for 2D and 3D Segmentation" (CVPR 2024)☆177Updated 2 months ago
- 🎞️ [NeurIPS'24] MVSplat360: Feed-Forward 360 Scene Synthesis from Sparse Views☆302Updated last year
- [CVPR 2025 Highlight] Align3R: Aligned Monocular Depth Estimation for Dynamic Videos☆453Updated 9 months ago
- [NeurIPS 2025] LangSplatV2: High-dimensional 3D Language Gaussian Splatting with 450+ FPS☆177Updated 2 months ago
- ☆461Updated 4 months ago
- A simple state update rule to enhance length generalization for CUT3R☆557Updated 3 months ago
- [ICLR 2024] OpenSet 3D Neural Scene Segmentation with Pixel-wise Features and Rendered Novel Views☆143Updated 8 months ago
- IGGT: Instance-Grounded Geometry Transformer for Semantic 3D Reconstruction☆314Updated last month
- [CVPR2024 Oral] EscherNet: A Generative Model for Scalable View Synthesis☆362Updated last year
- Orient Anything, ICML 2025☆369Updated 2 months ago
- Stereo4D dataset and processing code☆283Updated 2 months ago