ywyue / FiT3DLinks
[ECCV 2024] Improving 2D Feature Representations by 3D-Aware Fine-Tuning
☆306Updated last week
Alternatives and similar repositories for FiT3D
Users that are interested in FiT3D are comparing it to the libraries listed below
Sorting:
- [CVPR 2025] Uni4D: Unifying Visual Foundation Models for 4D Modeling from a Single Video☆214Updated 7 months ago
- [ECCV 2024] Pytorch code for our ECCV'24 paper NeRF-MAE: Masked AutoEncoders for Self-Supervised 3D Representation Learning for Neural Ra…☆104Updated 9 months ago
- [CVPR 2024] Probing the 3D Awareness of Visual Foundation Models☆339Updated 3 weeks ago
- [CVPR2025] Feat2GS: Probing Visual Foundation Models with Gaussian Splatting☆225Updated 5 months ago
- [ICCV 2025 Highlight] Geo4D: Leveraging Video Generators for Geometric 4D Scene Reconstruction☆392Updated 6 months ago
- [ICCV 2025] Zero-Shot Monocular Depth Completion with Guided Diffusion☆226Updated last month
- Code for the paper "pix2gestalt: Amodal Segmentation by Synthesizing Wholes" (CVPR 2024)☆192Updated 5 months ago
- [CVPR2024] SANeRF-HQ: Segment Anything for NeRF in High Quality.☆50Updated last year
- [ICCV'25 oral] Official Code for "LoftUp: Learning a Coordinate-Based Feature Upsampler for Vision Foundation Models"☆243Updated last month
- Official implementation of DepthLM☆275Updated 2 months ago
- Official implementation of "Local All-Pair Correspondence for Point Tracking" (ECCV 2024)☆203Updated 8 months ago
- [NeurIPS'24] Large Spatial Model: End-to-end Unposed Images to Semantic 3D☆222Updated last month
- [NeurIPS 2025] Official code for JAFAR: Jack up Any Feature at Any Resolution☆211Updated last month
- IGGT: Instance-Grounded Geometry Transformer for Semantic 3D Reconstruction☆299Updated 3 weeks ago
- [ICCV 2025] A simple training-free approach adapting DUSt3R for dynamic scenes.☆495Updated 8 months ago
- 🎞️ [NeurIPS'24] MVSplat360: Feed-Forward 360 Scene Synthesis from Sparse Views☆300Updated last year
- Pytorch Code for "LEGaussians: Language Embedded 3D Gaussians for Open-Vocabulary Scene Understanding"☆155Updated last year
- [CVPR 2025 Highlight] Align3R: Aligned Monocular Depth Estimation for Dynamic Videos☆453Updated 8 months ago
- Official implementation of “4D LangSplat: 4D Language Gaussian Splatting via Multimodal Large Language Models” (CVPR 2025)☆169Updated 2 months ago
- A simple state update rule to enhance length generalization for CUT3R☆540Updated 2 months ago
- ChronoDepth: Learning Temporally Consistent Video Depth from Video Diffusion Priors☆275Updated 9 months ago
- Flash3D: Feed-Forward Generalisable 3D Scene Reconstruction from a Single Image☆291Updated 6 months ago
- [ICLR 2025] Official Implementation of M3: 3D-Spatial Multimodal Memory☆195Updated 7 months ago
- [ECCV 2024] Leveraging Synthetic Data for Real-Domain High-Resolution Monocular Metric Depth Estimation☆65Updated 11 months ago
- [NeurIPS 2025] Pixel-Perfect Depth☆695Updated this week
- [ECCV 2024] Efficient Large-Baseline Radiance Fields, a feed-forward 2DGS model☆313Updated last year
- Stereo4D dataset and processing code☆283Updated last month
- Code for the paper: "ODIN: A Single Model for 2D and 3D Segmentation" (CVPR 2024)☆176Updated last month
- [NeurIPS 2025] LangSplatV2: High-dimensional 3D Language Gaussian Splatting with 450+ FPS☆162Updated 2 months ago
- MEt3R: Measuring Multi-View Consistency in Generated Images☆145Updated 5 months ago