Code for "Recognizing Scenes from Novel Viewpoints"
☆29Sep 16, 2022Updated 3 years ago
Alternatives and similar repositories for viewseg
Users that are interested in viewseg are comparing it to the libraries listed below
Sorting:
- [ICCV 2021 - Oral] Bootstrap Your Own Correspondences☆41Dec 10, 2021Updated 4 years ago
- A Unified Framework for Transforming between Text, Point Cloud, and Program☆19Jul 3, 2025Updated 8 months ago
- [CVPR 2022] Understanding 3D Object Articulation in Internet Videos☆33Mar 7, 2024Updated last year
- Code for CVPR 2022 paper "Comparing Correspondences: Video Prediction with Correspondence-wise Losses"☆25Jul 1, 2022Updated 3 years ago
- Repository for the paper "Data Efficient Masked Language Modeling for Vision and Language".☆18Sep 17, 2021Updated 4 years ago
- Localize to Binauralize: Audio Spatialization from Visual Sound Source Localization (ICCV 2021)☆10Oct 11, 2021Updated 4 years ago
- MessyTable: Instance Association in Multiple Camera Views☆38Dec 30, 2025Updated 2 months ago
- The implemention of Video Depth Estimation by Fusing Flow-to-Depth Proposals☆35Feb 21, 2021Updated 5 years ago
- Solution of Kaggle competition: Feedback Prize - Evaluating Student Writing☆16Mar 30, 2022Updated 3 years ago
- Website and Code for Directed Ray Distance Functions for 3D Scene Reconstruction☆38Sep 13, 2023Updated 2 years ago
- ☆17Apr 7, 2022Updated 3 years ago
- Code for Novel View Acoustic Synthesis paper☆51Aug 14, 2023Updated 2 years ago
- Video Autoencoder: self-supervised disentanglement of 3D structure and motion (ICCV 2021). Website: https://zlai0.github.io/VideoAutoenco…☆182Oct 19, 2021Updated 4 years ago
- Code to generate datasets used in "How Useful is Self-Supervised Pretraining for Visual Tasks?"☆22Apr 13, 2020Updated 5 years ago
- Code for recreating the HoS benchmark of VISOR☆22Jul 2, 2023Updated 2 years ago
- Command-line tool for downloading and extending the RedCaps dataset.☆50Dec 18, 2023Updated 2 years ago
- GIFS: Neural Implicit Function for General Shape Representation, CVPR 2022☆77Jul 13, 2024Updated last year
- ☆22Mar 20, 2024Updated last year
- [FGVC9-CVPR 2022] The second place solution for 2nd eBay eProduct Visual Search Challenge.☆26Aug 9, 2022Updated 3 years ago
- [ECCV 2022] PlaneFormers: From Sparse View Planes to 3D Reconstruction☆86Oct 24, 2022Updated 3 years ago
- HInt dataset from HaMeR: Reconstructing Hands in 3D with Transformers☆52May 1, 2024Updated last year
- Self-supervised Correspondence Estimation via Multiview Registration☆58Jan 23, 2024Updated 2 years ago
- [CVPR 2023] Learning Visual Representations via Language-Guided Sampling☆149Apr 13, 2023Updated 2 years ago
- Visualizing representations with diffusion based conditional generative model.☆107May 3, 2023Updated 2 years ago
- Code for paper Background Prompting for Improved Object Depth☆29Sep 7, 2023Updated 2 years ago
- Code for the Paper: [ECCV2022] Sound Localization by Self-Supervised Time-Delay Estimation☆25Mar 15, 2023Updated 2 years ago
- The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w…☆12Jun 28, 2025Updated 8 months ago
- Pytorch implementation of "TokenCut: Segmenting Objects in Images and Videos with Self-supervised Transformer and Normalized Cut"☆64Jan 20, 2023Updated 3 years ago
- Website for TextVQA dataset.☆28Apr 30, 2023Updated 2 years ago
- 🔥 [ICLR 2025] Official Benchmark Toolkits for "Visual Haystacks: A Vision-Centric Needle-In-A-Haystack Benchmark"☆39Nov 21, 2025Updated 3 months ago
- ☆35Nov 10, 2021Updated 4 years ago
- [ECCV 2022] AMixer: Adaptive Weight Mixing for Self-attention Free Vision Transformers☆29Nov 14, 2022Updated 3 years ago
- [CVPR 2024] 3DFIRES: Few Image 3D REconstruction for Scenes with Hidden Surfaces☆26Mar 28, 2024Updated last year
- [NeurIPS 2022 Spotlight] Improving 3D-aware Image Synthesis with A Geometry-aware Discriminator☆30Oct 3, 2022Updated 3 years ago
- Teach-DETR: Better Training DETR with Teachers☆31Mar 18, 2024Updated last year
- This code provides a PyTorch implementation for OTTER (Optimal Transport distillation for Efficient zero-shot Recognition), as described …☆71Dec 20, 2021Updated 4 years ago
- Project and dataset webpage:☆286Oct 12, 2023Updated 2 years ago
- Lightplane implements a highly memory-efficient differentiable radiance field renderer, and a module for unprojecting features from image…☆285Aug 6, 2024Updated last year
- [arXiv:2309.16669] Code release for "Training a Large Video Model on a Single Machine in a Day"☆138Aug 23, 2025Updated 6 months ago