YueYin27 / refref
The official PyTorch implementation of RefRef: A Synthetic Dataset and Benchmark for Reconstructing Refractive and Reflective Objects
☆15Updated 3 months ago
Alternatives and similar repositories for refref
Users that are interested in refref are comparing it to the libraries listed below
- Official Implementation of Diffusion Step Annealing (DiSA) in Autoregressive Image Generation☆144Updated 7 months ago
- Official repo for ICCV25 Highlight Paper: "ObjectRelator: Enabling Cross-View Object Relation Understanding in Ego-Centric and Exo-Centric…☆53Updated 3 months ago
- Code for our paper: Learning Camera Movement Control from Real-World Drone Videos☆34Updated 8 months ago
- A Chrome/Edge extension to help you quickly scan through the flood of daily ArXiv papers.☆15Updated 9 months ago
- Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMs☆58Updated last week
- Training-free Guidance in Text-to-Video Generation via Multimodal Planning and Structured Noise Initialization☆24Updated 8 months ago
- [ICCV 2025 Workshop Outstanding Paper Award] VChain: Chain-of-Visual-Thought for Reasoning in Video Generation☆111Updated 3 months ago
- Program synthesis for 3D spatial reasoning☆54Updated 6 months ago
- Official Implementation of Paper Transfer between Modalities with MetaQueries☆285Updated 3 months ago
- [ICML 2025] Streamline Without Sacrifice - Squeeze out Computation Redundancy in LMM☆20Updated 7 months ago
- [CVPR 2025] GPS as a Control Signal for Image Generation☆24Updated 9 months ago
- [CVPR'24] GraphDreamer: a novel framework of generating compositional 3D scenes from scene graphs.☆195Updated last year
- [ECCV 2024] Viewpoint Textual Inversion: Discovering Scene Representations and 3D View Control in 2D Diffusion Models☆110Updated last year
- Diffusion Powers Video Tokenizer for Comprehension and Generation (CVPR 2025)☆86Updated 10 months ago
- For Ego4D VQ3D Task☆22Updated 2 years ago
- [ICCV'25] Ross3D: Reconstructive Visual Instruction Tuning with 3D-Awareness☆63Updated 5 months ago
- [NeurIPS 2025] VideoREPA: Learning Physics for Video Generation through Relational Alignment with Foundation Models☆151Updated this week
- Official PyTorch implementation of "A Unified Approach for Text- and Image-guided 4D Scene Generation", [CVPR 2024]☆93Updated last year
- [NeurIPS 2025] 3DRS: MLLMs Need 3D-Aware Representation Supervision for Scene Understanding☆137Updated last month
- Ego-R1: Chain-of-Tool-Thought for Ultra-Long Egocentric Video Reasoning☆134Updated 4 months ago
- A curated list of Egocentric Action Understanding resources☆37Updated last month
- "Comp4D: Compositional 4D Scene Generation", Dejia Xu*, Hanwen Liang*, Neel P. Bhatt, Hezhen Hu, Hanxue Liang, Konstantinos N. Platanioti…☆79Updated last year
- A comprehensive list of papers investigating physical cognition in video generation, including papers, codes, and related websites.☆232Updated 3 weeks ago
- ImageNet3D: Towards General-Purpose Object-Level 3D Understanding☆19Updated last year
- GenDoP: Auto-regressive Camera Trajectory Generation as a Director of Photography☆97Updated last week