CamReasoner: Reinforcing Camera Movement Understanding via Structured Spatial Reasoning
☆28Feb 11, 2026Updated last month
Alternatives and similar repositories for CamReasoner
Users that are interested in CamReasoner are comparing it to the libraries listed below
Sorting:
- Agentic Keyframe Search for Video Question Answering☆16Apr 7, 2025Updated 11 months ago
- ECCV 2024 STMA & CVPR 2024 1st MOSE & 1st VOT Challenge & 1st LSVOS v6☆12Oct 16, 2024Updated last year
- An unofficial re-implementation of Graph Structure of Neural Networks (Jiaxuan You · Kaiming He · Jure Leskovec · Saining Xie) ICML 2020☆10Jul 27, 2020Updated 5 years ago
- Accelerate convolution neural network for face recognition using GPU☆13Nov 24, 2020Updated 5 years ago
- ☆16Apr 4, 2025Updated 11 months ago
- Code repo for KDD'22 paper : 'RES: A Robust Framework for Guiding Visual Explanation'☆32Aug 21, 2022Updated 3 years ago
- ☆13May 17, 2025Updated 10 months ago
- [IEEE VL/HCC'25]Frontend Diffusion is an end-to-end LLM-powered tool that generates high-quality websites from user sketches.☆19Oct 10, 2025Updated 5 months ago
- [AAAI-2024] Official Pytorch implementation of "ColNeRF: Collaboration for Generalizable Sparse Input Neural Radiance Field"☆18Jul 8, 2025Updated 8 months ago
- [ICCV-2025] Official Pytorch implementation of "AFUNet: Cross-Iterative Alignment-Fusion Synergy for HDR Reconstruction via a Deep Unfold…☆21Jul 8, 2025Updated 8 months ago
- [NeurIPS 2022] Unsupervised Multi-View Object Segmentation Using Radiance Field Propagation☆14Nov 9, 2022Updated 3 years ago
- Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon!☆11May 24, 2023Updated 2 years ago
- 这是关于软件工程课程设计的代码仓库,我们的项目将计划针对“海外藏中国文物”进行信息采集、关于及在线服务☆16May 24, 2023Updated 2 years ago
- ☆16Oct 4, 2024Updated last year
- ☆11Oct 2, 2024Updated last year
- [COLM 2025] Official code for "When To Solve, When To Verify: Compute-Optimal Problem Solving and Generative Verification for LLM Reasoni…☆15Oct 31, 2025Updated 4 months ago
- ☆15Jul 14, 2022Updated 3 years ago
- ☆49Jun 19, 2024Updated last year
- [EMNLP 2024 Main] Official repository of paper "SLANG: New Concept Comprehension of Large Language Models"☆14Oct 27, 2024Updated last year
- Code for paper: Unified Text-to-Image Generation and Retrieval☆16Jul 6, 2024Updated last year
- Code for paper "RapVerse: Coherent Vocals and Whole-Body Motions Generations from Text"☆18May 30, 2024Updated last year
- [NeurIPS 2025] The official code for "IllumiCraft: Unified Geometry and Illumination Diffusion for Controllable Video Generation"☆22Jun 5, 2025Updated 9 months ago
- PiLSL is a pairwise interaction learning-based graph neural network (GNN) model for prediction of synthetic lethality (SL) as anti-cancer…☆12Dec 4, 2024Updated last year
- [TIP-2025] Official Pytorch implementation of "Structural Similarity-Inspired Unfolding for Lightweight Image Super-Resolution"☆29Jul 8, 2025Updated 8 months ago
- [ICCV'25] Ross3D: Reconstructive Visual Instruction Tuning with 3D-Awareness☆67Jul 22, 2025Updated 7 months ago
- Repository for our paper "DeepEdit: Knowledge Editing as Decoding with Constraints". https://arxiv.org/abs/2401.10471☆21Jun 19, 2024Updated last year
- ☆14Jul 1, 2023Updated 2 years ago
- ☆20Jun 30, 2025Updated 8 months ago
- [TCSVT] state-of-the-art open vocabulary detector on COCO/LVIS/V3Det☆33Jun 3, 2025Updated 9 months ago
- [NeurIPS2023] Official implementation of the paper "Large Language Models are Visual Reasoning Coordinators"☆105Nov 9, 2023Updated 2 years ago
- install colmap in a docker☆12Apr 24, 2020Updated 5 years ago
- [AAAI 2026 Oral] SpatialActor: Exploring Disentangled Spatial Representations for Robust Robotic Manipulation☆61Jan 14, 2026Updated 2 months ago
- ShotBench: Expert-Level Cinematic Understanding in Vision-Language Models☆92Sep 12, 2025Updated 6 months ago
- [CVPR 2025] LumiNet: Latent Intrinsics Meets Diffusion Models for Indoor Scene Relighting☆36Sep 16, 2025Updated 6 months ago
- Code for Deep Learning GPU Benchmark: A Latency-Based Approach☆14Mar 21, 2025Updated last year
- [ICML 2024] Scale-Free Image Keypoints Using Differentiable Persistent Homology☆11May 5, 2024Updated last year
- ☆44Jul 9, 2025Updated 8 months ago
- Code and data release for the paper "Learning Object State Changes in Videos: An Open-World Perspective" (CVPR 2024)☆35Sep 9, 2024Updated last year
- [ECCV 2024] VISAGE: Video Instance Segmentation with Appearance-Guided Enhancement☆36Jul 29, 2024Updated last year