☆77Apr 9, 2026Updated 3 weeks ago
Alternatives and similar repositories for visual_jigsaw
Users that are interested in visual_jigsaw are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [CVPR 2026] FluxMem: Adaptive Hierarchical Memory for Streaming Video Understanding☆59Mar 16, 2026Updated last month
- (CVPR 2026) Long-RVOS: A Comprehensive Benchmark for Long-term Referring Video Object Segmentation☆35Feb 28, 2026Updated 2 months ago
- ☆16May 30, 2025Updated 11 months ago
- (NeurIPS 2023) Open-set visual object query search & localization in long-form videos☆26Feb 1, 2024Updated 2 years ago
- A local AI assistant running on your device. It turns your files into actionable memory.☆55Mar 24, 2026Updated last month
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- The code for paper 'Learning from Videos for 3D World: Enhancing MLLMs with 3D Vision Geometry Priors'☆231Nov 28, 2025Updated 5 months ago
- [CVPR2026] VideoAuto-R1: Video Auto Reasoning via Thinking Once, Answering Twice☆83Feb 27, 2026Updated 2 months ago
- ☆12Jan 10, 2025Updated last year
- Structuring Hour-Long Videos into Navigable Chapters and Hierarchical Summaries☆41Nov 19, 2025Updated 5 months ago
- THEORY OF SPACE: a benchmark for evaluating whether foundation models can actively explore under partial observability efficiently to bui…☆74Feb 27, 2026Updated 2 months ago
- ☆16Sep 29, 2024Updated last year
- [ICML 2026] a unified reinforcement learning toolbox for joint RL on language models and diffusion models☆80Mar 31, 2026Updated last month
- [ICLR 2026] Official repo for "Spotlight on Token Perception for Multimodal Reinforcement Learning"☆62Apr 3, 2026Updated last month
- ☆30Feb 27, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆22Sep 16, 2025Updated 7 months ago
- Code for "Are “Hierarchical” Visual Representations Hierarchical?" in NeurIPS Workshop for Symmetry and Geometry in Neural Representation…☆23Nov 8, 2023Updated 2 years ago
- ☆25Jun 5, 2025Updated 11 months ago
- Boosting Multi-view Stereo with Late Cost Aggregation☆13Jan 24, 2024Updated 2 years ago
- [ICLR'25] Reconstructive Visual Instruction Tuning☆134Apr 9, 2025Updated last year
- Toward Ambulatory Vision: Learning Visually-Grounded Active View Selection☆23Feb 5, 2026Updated 3 months ago
- [CVPR 2026] An official implementation of "Think Visually, Reason Textually: Vision-Language Synergy in ARC"☆41Nov 26, 2025Updated 5 months ago
- ☆16Dec 6, 2014Updated 11 years ago
- Awesome Vision-Language Compositionality, a comprehensive curation of research papers in literature.☆39Feb 13, 2025Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- OpenSeg-R: Improving Open-Vocabulary Segmentation via Step-by-Step Visual Reasoning☆29May 24, 2025Updated 11 months ago
- [CVPR 2026] Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens☆272Aug 2, 2025Updated 9 months ago
- 🔮 UniPixel: Unified Object Referring and Segmentation for Pixel-Level Visual Reasoning (NeurIPS 2025)☆241Jan 4, 2026Updated 4 months ago
- This repository provides the official implementation of QSVD, a method for efficient low-rank approximation that unifies Query-Key-Value …☆25Dec 1, 2025Updated 5 months ago
- [CVPR 2026] Thinking with Programming Vision: Towards a Unified View for Thinking with Images☆69Jan 23, 2026Updated 3 months ago
- The code implementation for TTCS: Test-Time Curriculum Synthesis for Self-Evolving.☆45Apr 22, 2026Updated 2 weeks ago
- [ICCV 2025] ACE-G is an architecture and pre-training scheme to improve generalization for scene coordinate regression-based visual reloc…☆93Feb 20, 2026Updated 2 months ago
- Ego-R1: Chain-of-Tool-Thought for Ultra-Long Egocentric Video Reasoning☆148Aug 21, 2025Updated 8 months ago
- Rendering SMPL using neural-mesh-render!!☆12Aug 6, 2020Updated 5 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- [ECCV 2024] Learning Video Context as Interleaved Multimodal Sequences☆44Mar 11, 2025Updated last year
- Long Context Transfer from Language to Vision☆403Mar 18, 2025Updated last year
- Code for "StreamingTalker: Audio-driven 3D Facial Animation with Autoregressive Diffusion Model", AAAI2026 Oral☆52Jan 16, 2026Updated 3 months ago
- Official Pytorch Implementation of 'BAM-DETR: Boundary-Aligned Moment Detection Transformer for Temporal Sentence Grounding in Videos'☆35Feb 26, 2025Updated last year
- [MMM‘24 Oral]CT-MVSNet: Efficient Multi-View Stereo with Cross-scale Transformer☆18Apr 18, 2024Updated 2 years ago
- v1: Learning to Point Visual Tokens for Multimodal Grounded Reasoning☆19Oct 6, 2025Updated 7 months ago
- Cambrian-S: Towards Spatial Supersensing in Video☆540Apr 3, 2026Updated last month