☆77Apr 9, 2026Updated 2 months ago
Alternatives and similar repositories for visual_jigsaw
Users that are interested in visual_jigsaw are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICML 2025] Streamline Without Sacrifice - Squeeze out Computation Redundancy in LMM☆20May 22, 2025Updated last year
- ☆16May 30, 2025Updated last year
- (NeurIPS 2023) Open-set visual object query search & localization in long-form videos☆26Feb 1, 2024Updated 2 years ago
- Official Repo of "Flow-OPD: On-Policy Distillation for Flow Matching Models"☆226Jun 7, 2026Updated last week
- A local AI assistant running on your device. It turns your files into actionable memory.☆55Mar 24, 2026Updated 2 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- [ICLR 2026] Empowering Small VLMs to Think with Dynamic Memorization and Exploration☆18Mar 18, 2026Updated 2 months ago
- [CVPR 2024 Highlight] PLACE: Adaptive Layout-Semantic Fusion for Semantic Image Synthesis☆45Mar 5, 2024Updated 2 years ago
- [ICLR 2026] Official repo for "FrameThinker: Learning to Think with Long Videos via Multi-Turn Frame Spotlighting"☆48Oct 9, 2025Updated 8 months ago
- ☆12Updated this week
- Structuring Hour-Long Videos into Navigable Chapters and Hierarchical Summaries☆42Nov 19, 2025Updated 6 months ago
- The code for paper 'Learning from Videos for 3D World: Enhancing MLLMs with 3D Vision Geometry Priors'☆239Nov 28, 2025Updated 6 months ago
- Security-native LLM system for AI-generated application security.☆263Jun 4, 2026Updated last week
- [CVPR2026] VideoAuto-R1: Video Auto Reasoning via Thinking Once, Answering Twice☆86Feb 27, 2026Updated 3 months ago
- [CVPR 2026] Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens☆284Aug 2, 2025Updated 10 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- THEORY OF SPACE: a benchmark for evaluating whether foundation models can actively explore under partial observability efficiently to bui…☆81Feb 27, 2026Updated 3 months ago
- ☆16Sep 29, 2024Updated last year
- ☆31Feb 27, 2025Updated last year
- ☆144May 21, 2026Updated 3 weeks ago
- [ICLR 2026] Official repo for "Spotlight on Token Perception for Multimodal Reinforcement Learning"☆68Apr 3, 2026Updated 2 months ago
- ☆22Sep 16, 2025Updated 9 months ago
- Code for "Are “Hierarchical” Visual Representations Hierarchical?" in NeurIPS Workshop for Symmetry and Geometry in Neural Representation…☆23Nov 8, 2023Updated 2 years ago
- ☆26Jun 5, 2025Updated last year
- [AAAI 2025] RRT-MVS: Recurrent Regularization Transformer for Multi-View Stereo☆18Nov 4, 2025Updated 7 months ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Boosting Multi-view Stereo with Late Cost Aggregation☆13Jan 24, 2024Updated 2 years ago
- [ICLR'25] Reconstructive Visual Instruction Tuning☆134Apr 9, 2025Updated last year
- Toward Ambulatory Vision: Learning Visually-Grounded Active View Selection☆24Feb 5, 2026Updated 4 months ago
- [CVPR 2025] Official implementation of the paper "Point-Cache: Test-time Dynamic and Hierarchical Cache for Robust and Generalizable Poin…☆19Mar 13, 2026Updated 3 months ago
- [ICLR26] Official implementation of Geometry Forcing: Marrying Video Diffusion and 3D Representation for Consistent World Modeling☆195Jan 26, 2026Updated 4 months ago
- [ICCV 2025] "Fine-grained Spatiotemporal Grounding on Egocentric Videos"☆24Nov 23, 2025Updated 6 months ago
- Awesome Vision-Language Compositionality, a comprehensive curation of research papers in literature.☆39Feb 13, 2025Updated last year
- OpenSeg-R: Improving Open-Vocabulary Segmentation via Step-by-Step Visual Reasoning☆29May 24, 2025Updated last year
- 🔮 UniPixel: Unified Object Referring and Segmentation for Pixel-Level Visual Reasoning (NeurIPS 2025)☆242Jan 4, 2026Updated 5 months ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Data preprocessing for IUPUI-CSRC Pedestrian Situated Intent (PSI) benchmark dataset.☆11Oct 5, 2023Updated 2 years ago
- The code implementation for TTCS: Test-Time Curriculum Synthesis for Self-Evolving.☆50Apr 22, 2026Updated last month
- [TPAMI 2026] Ego-R1: Agentic Chain-of-Tool-Thought for Ultra-Long Egocentric Video Reasoning☆157Updated this week
- Rendering SMPL using neural-mesh-render!!☆12Aug 6, 2020Updated 5 years ago
- Long Context Transfer from Language to Vision☆407Mar 18, 2025Updated last year
- [ECCV 2024] Learning Video Context as Interleaved Multimodal Sequences☆44Mar 11, 2025Updated last year
- Official Pytorch Implementation of 'BAM-DETR: Boundary-Aligned Moment Detection Transformer for Temporal Sentence Grounding in Videos'☆36Feb 26, 2025Updated last year