☆69 · Nov 5, 2025 · Updated 4 months ago
Alternatives and similar repositories for visual_jigsaw
Users who are interested in visual_jigsaw are comparing it to the repositories listed below
- [ICML 2025] Streamline Without Sacrifice - Squeeze out Computation Redundancy in LMM ☆20 · May 22, 2025 · Updated 9 months ago
- (CVPR 2026) Long-RVOS: A Comprehensive Benchmark for Long-term Referring Video Object Segmentation ☆27 · Feb 28, 2026 · Updated last week
- Empowering Small VLMs to Think with Dynamic Memorization and Exploration ☆15 · Nov 18, 2025 · Updated 3 months ago
- LiteGPT: A 124M Small Language Model (SLM) pre-trained on FineWeb and fine-tuned on Alpaca. ☆34 · Dec 16, 2025 · Updated 2 months ago
- ☆22 · Sep 16, 2025 · Updated 5 months ago
- The code implementation for TTCS: Test-Time Curriculum Synthesis for Self-Evolving. ☆35 · Feb 26, 2026 · Updated last week
- This repository provides the official implementation of QSVD, a method for efficient low-rank approximation that unifies Query-Key-Value … ☆25 · Dec 1, 2025 · Updated 3 months ago
- Toward Ambulatory Vision: Learning Visually-Grounded Active View Selection ☆22 · Feb 5, 2026 · Updated last month
- Structuring Hour-Long Videos into Navigable Chapters and Hierarchical Summaries ☆34 · Nov 19, 2025 · Updated 3 months ago
- ☆12 · Jan 10, 2025 · Updated last year
- (NeurIPS 2023) Open-set visual object query search & localization in long-form videos ☆26 · Feb 1, 2024 · Updated 2 years ago
- ☆44 · Feb 13, 2026 · Updated 3 weeks ago
- [CVPR 2025] Official implementation of the paper "Point-Cache: Test-time Dynamic and Hierarchical Cache for Robust and Generalizable Poin… ☆16 · Dec 24, 2025 · Updated 2 months ago
- Official implementation of the paper: "ActiveVLN: Towards Active Exploration via Multi-Turn RL in Vision-and-Language Navigation" ☆47 · Feb 11, 2026 · Updated 3 weeks ago
- "FusionFactory: Fusing LLM Capabilities with Routing Data", Tao Feng, Haozhen Zhang, Zijie Lei, Pengrui Han, Mostofa Patwary, Mohammad Sh… ☆19 · Dec 30, 2025 · Updated 2 months ago
- OpenSeg-R: Improving Open-Vocabulary Segmentation via Step-by-Step Visual Reasoning ☆27 · May 24, 2025 · Updated 9 months ago
- ☆65 · Feb 1, 2026 · Updated last month
- THEORY OF SPACE: a benchmark for evaluating whether foundation models can actively explore under partial observability efficiently to bui… ☆40 · Feb 27, 2026 · Updated last week
- A local AI assistant running on your device. It turns your files into actionable memory. ☆54 · Feb 15, 2026 · Updated 2 weeks ago
- [ICLR 2026 🔥] Dr.LLM: Dynamic Layer Routing in LLMs ☆41 · Oct 15, 2025 · Updated 4 months ago
- The code for paper 'Learning from Videos for 3D World: Enhancing MLLMs with 3D Vision Geometry Priors' ☆204 · Nov 28, 2025 · Updated 3 months ago
- ☆138 · Feb 13, 2026 · Updated 3 weeks ago
- Training Transformers with knowledge localization (SGTM) ☆48 · Jan 11, 2026 · Updated last month
- [CVPR 2024 Highlight] PLACE: Adaptive Layout-Semantic Fusion for Semantic Image Synthesis ☆46 · Mar 5, 2024 · Updated 2 years ago
- Code for "Are “Hierarchical” Visual Representations Hierarchical?" in NeurIPS Workshop for Symmetry and Geometry in Neural Representation… ☆22 · Nov 8, 2023 · Updated 2 years ago
- [NeurIPS'25] Time-R1: Post-Training Large Vision Language Model for Temporal Video Grounding ☆79 · Dec 14, 2025 · Updated 2 months ago
- Repo for "Large Language Model Reasoning Failures" ☆154 · Feb 17, 2026 · Updated 2 weeks ago
- [EMNLP 2025 Findings] 3D-Aware Vision-Language Models Fine-Tuning with Geometric Distillation ☆31 · Jun 12, 2025 · Updated 8 months ago
- [CVPR 2026] Thinking with Programming Vision: Towards a Unified View for Thinking with Images ☆56 · Jan 23, 2026 · Updated last month
- Scaling Agentic Environments Automatically. ☆54 · Jan 22, 2026 · Updated last month
- [NeurIPS'24] SpatialEval: a benchmark to evaluate spatial reasoning abilities of MLLMs and LLMs ☆59 · Jan 23, 2025 · Updated last year
- [NeurIPS '25 Spotlight] Official Pytorch implementation of "Vision Transformers Don't Need Trained Registers" ☆172 · Sep 19, 2025 · Updated 5 months ago
- NeurIPS2024-Papers-about-Autonomous-Driving ☆19 · Nov 18, 2024 · Updated last year
- Official implementation of paper "Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models" ☆68 · Jan 13, 2026 · Updated last month
- [CVPR 2026] OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe ☆145 · Feb 23, 2026 · Updated last week
- [NeurIPS'24] Official PyTorch implementation for paper "Knowledge Composition using Task Vectors with Learned Anisotropic Scaling" ☆27 · Feb 24, 2025 · Updated last year
- PeRL: Parameter-Efficient Reinforcement Learning ☆71 · Feb 23, 2026 · Updated last week
- Awesome Vision-Language Compositionality, a comprehensive curation of research papers in literature. ☆35 · Feb 13, 2025 · Updated last year
- a unified reinforcement learning toolbox for joint RL on language models and diffusion models ☆75 · Feb 7, 2026 · Updated last month