penghao-wu / visual_jigsawView external linksLinks
☆68Nov 5, 2025Updated 3 months ago
Alternatives and similar repositories for visual_jigsaw
Users that are interested in visual_jigsaw are comparing it to the libraries listed below
Sorting:
- [ICML 2025] Streamline Without Sacrifice - Squeeze out Computation Redundancy in LMM☆20May 22, 2025Updated 8 months ago
- Repo for "Large Language Model Reasoning Failures"☆77Updated this week
- Empowering Small VLMs to Think with Dynamic Memorization and Exploration☆15Nov 18, 2025Updated 2 months ago
- ☆21Sep 16, 2025Updated 4 months ago
- The code implementation for TTCS: Test-Time Curriculum Synthesis for Self-Evolving.☆31Feb 6, 2026Updated last week
- LiteGPT: A 124M Small Language Model (SLM) pre-trained on FineWeb and fine-tuned on Alpaca.☆34Dec 16, 2025Updated last month
- Structuring Hour-Long Videos into Navigable Chapters and Hierarchical Summaries☆34Nov 19, 2025Updated 2 months ago
- ☆12Jan 10, 2025Updated last year
- This repository provides the official implementation of QSVD, a method for efficient low-rank approximation that unifies Query-Key-Value …☆24Dec 1, 2025Updated 2 months ago
- (NeurIPS 2023) Open-set visual object query search & localization in long-form videos☆26Feb 1, 2024Updated 2 years ago
- [CVPR 2025] Official implementation of the paper "Point-Cache: Test-time Dynamic and Hierarchical Cache for Robust and Generalizable Poin…☆16Dec 24, 2025Updated last month
- Official implementation of the paper: "ActiveVLN: Towards Active Exploration via Multi-Turn RL in Vision-and-Language Navigation"☆44Jan 19, 2026Updated 3 weeks ago
- ☆87Feb 3, 2026Updated last week
- OpenSeg-R: Improving Open-Vocabulary Segmentation via Step-by-Step Visual Reasoning☆27May 24, 2025Updated 8 months ago
- "FusionFactory: Fusing LLM Capabilities with Routing Data", Tao Feng, Haozhen Zhang, Zijie Lei, Pengrui Han, Mostofa Patwary, Mohammad Sh…☆19Dec 30, 2025Updated last month
- ☆64Feb 1, 2026Updated 2 weeks ago
- ☆15Sep 29, 2024Updated last year
- A local AI assistant running on your device. It turns your files into actionable memory.☆54Updated this week
- [ICLR 2026 🔥] Dr.LLM: Dynamic Layer Routing in LLMs☆41Oct 15, 2025Updated 4 months ago
- The code for paper 'Learning from Videos for 3D World: Enhancing MLLMs with 3D Vision Geometry Priors'☆197Nov 28, 2025Updated 2 months ago
- Training Transformers with knowledge localization (SGTM)☆48Jan 11, 2026Updated last month
- [NeurIPS'25] Time-R1: Post-Training Large Vision Language Model for Temporal Video Grounding☆73Dec 14, 2025Updated 2 months ago
- [CVPR 2024 Highlight] PLACE: Adaptive Layout-Semantic Fusion for Semantic Image Synthesis☆46Mar 5, 2024Updated last year
- Thinking with Programming Vision: Towards a Unified View for Thinking with Images☆54Jan 23, 2026Updated 3 weeks ago
- Dr. MAS is an end-to-end RL training framework for multi-agent LLM systems, supporting the co-training of multiple (heterogeneous) LLMs.☆60Updated this week
- Code for "Are “Hierarchical” Visual Representations Hierarchical?" in NeurIPS Workshop for Symmetry and Geometry in Neural Representation…☆22Nov 8, 2023Updated 2 years ago
- Scaling Agentic Environments Automatically.☆49Jan 22, 2026Updated 3 weeks ago
- [EMNLP 2025 Findings] 3D-Aware Vision-Language Models Fine-Tuning with Geometric Distillation☆30Jun 12, 2025Updated 8 months ago
- [NeurIPS'24] SpatialEval: a benchmark to evaluate spatial reasoning abilities of MLLMs and LLMs☆59Jan 23, 2025Updated last year
- NeurIPS2024-Papers-about-Autonomous-Driving☆20Nov 18, 2024Updated last year
- [NeurIPS '25 Spotlight] Official Pytorch implementation of "Vision Transformers Don't Need Trained Registers"☆172Sep 19, 2025Updated 4 months ago
- [ICLR26] Official implementation of Geometry Forcing: Marrying Video Diffusion and 3D Representation for Consistent World Modeling☆144Jan 26, 2026Updated 2 weeks ago
- Official Implementation of "Geometrically-Constrained Agent for Spatial Reasoning"☆49Dec 18, 2025Updated last month
- Official implementation of paper "Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models"☆65Jan 13, 2026Updated last month
- VideoAuto-R1: Video Auto Reasoning via Thinking Once, Answering Twice☆61Jan 9, 2026Updated last month
- Awesome Vision-Language Compositionality, a comprehensive curation of research papers in literature.☆34Feb 13, 2025Updated last year
- OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe☆141Dec 17, 2025Updated last month
- [KernelGYM & Dr. Kernel] A distributed GPU environment and a collection of RL training methods to support RL for Kernel Generations☆83Feb 6, 2026Updated last week
- [NeurIPS'24] Official PyTorch implementation for paper "Knowledge Composition using Task Vectors with Learned Anisotropic Scaling"☆27Feb 24, 2025Updated 11 months ago