Official codebase for the paper "Reasoning Within the Mind: Dynamic Multimodal Interleaving in Latent Space"
☆74 · Mar 20, 2026 · Updated 3 weeks ago
Alternatives and similar repositories for DMLR
Users that are interested in DMLR are comparing it to the libraries listed below.
- Official repository for “Reasoning in the Dark: Interleaved Vision-Text Reasoning in Latent Space” ☆18 · Jan 27, 2026 · Updated 2 months ago
- Official Repository of LatentSeek ☆80 · Jun 6, 2025 · Updated 10 months ago
- [ICLR 2026] Thinking on the Fly: Test-Time Reasoning Enhancement via Latent Thought Policy Optimization ☆24 · Mar 6, 2026 · Updated last month
- ViLoMem: Agentic Learner with Grow-and-Refine Multimodal Semantic Memory ☆59 · Nov 27, 2025 · Updated 4 months ago
- [ICLR 2026] SwiReasoning: Switch-Thinking in Latent and Explicit for Pareto-Superior Reasoning LLMs ☆56 · Oct 14, 2025 · Updated 5 months ago
- [CVPR 2026] Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens ☆263 · Aug 2, 2025 · Updated 8 months ago
- Chain-of-Frames [CVPR 2026] ☆38 · Jul 2, 2025 · Updated 9 months ago
- [arxiv: 2512.19673] Bottom-up Policy Optimization: Your Language Model Policy Secretly Contains Internal Policies ☆60 · Feb 6, 2026 · Updated 2 months ago
- PyTorch implementation of paper "Sparse Parameterization for Epitomic Dataset Distillation" in NeurIPS 2023. ☆20 · Jun 28, 2024 · Updated last year
- Official Implementation of "Geometrically-Constrained Agent for Spatial Reasoning" ☆74 · Updated this week
- This is the official repository for the code and datasets in the paper "Progressive Open Space Expansion for Open-Set Model Attribution",… ☆25 · Oct 22, 2023 · Updated 2 years ago
- Academic Page ☆14 · Apr 6, 2026 · Updated last week
- Unifying Specialized Visual Encoders for Video Language Models ☆25 · Nov 22, 2025 · Updated 4 months ago
- [NeurIPS 2025] Think Silently, Think Fast: Dynamic Latent Compression of LLM Reasoning Chains ☆89 · Mar 27, 2026 · Updated 2 weeks ago
- [ECCV 2022] A pytorch implementation for TS2-Net: Token Shift and Selection Transformer for Text-Video Retrieval ☆79 · Nov 29, 2022 · Updated 3 years ago
- ☆24 · Jul 8, 2023 · Updated 2 years ago
- A Claude Code hook plugin for IP-based access control · Claude ban prevention · Claude IP detection · IP geolocation blocking · Claude account protection ☆64 · Apr 1, 2026 · Updated last week
- VAE+GAN ☆10 · Apr 18, 2018 · Updated 7 years ago
- Official implementation of Leveraging Visual Tokens for Extended Text Contexts in Multi-Modal Learning ☆28 · Oct 30, 2024 · Updated last year
- Source code for SWIFT, an efficient reward model. ☆20 · Jan 13, 2026 · Updated 3 months ago
- [NeurIPS 2025] More Thinking, Less Seeing? Assessing Amplified Hallucination in Multimodal Reasoning Models ☆76 · May 31, 2025 · Updated 10 months ago
- CVPR2025-Multi-party Collaborative Attention Control for Image Customization ☆16 · May 14, 2025 · Updated 10 months ago
- An exploration of LLM steering ☆26 · Jun 15, 2024 · Updated last year
- Data pre-processing and training code on Open-X-Embodiment with pytorch ☆11 · Jan 20, 2025 · Updated last year
- This is the official repository for paper: cross-modal information flow in multimodal large language models ☆43 · May 21, 2025 · Updated 10 months ago
- ☆23 · Feb 3, 2026 · Updated 2 months ago
- Official repo of "Chain-of-Visual-Thought: Teaching VLMs to See and Think Better with Continuous Visual Tokens" ☆341 · Jan 6, 2026 · Updated 3 months ago
- Official PyTorch implementation of our CVPR 2025 paper, "LoRA Subtraction for Drift-Resistant Space in Exemplar-Free Continual Learning." ☆17 · Mar 28, 2025 · Updated last year
- [arXiv 2024] FairVision: Equitable Deep Learning for Eye Disease Screening via Fair Identity Scaling ☆14 · Mar 11, 2025 · Updated last year
- Image captioning with Transformer ☆14 · Oct 11, 2021 · Updated 4 years ago
- Code for the paper "Data Attribution for Text-to-Image Models by Unlearning Synthesized Images." ☆17 · May 23, 2025 · Updated 10 months ago
- Harbin Institute of Technology (Shenzhen) 2021 autumn semester deep learning architecture lab ☆17 · Oct 1, 2022 · Updated 3 years ago
- 🍨 Gelato: From Data Curation to Reinforcement Learning: Building a Strong Grounding Model for Computer-Use Agents ☆43 · Dec 22, 2025 · Updated 3 months ago
- The official implementation of "Accelerating Multimodal Large Language Models via Dynamic Visual-Token Exit and the Empirical Findings" ☆18 · Dec 5, 2024 · Updated last year
- Code for Language-Inspired Relation Transfer for Few-Shot Class-Incremental Learning in IEEE TPAMI ☆15 · Apr 18, 2025 · Updated 11 months ago
- The source code for "LaSe-E2V: Towards Language-guided Semantic-Aware Event-to-Video Reconstruction" ☆10 · Jul 5, 2024 · Updated last year
- Official implementation of "PyVision-RL: Forging Open Agentic Vision Models via RL." ☆89 · Feb 25, 2026 · Updated last month
- Pytorch Code for "Unified Coarse-to-Fine Alignment for Video-Text Retrieval" (ICCV 2023) ☆66 · Jun 7, 2024 · Updated last year
- ☆76 · Jul 28, 2025 · Updated 8 months ago