From Blind Spots to Gains: Diagnostic-Driven Iterative Training for Large Multimodal Models
☆39Updated this week
Alternatives and similar repositories for DPE
Users that are interested in DPE are comparing it to the libraries listed below
Sorting:
- The first open-domain closed-loop revisited benchmark for evaluating memory consistency and action control in world models.☆41Feb 10, 2026Updated 2 weeks ago
- A Knowledge-grounded framework for Autonomous ML/AI Program Synthesis and Optimization☆74Feb 20, 2026Updated last week
- Official Implementation of "ToolSafe: Enhancing Tool Invocation Safety of LLM-based Agents via Proactive Step-level Guardrail and Feedbac…☆38Jan 23, 2026Updated last month
- On demand communication☆32Feb 12, 2026Updated 2 weeks ago
- Residual Context Diffusion (RCD): Repurposing discarded signals as structured priors for high-performance reasoning in dLLMs.☆54Feb 11, 2026Updated 2 weeks ago
- Official implementation of Log-linear Sparse Attention (LLSA).☆58Feb 2, 2026Updated 3 weeks ago
- sora2 free watermark remover☆767Feb 20, 2026Updated last week
- Multi-step AI agents powered by Gemini 2.0 and the LangGraph framework. These agents orchestrate complex workflows and enhance their reas…☆10Dec 19, 2024Updated last year
- ASTRA is an end-to-end system for synthesizing agentic trajectories and rule-verifiable environments for SFT and RL training, developed b…☆114Jan 30, 2026Updated last month
- Dr. MAS is an end-to-end RL training framework for multi-agent LLM systems, supporting the co-training of multiple (heterogeneous) LLMs.☆89Feb 11, 2026Updated 2 weeks ago
- Software to enable data-rich collaboration from high-resolution display walls to your laptop☆16Feb 19, 2026Updated last week
- Martingale posterior neural networks for fast sequential decision making @ Neurips 2025☆23Nov 13, 2025Updated 3 months ago
- ☆14Feb 13, 2026Updated 2 weeks ago
- AI-native knowledge kernel for human/agent collaboration. Use it as a Knowledge Base, Wiki, Annotator, Research Tool, or Agentic Memory.☆29Updated this week
- ☆92Dec 30, 2025Updated 2 months ago
- Auction Theory Toolbox – Computer Verified Auctions☆14Jul 12, 2016Updated 9 years ago
- MCP server for Grok AI API integration☆21Jun 2, 2025Updated 8 months ago
- ☆31Feb 3, 2026Updated 3 weeks ago
- Fast, free, easy, and object-agnostic video anonymization☆11Dec 12, 2020Updated 5 years ago
- ☆24Dec 19, 2025Updated 2 months ago
- ☆13Oct 21, 2024Updated last year
- The official implementation of COOPER: A Unified Model for Cooperative Perception and Reasoning in Spatial Intelligence.☆28Dec 30, 2025Updated 2 months ago
- [CVPR 2026] Official repo for "VideoSSR: Video Self-Supervised Reinforcement Learning"☆32Nov 11, 2025Updated 3 months ago
- Benchmark evaluating ocean forecasting systems against reference datasets and observations.☆26Updated this week
- Procedural terrain generation with diffusion models☆108Feb 2, 2026Updated 3 weeks ago
- Official code repo for the paper "MemGUI-Bench: Benchmarking Memory of Mobile GUI Agents in Dynamic Environments"☆22Feb 15, 2026Updated 2 weeks ago
- A UI designer for constructing AI applications with OpenSearch☆16Updated this week
- ☆19Dec 1, 2025Updated 3 months ago
- ☆35Feb 12, 2026Updated 2 weeks ago
- OpenCode port of Flow-Next: plan-first workflows, Ralph autonomous mode (overnight coding with fresh context), multi-model review gates v…☆34Jan 23, 2026Updated last month
- Microsoft Graph CLI - Mail, Calendar, OneDrive, To-Do, Contacts☆48Jan 26, 2026Updated last month
- The stl files and code for the V2 DexHand☆47May 26, 2025Updated 9 months ago
- [NeurIPS 2025@FoRLM] R1-Compress: Long Chain-of-Thought Compression via Chunk Compression and Search☆17Jan 24, 2026Updated last month
- ☆11May 6, 2025Updated 9 months ago
- ☆25Dec 14, 2025Updated 2 months ago
- A model context protocol implementation granting LLMs access to make database queries and learn about supabase types.☆14Dec 13, 2024Updated last year
- The open-source language model computer☆10Mar 22, 2024Updated last year
- Metadata browser of TREC☆10Feb 20, 2026Updated last week
- Plug-and-Play Benchmarking of Reinforcement Learning Algorithms for Large-Scale Flow Control☆36Updated this week