☆64Jan 4, 2026Updated last month
Alternatives and similar repositories for MM-BrowseComp
Users who are interested in MM-BrowseComp are comparing it to the repositories listed below
- Official code of *Virgo: A Preliminary Exploration on Reproducing o1-like MLLM*☆20May 27, 2025Updated 9 months ago
- This repository has moved to: https://github.com/tkipf/c-swm☆27Jan 5, 2020Updated 6 years ago
- Research code for CVPR 2022 paper: "EMScore: Evaluating Video Captioning via Coarse-Grained and Fine-Grained Embedding Matching"☆26Oct 20, 2022Updated 3 years ago
- Martingale posterior neural networks for fast sequential decision making @ Neurips 2025☆23Nov 13, 2025Updated 3 months ago
- [ECCV 2024] Learning Video Context as Interleaved Multimodal Sequences☆43Mar 11, 2025Updated 11 months ago
- ☆213Dec 19, 2025Updated 2 months ago
- ☆24Dec 19, 2025Updated 2 months ago
- Auction Theory Toolbox – Computer Verified Auctions☆14Jul 12, 2016Updated 9 years ago
- Fast, free, easy, and object-agnostic video anonymization☆11Dec 12, 2020Updated 5 years ago
- Group-Group Loss Based Global-Regional Feature Learning for Vehicle Re-Identification☆12May 10, 2022Updated 3 years ago
- Benchmark evaluating ocean forecasting systems against reference datasets and observations.☆26Updated this week
- MMSearch-R1 is an end-to-end RL framework that enables LMMs to perform on-demand, multi-turn search with real-world multimodal search too…☆402Aug 26, 2025Updated 6 months ago
- The official implementation of COOPER: A Unified Model for Cooperative Perception and Reasoning in Spatial Intelligence.☆28Dec 30, 2025Updated 2 months ago
- ☆13Oct 21, 2024Updated last year
- This is the implementation of self-CIDEr and LSA-based diversity metrics (only for python 2.7).☆36Feb 26, 2022Updated 4 years ago
- Get aid from local LLMs right in your PowerShell☆15May 2, 2025Updated 9 months ago
- text-only training or language-free training for multimodal tasks (image/audio/video caption, retrieval, text2image)☆12Oct 15, 2024Updated last year
- ☆11May 6, 2025Updated 9 months ago
- Run GEPA on your favorite non-python libraries.☆33Jan 22, 2026Updated last month
- This is the official code repository for the paper: Towards General Continuous Memory for Vision-Language Models.☆20Jul 3, 2025Updated 7 months ago
- It shows an intelligent agent based on LangGraph for long form writing.☆12Mar 1, 2025Updated last year
- ☆39Oct 29, 2025Updated 4 months ago
- A UI designer for constructing AI applications with OpenSearch☆16Updated this week
- Struct-aware fuzzing framework + some fuzzers☆30Jan 28, 2026Updated last month
- The open-source language model computer☆10Mar 22, 2024Updated last year
- A holistic framework for advancing LLMs as data science agents☆33Feb 3, 2026Updated 3 weeks ago
- Microsoft Graph CLI - Mail, Calendar, OneDrive, To-Do, Contacts☆48Jan 26, 2026Updated last month
- ☆24Oct 3, 2025Updated 4 months ago
- UnicEdit-10M and UnicBench project☆23Feb 8, 2026Updated 3 weeks ago
- Open-source strong baseline for domain generalization re-ID. We will update the strong baseline and CFD method~☆10Nov 30, 2021Updated 4 years ago
- Domain-Adaptive Multibranch Networks☆14Nov 7, 2020Updated 5 years ago
- ☆12Nov 5, 2024Updated last year
- Demo app that shows how you can use D3.js with iOS in a UIWebView.☆10May 24, 2013Updated 12 years ago
- Official Repository for paper "Ontology-Free General-Domain Knowledge Graph-to-Text Generation Dataset Synthesis using Large Language Mod…☆14Nov 25, 2024Updated last year
- ☆19Dec 1, 2025Updated 3 months ago
- A model context protocol implementation granting LLMs access to make database queries and learn about supabase types.☆14Dec 13, 2024Updated last year
- A sample project for visionOS that showcases FindSurface's functionalities.☆13Dec 18, 2025Updated 2 months ago
- SEU Summer School project, based on Kotlin and Java.☆13Sep 15, 2023Updated 2 years ago
- Reinforcing Text-Rich Video Reasoning with Visual Rumination☆27Nov 24, 2025Updated 3 months ago