☆60Mar 5, 2026Updated 2 weeks ago
Alternatives and similar repositories for MIRBench
Users that are interested in MIRBench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆57Feb 2, 2026Updated last month
- ☆59Feb 12, 2026Updated last month
- Official repository of the paper "Exploring What Why and How: A Multifaceted Benchmark for Causation Understanding of Video Anomaly"☆83Dec 25, 2024Updated last year
- AntiRec is a cross-platform app that uses advanced audio processing to subtly alter microphone input, preventing ASR recognition while ke…☆185Aug 18, 2025Updated 7 months ago
- Self-use code examples for remote management of the vsphere platform using the pyvmomi library☆66Jan 7, 2025Updated last year
- Cloud API-based English speaking practice application☆61Dec 29, 2024Updated last year
- [CVPR 2024] Official repository of the paper "Uncovering What, Why and How: A Comprehensive Benchmark for Causation Understanding of Vid…☆88Dec 23, 2025Updated 3 months ago
- Simple and efficient -- a novel unsupervised community detection with the fusion of modularity and network structure☆104Dec 26, 2024Updated last year
- Official implement of MIA-DPO☆72Jan 23, 2025Updated last year
- Open foundation models, such LLama2, ChatGLM, etc.☆119Sep 18, 2024Updated last year
- StreamingBench: Assessing the Gap for MLLMs to Achieve Streaming Video Understanding☆150May 16, 2025Updated 10 months ago
- [Neurips2024] Source code for xRAG: Extreme Context Compression for Retrieval-augmented Generation with One Token☆173Jul 4, 2024Updated last year
- Official implementation of "Holmes-VAD: Towards Unbiased and Explainable Video Anomaly Detection via Multi-modal LLM"☆151Mar 22, 2025Updated last year
- Video Chain of Thought, Codes for ICML 2024 paper: "Video-of-Thought: Step-by-Step Video Reasoning from Perception to Cognition"☆181Feb 25, 2025Updated last year
- PG-Video-LLaVA: Pixel Grounding in Large Multimodal Video Models☆262Aug 5, 2025Updated 7 months ago
- The code for "TokenPacker: Efficient Visual Projector for Multimodal LLM", IJCV2025☆278May 26, 2025Updated 9 months ago
- DeepAudit:人人拥有的 AI 黑客战队,让漏洞挖掘触手可及。国内首个开源的代码漏洞挖掘多智能体系统。小 白一键部署运行,自主协作审计 + 自动化沙箱 PoC 验证。支持 Ollama 私有部署 ,一键生成报告。支持中转站。让安全不再昂贵,让审计不再复杂。☆5,368Mar 14, 2026Updated last week
- The code of our paper "InfLLM: Unveiling the Intrinsic Capacity of LLMs for Understanding Extremely Long Sequences with Training-Free Mem…☆396Apr 20, 2024Updated last year
- Awesome papers & datasets specifically focused on long-term videos.☆360Oct 9, 2025Updated 5 months ago
- [CVPR 2024 Highlight] OPERA: Alleviating Hallucination in Multi-Modal Large Language Models via Over-Trust Penalty and Retrospection-Allo…☆401Aug 24, 2024Updated last year
- R1-onevision, a visual language model capable of deep CoT reasoning.☆577Apr 13, 2025Updated 11 months ago
- MindSpore + 🤗Huggingface: Run any Transformers/Diffusers model on MindSpore with seamless compatibility and acceleration.☆914Mar 8, 2026Updated 2 weeks ago
- 📖 A curated list of resources dedicated to hallucination of multimodal large language models (MLLM).☆996Sep 27, 2025Updated 5 months ago
- This repository provides valuable reference for researchers in the field of multimodality, please start your exploratory travel in RL-bas…☆1,380Feb 26, 2026Updated 3 weeks ago
- [IJCV] Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation☆1,150Sep 13, 2025Updated 6 months ago
- A fork to add multimodal model training to open-r1☆1,507Feb 8, 2025Updated last year
- [ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the cap…☆1,499Aug 5, 2025Updated 7 months ago
- [ICCV 2025] LLaVA-CoT, a visual language model capable of spontaneous, systematic reasoning☆2,132Dec 12, 2025Updated 3 months ago
- Awesome-LLM-3D: a curated list of Multi-modal Large Language Model in 3D world Resources☆2,131Updated this week
- A list of awesome papers and resources of recommender system on large language model (LLM).☆2,240Mar 17, 2025Updated last year
- A collection of AWESOME things about Graph-Related LLMs.☆2,414Nov 5, 2025Updated 4 months ago
- 🔥🔥🔥 [IEEE TCSVT] Latest Papers, Codes and Datasets on Vid-LLMs.☆3,116Updated this week
- Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models☆3,152Jan 10, 2025Updated last year
- 从零开始内网渗透学习☆3,017Apr 8, 2016Updated 9 years ago
- 中文nlp解决方案(大模型、数据、模型、训练、推理)☆3,786Aug 5, 2025Updated 7 months ago
- 手把手带你实战 Huggingface Transformers 课程视频同步更新在B站与YouTube☆3,876Jul 15, 2024Updated last year
- text2vec, text to vector. 文本向量表征工具,把文本转化为向量矩阵,实现了Word2Vec、RankBM25、Sentence-BERT、CoSENT等文本表征、文本相似度计算模型,开箱即用。☆4,953Feb 14, 2026Updated last month
- Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.☆5,829Aug 29, 2025Updated 6 months ago
- 中国大模型☆6,417Nov 30, 2024Updated last year