☆31Nov 17, 2024Updated last year
Alternatives and similar repositories for MMRel
Users that are interested in MMRel are comparing it to the libraries listed below
- Official PyTorch implementation and source code for Weakly Supervised Video Scene Graph Generation via Natural Language Supervision, accepted…☆23Jun 13, 2025Updated 8 months ago
- [NeurIPS 2023] Rewrite Caption Semantics: Bridging Semantic Gaps for Language-Supervised Semantic Segmentation☆20Jan 3, 2024Updated 2 years ago
- [NeurIPS 2024] Mitigating Object Hallucination via Concentric Causal Attention☆66Aug 30, 2025Updated 6 months ago
- 🚀 😂 spring cloud alibaba project☆22Dec 19, 2023Updated 2 years ago
- A-share historical market review☆24Jun 29, 2023Updated 2 years ago
- ☆18Apr 20, 2025Updated 10 months ago
- We introduce a new approach, Token Reduction using CLIP Metric (TRIM), aimed at improving the efficiency of MLLMs without sacrificing their…☆20Jan 11, 2026Updated last month
- Visualize attention maps in Diffusion Models☆22Mar 10, 2025Updated 11 months ago
- VideoHallucer, the first comprehensive benchmark for hallucination detection in large video-language models (LVLMs)☆42Dec 16, 2025Updated 2 months ago
- NeuSyRE: A Neuro-Symbolic Visual Understanding and Reasoning Framework based on Scene Graph Enrichment☆22Mar 10, 2024Updated last year
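- A lightweight business middle-platform development framework: a complete implementation of middle-platform design that empowers the business.☆14Feb 25, 2023Updated 3 years ago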
- A simple RPC framework built with netty + zookeeper ✨☆61Jun 17, 2024Updated last year
- Unofficial implementation for SOLOv2 instance segmentation☆15Jun 13, 2020Updated 5 years ago
- Vision Relation Transformer for Unbiased Scene Graph Generation (ICCV 2023)☆22Sep 27, 2023Updated 2 years ago
- Using different CNN models to train on GTZAN Dataset☆40Nov 14, 2023Updated 2 years ago
- Project for Polkadot Hackathon☆37Apr 2, 2022Updated 3 years ago
- [ICML2024] Repo for the paper 'Evaluating and Analyzing Relationship Hallucinations in Large Vision-Language Models'☆22Jan 1, 2025Updated last year
- Learning from Noisy Anchors for One-stage Object Detection☆27Apr 14, 2021Updated 4 years ago
- [EMNLP 2023] TESTA: Temporal-Spatial Token Aggregation for Long-form Video-Language Understanding☆49Jan 9, 2024Updated 2 years ago
- [MM'24 Oral] Prior Knowledge Integration via LLM Encoding and Pseudo Event Regulation for Video Moment Retrieval☆130Aug 23, 2024Updated last year
- [CVPR2025] Code Release of F-LMM: Grounding Frozen Large Multimodal Models☆108May 29, 2025Updated 9 months ago
- [ECCV'24 Oral] Anytime Continual Learning for Open Vocabulary Classification☆24Oct 17, 2024Updated last year
- FreeVA: Offline MLLM as Training-Free Video Assistant☆69Jun 9, 2024Updated last year
- 👾 E.T. Bench: Towards Open-Ended Event-Level Video-Language Understanding (NeurIPS 2024)☆74Jan 20, 2025Updated last year
- [CVPR-2023] The official dataset of Advancing Visual Grounding with Scene Knowledge: Benchmark and Method.☆33Jul 12, 2023Updated 2 years ago
- A scalable short link generation service to improve marketing efforts☆21Apr 11, 2024Updated last year
- [NeurIPS 2024] Official Repository of Multi-Object Hallucination in Vision-Language Models☆34Nov 13, 2024Updated last year
- Robust estimations from distribution structures: III. Non-asymptotic☆25Feb 10, 2024Updated 2 years ago
- ☆32Jul 29, 2024Updated last year
- A curated list of Egocentric Action Understanding resources☆46Nov 26, 2025Updated 3 months ago
- A zk-SNARK implementation☆50Dec 18, 2022Updated 3 years ago
- Using reference images to control style in text-to-image diffusion models. Based on CSD and IP Adapter☆54Mar 24, 2025Updated 11 months ago
- ☆32Jul 10, 2024Updated last year
- ☆18Oct 19, 2024Updated last year
- ☆70Mar 10, 2025Updated 11 months ago
- php tool functions☆49Feb 6, 2022Updated 4 years ago
- This repository contains the core methods and models described in the paper “Represent Code as Action Sequence for Predicting Next Method…☆55Sep 15, 2024Updated last year
- Official implementation of TagAlign☆35Dec 11, 2024Updated last year
- 待遇 task executor: a simple task executor☆26Mar 26, 2025Updated 11 months ago