☆31Nov 17, 2024Updated last year
Alternatives and similar repositories for MMRel
Users that are interested in MMRel are comparing it to the libraries listed below.
- [NeurIPS 2023] Rewrite Caption Semantics: Bridging Semantic Gaps for Language-Supervised Semantic Segmentation☆20Jan 3, 2024Updated 2 years ago
- Official PyTorch implementation and source code for Weakly Supervised Video Scene Graph Generation via Natural Language Supervision, accepted…☆24Jun 13, 2025Updated 10 months ago
- [NeurIPS 2024] Mitigating Object Hallucination via Concentric Causal Attention☆66Aug 30, 2025Updated 8 months ago
- ☆12Apr 18, 2025Updated last year
- We introduce a new approach, Token Reduction using CLIP Metric (TRIM), aimed at improving the efficiency of MLLMs without sacrificing their…☆22Jan 11, 2026Updated 3 months ago
- This repo holds the official code and data for "Beyond Literal Descriptions: Understanding and Locating Open-World Objects Aligned with H…☆16May 21, 2024Updated last year
- 👾 E.T. Bench: Towards Open-Ended Event-Level Video-Language Understanding (NeurIPS 2024)☆74Jan 20, 2025Updated last year
- ☆33Jul 10, 2024Updated last year
- Historical review of the A-share market☆24Jun 29, 2023Updated 2 years ago
- [CVPR-2023] The official dataset of Advancing Visual Grounding with Scene Knowledge: Benchmark and Method.☆34Jul 12, 2023Updated 2 years ago
- A simple RPC framework implemented with netty + zookeeper ✨☆61Jun 17, 2024Updated last year
- VideoHallucer, The first comprehensive benchmark for hallucination detection in large video-language models (LVLMs)☆43Dec 16, 2025Updated 4 months ago
- ☆14Mar 3, 2025Updated last year
- ☆10Sep 5, 2024Updated last year
- Robust estimations from distribution structures: III. Non-asymptotic☆25Feb 10, 2024Updated 2 years ago
- Official code for the paper, "TaCA: Upgrading Your Visual Foundation Model with Task-agnostic Compatible Adapter".☆16Jun 20, 2023Updated 2 years ago
- [MM'24 Oral] Prior Knowledge Integration via LLM Encoding and Pseudo Event Regulation for Video Moment Retrieval☆130Aug 23, 2024Updated last year
- a scalable short link generation service to improve marketing efforts☆21Apr 11, 2024Updated 2 years ago
- Awesome papers on multi-modal LLMs with grounding ability☆20Oct 11, 2025Updated 6 months ago
- ☆18Oct 19, 2024Updated last year
- ☆12Jan 10, 2025Updated last year
- Code for Learned Thresholds Token Merging and Pruning for Vision Transformers (LTMP). A technique to reduce the size of Vision Transforme…☆17Nov 24, 2024Updated last year
- An official repo for WACV 2025 paper "LLaVA-SpaceSGG: Visual Instruct Tuning for Open-vocabulary Scene Graph Generation with Enhanced Spa…☆29Jan 27, 2025Updated last year
- Official Implementation (Pytorch) of the "Representation Shift: Unifying Token Compression with FlashAttention", ICCV 2025☆34Feb 22, 2026Updated 2 months ago
- [NeurIPS 2023] Conformal Prediction for Uncertainty-Aware Planning with Diffusion Dynamics Model☆20Dec 9, 2023Updated 2 years ago
- [AAAI 2026] Relation-R1: Progressively Cognitive Chain-of-Thought Guided Reinforcement Learning for Unified Relation Comprehension☆18Mar 6, 2026Updated 2 months ago
- Using reference images to control style in text-to-image diffusion models. Based on CSD and IP Adapter☆54Mar 24, 2025Updated last year
- Project for Polkadot Hackathon☆36Apr 2, 2022Updated 4 years ago
- A curated list of Egocentric Action Understanding resources☆50Nov 26, 2025Updated 5 months ago
- Using different CNN models to train on GTZAN Dataset☆40Nov 14, 2023Updated 2 years ago
- An upgrade of Algowin-api that integrates the websocket APIs of multiple exchanges☆20Jan 11, 2023Updated 3 years ago
- 2019 Tianchi Zhuifeng Juvenile Typhoon Satellite Image Prediction—Optical Flow Method Solution☆11Sep 20, 2023Updated 2 years ago
- Vision Relation Transformer for Unbiased Scene Graph Generation (ICCV 2023)☆22Mar 23, 2026Updated last month
- [ICML2024] Repo for the paper "Evaluating and Analyzing Relationship Hallucinations in Large Vision-Language Models"☆24Jan 1, 2025Updated last year
- A curated list of self-taught materials including research blogs☆16Dec 12, 2016Updated 9 years ago
- Code for Reducing Hallucinations in Vision-Language Models via Latent Space Steering☆110Nov 23, 2024Updated last year
- PyTorch code for "Contrastive Region Guidance: Improving Grounding in Vision-Language Models without Training"☆39Mar 4, 2024Updated 2 years ago
- ☆13Oct 30, 2023Updated 2 years ago
- [EMNLP 2023] TESTA: Temporal-Spatial Token Aggregation for Long-form Video-Language Understanding☆49Jan 9, 2024Updated 2 years ago