Amshaker/MAVOS

Amshaker / MAVOS

[WACV 2025] Efficient Video Object Segmentation via Modulated Cross-Attention Memory

☆61

Alternatives and similar repositories for MAVOS

Users that are interested in MAVOS are comparing it to the libraries listed below.

mbzuai-oryx / VideoMathQA
VideoMathQA is a benchmark designed to evaluate mathematical reasoning in real-world educational videos
☆24 · May 7, 2026 · Updated 2 months ago
Amshaker / GroupMamba
[CVPR 2025] GroupMamba: Parameter-Efficient and Accurate Group Visual State Space Model
☆142 · Mar 22, 2025 · Updated last year
zht8506 / QMVOS
Code of ICME2024 Paper: Video Object Segmentation with Dynamic Query Modulation
☆12 · Mar 23, 2024 · Updated 2 years ago
mbzuai-oryx / ClimateGPT
[EMNLP'23] ClimateGPT: a specialized LLM for conversations related to Climate Change and Sustainability topics in both English and Arabi…
☆79 · Sep 24, 2024 · Updated last year
HashmatShadab / APR
(BMVC 2022, Oral) Official repository for "Adversarial Pixel Restoration as a Pretext Task for Transferable Perturbations" …
☆35 · Jan 8, 2023 · Updated 3 years ago
mbzuai-oryx / EvoLMM
Self Evolving Large Multimodal Models with Continuous Rewards
☆25 · Jun 9, 2026 · Updated last month
Amshaker / SwiftFormer
[ICCV 2023] Official repository of paper SwiftFormer: Efficient Additive Attention for Transformer-based Real-time Mobile Vision Applic…
☆317 · Jul 18, 2025 · Updated last year
uncbiag / LiVOS
LiVOS: Light Video Object Segmentation with Gated Linear Matching (CVPR 2025)
☆48 · Sep 1, 2025 · Updated 10 months ago
Amshaker / Mobile-VideoGPT
Mobile-VideoGPT: Fast and Accurate Video Understanding Language Model
☆142 · Aug 6, 2025 · Updated 11 months ago
LinfengYuan1997 / LoSh
[CVPR 2024] LoSh: Long-Short Text Joint Prediction Network for Referring Video Object Segmentation
☆13 · Jun 17, 2024 · Updated 2 years ago
Restricted-Memory / RMem
Official repository of CVPR 2024 paper, RMem: Restricted Memory Banks Improve Video Object Segmentation
☆53 · Jun 18, 2026 · Updated last month
OmkarThawakar / composed-video-retrieval
Composed Video Retrieval
☆62 · May 2, 2024 · Updated 2 years ago
mbzuai-oryx / CVRR-Evaluation-Suite
[CVPRW-25 MMFM] Official repository of paper titled "How Good is my Video LMM? Complex Video Reasoning and Robustness Evaluation Suite fo…
☆50 · Aug 23, 2024 · Updated last year
hy0523 / MTNet
Learning Motion and Temporal Cues for Unsupervised Video Object Segmentation [TNNLS 2024]
☆14 · May 6, 2025 · Updated last year
mbzuai-oryx / PALO
(WACV 2025 - Oral) Vision-language conversation in 10 languages including English, Chinese, French, Spanish, Russian, Japanese, Arabic, H…
☆85 · Aug 5, 2025 · Updated 11 months ago
amandpkr / XM-GAN
[MICCAI 2023][Early Accept] Official code repository of paper titled "Cross-modulated Few-shot Image Generation for Colorectal Tissue Cla…
☆47 · Sep 28, 2023 · Updated 2 years ago
HashmatShadab / HSAT
[MICCAI 2025] Hierarchical Self-Supervised Adversarial Training for Robust Vision Models in Histopathology
☆12 · Jun 17, 2025 · Updated last year
abdohelmy / D-3Former
Official repository of paper titled "D3Former: Debiased Dual Distilled Transformer for Incremental Learning".
☆25 · Jul 10, 2023 · Updated 3 years ago
mbzuai-oryx / ARB
ARB: A Comprehensive Arabic Multimodal Reasoning Benchmark
☆17 · May 25, 2025 · Updated last year
mbzuai-oryx / XrayGPT
[BIONLP@ACL 2024] XrayGPT: Chest Radiographs Summarization using Medical Vision-Language Models.
☆529 · Aug 8, 2024 · Updated last year
mmaaz60 / mdef_detr
☆11 · May 9, 2023 · Updated 3 years ago
hmchuong / MaGGIe
[CVPR24] MaGGIe: Mask Guided Gradual Human Instance Matting
☆79 · Dec 26, 2024 · Updated last year
mbzuai-oryx / Video-CoM
Video-CoM: Interactive Video Reasoning via Chain of Manipulations
☆22 · Jun 17, 2026 · Updated last month
HashmatShadab / Robustness-of-Volumetric-Medical-Segmentation-Models
[BMVC 2024] On Evaluating Adversarial Robustness of Volumetric Medical Segmentation Models
☆15 · Nov 1, 2024 · Updated last year
Ali2500 / BURST-benchmark
☆81 · Aug 19, 2023 · Updated 2 years ago
hanoonaR / object-centric-ovd
[NeurIPS 2022] Official repository of paper titled "Bridging the Gap between Object and Image-level Representations for Open-Vocabulary …
☆297 · Oct 12, 2022 · Updated 3 years ago
ShahinaKK / LWI-VMS
Learnable Weight Initialization for Volumetric Medical Image Segmentation [Elsevier AIM 2024]
☆22 · Oct 27, 2024 · Updated last year
ziplab / MPVSS
☆33 · Feb 29, 2024 · Updated 2 years ago
QianWangX / VidSeg_diffusion
Implementation of Zero-Shot Video Semantic Segmentation [CVPR 2025]
☆60 · Feb 27, 2025 · Updated last year
ahmed1996said / dubai-housing-prices
ML model trained on data from Bayut.com to predict housing prices in Dubai
☆17 · Aug 21, 2025 · Updated 11 months ago
amandpkr / GMNR
(ICCV 2023) Generative Multiplane Neural Radiance for 3D Aware Image Generation.
☆18 · Sep 28, 2023 · Updated 2 years ago
mbzuai-oryx / BiMediX
Bilingual Medical Mixture of Experts LLM
☆33 · Nov 23, 2024 · Updated last year
muzairkhattak / transformers-transforming-vision
Validating image classification benchmark results on ViTs and ResNets (v2)
☆13 · Nov 3, 2022 · Updated 3 years ago
umair1221 / WorldCache
WorldCache: Content-Aware Caching for Accelerated Video World Models
☆21 · Jun 28, 2026 · Updated 3 weeks ago
muzairkhattak / ViFi-CLIP
[CVPR 2023] Official repository of paper titled "Fine-tuned CLIP models are efficient video learners".
☆309 · Apr 3, 2024 · Updated 2 years ago
YChienHung / PolypMix
Official PyTorch implementation of PolypMix
☆15 · Nov 22, 2024 · Updated last year
NiFangBaAGe / DATTT
[CVPR 2024] Depth-aware Test-Time Training for Zero-shot Video Object Segmentation
☆29 · Apr 28, 2025 · Updated last year
HashmatShadab / MambaRobustness
[CVPRW 2025] Official repository of paper titled "Towards Evaluating the Robustness of Visual State Space Models"
☆26 · Jun 8, 2025 · Updated last year
mbzuai-oryx / LongShOT
A Benchmark and Agentic Framework for Omni-Modal Reasoning and Tool Use in Long Videos
☆21 · Jun 20, 2026 · Updated last month