hananshafi/MTL-ViT

A new multi-task learning framework using Vision Transformers

☆11

Alternatives and similar repositories for MTL-ViT

Users interested in MTL-ViT are comparing it to the repositories listed below.

HashmatShadab / Robustness-of-Volumetric-Medical-Segmentation-Models
[BMVC 2024] On Evaluating Adversarial Robustness of Volumetric Medical Segmentation Models
☆15 · Nov 1, 2024 · Updated last year
hananshafi / MedContext
[MICCAI 2024] Official code for the paper "MedContext: Learning Contextual Cues for Efficient Volumetric Medical Segmentation"
☆14 · Nov 1, 2024 · Updated last year
akhtarvision / weather-regional
☆11 · Oct 29, 2024 · Updated last year
HashmatShadab / MambaRobustness
[CVPRW 2025] Official repository of paper titled "Towards Evaluating the Robustness of Visual State Space Models"
☆26 · Jun 8, 2025 · Updated last year
rohit901 / VANE-Bench
[NAACL'25] Contains code and documentation for our VANE-Bench paper.
☆24 · Aug 19, 2025 · Updated 11 months ago
HashmatShadab / HSAT
[MICCAI 2025] Hierarchical Self-Supervised Adversarial Training for Robust Vision Models in Histopathology
☆12 · Jun 17, 2025 · Updated last year
fahadshamshad / deep-facial-privacy-prior
[ECCVW 2024 -- ORAL] Official repository of paper titled "Makeup-Guided Facial Privacy Protection via Untrained Neural Network Priors".
☆12 · Oct 11, 2024 · Updated last year
techmn / cosnet
A Novel Semantic Segmentation Network using Enhanced Boundaries in Cluttered Scenes (WACV 2025)
☆12 · Aug 11, 2025 · Updated 11 months ago
ShahinaKK / LWI-VMS
Learnable Weight Initialization for Volumetric Medical Image Segmentation [Elsevier AIM2024]
☆22 · Oct 27, 2024 · Updated last year
umair1221 / AgriCLIP
A code
☆29 · Jan 23, 2025 · Updated last year
Hasindri / HLSS
[MICCAI 2024 🔥] HLSS, the first study to explore hierarchical information inherent in histopathology images and their language descripti…
☆27 · Aug 5, 2024 · Updated last year
Muhammad-Huzaifaa / ObjectCompose
[ACCV 2024] ObjectCompose: Evaluating Resilience of Vision-Based Models on Object-to-Background Compositional Changes 🚀🚀🚀
☆37 · Jan 21, 2025 · Updated last year
akhtarvision / cal-detr
☆42 · Nov 9, 2023 · Updated 2 years ago
mbzuai-oryx / VideoMathQA
VideoMathQA is a benchmark designed to evaluate mathematical reasoning in real-world educational videos
☆24 · May 7, 2026 · Updated 2 months ago
Muzammal-Naseer / DCViT-AT
Official repository for "Boosting Adversarial Transferability using Dynamic Cues" (ICLR 2023)
☆20 · Aug 24, 2023 · Updated 2 years ago
mzeeshankaramat / SafeAgents
☆20 · Jun 4, 2026 · Updated last month
mbzuai-oryx / CVRR-Evaluation-Suite
[CVPRW-25 MMFM] Official repository of paper titled "How Good is my Video LMM? Complex Video Reasoning and Robustness Evaluation Suite fo…
☆50 · Aug 23, 2024 · Updated last year
mbzuai-oryx / VideoMolmo
Official code of the paper "VideoMolmo: Spatio-Temporal Grounding meets Pointing"
☆56 · Jul 5, 2025 · Updated last year
aminebdj / 3D-OWIS
[NeurIPS2023] 3D-OWIS is capable of detecting unknown instances in inference, and progressively learning novel classes in the process of …
☆68 · Dec 3, 2023 · Updated 2 years ago
renytek13 / Soft-Prompt-Generation
[ECCV 2024] Soft Prompt Generation for Domain Generalization
☆33 · Oct 1, 2024 · Updated last year
Razaimam45 / TTL-Test-Time-Low-Rank-Adaptation
Official code repository of paper titled "Test-Time Low Rank Adaptation via Confidence Maximization for Zero-Shot Generalization of Visio…
☆34 · May 11, 2025 · Updated last year
HashmatShadab / APR
(BMVC 2022--Oral) Official repository for "Adversarial Pixel Restoration as a Pretext Task for Transferable Perturbations" …
☆35 · Jan 8, 2023 · Updated 3 years ago
HashmatShadab / Robust-LLaVA
[ICCVW 2025 (Oral)] Robust-LLaVA: On the Effectiveness of Large-Scale Robust Image Encoders for Multi-modal Large Language Models
☆29 · Oct 20, 2025 · Updated 9 months ago
mbzuai-oryx / DriveLMM-o1
Reasoning DriveLMM
☆15 · Mar 15, 2025 · Updated last year
mbzuai-oryx / AIN
AIN - The First Arabic Inclusive Large Multimodal Model. It is a versatile bilingual LMM excelling in visual and contextual understanding…
☆55 · Mar 13, 2025 · Updated last year
abdohelmy / D-3Former
Official repository of paper titled "D3Former: Debiased Dual Distilled Transformer for Incremental Learning".
☆25 · Jul 10, 2023 · Updated 3 years ago
akhtarvision / bpc_calibration
[CVPR 2023] Bridging Precision and Confidence: A Train-Time Loss for Calibrating Object Detection
☆31 · Jun 21, 2023 · Updated 3 years ago
zer0int / CLIP-ViT-visualization
What do CLIP Vision Transformers learn? Feature Visualization can show you!
☆15 · Aug 29, 2024 · Updated last year
iabh1shekbasu / CalibPrompt
[BMVC 2025 🔥] CalibPrompt is the first framework that enhances Med-VLM calibration during prompt tuning.
☆16 · Jul 13, 2026 · Updated 2 weeks ago
zer0int / CLIP-XAI-GUI
CLIP GUI - XAI app ~ explainable (and guessable) AI with ViT & ResNet models
☆22 · Sep 13, 2024 · Updated last year
MadryLab / bias-transfer
☆15 · Jul 24, 2022 · Updated 4 years ago
ChengHan111 / VPT-or-FT
Official Pytorch implementation of 'Facing the Elephant in the Room: Visual Prompt Tuning or Full Finetuning?' (ICLR2024)
☆13 · Mar 8, 2024 · Updated 2 years ago
amandpkr / GMNR
(ICCV 2023) Generative Multiplane Neural Radiance for 3D Aware Image Generation.
☆18 · Sep 28, 2023 · Updated 2 years ago
mbzuai-oryx / ARB
ARB: A Comprehensive Arabic Multimodal Reasoning Benchmark
☆17 · May 25, 2025 · Updated last year
jiaangli / VILA
[TACL/EMNLP'24] Do Vision and Language Models Share Concepts? A Vector Space Alignment Study
☆16 · Nov 22, 2024 · Updated last year
techmn / cdchat
A Large Multimodal Model for Remote Sensing Change Description (IGARSS 2025)
☆22 · Dec 17, 2025 · Updated 7 months ago
gefend / LIMITR
Implementation of the paper LIMITR: Leveraging Local Information for Medical Image-Text Representation
☆17 · Jul 21, 2026 · Updated last week
mbzuai-oryx / ALM-Bench
[CVPR 2025 🔥] ALM-Bench is a multilingual multi-modal diverse cultural benchmark for 100 languages across 19 categories. It assesses the…
☆47 · May 26, 2025 · Updated last year
umair1221 / WorldCache
WorldCache: Content-Aware Caching for Accelerated Video World Models
☆23 · Jun 28, 2026 · Updated last month