xiaogang00 / MTFormer
This is the source code for the ECCV paper "MTFormer: Multi-Task Learning via Transformer and Cross-Task Reasoning"
☆200 · Updated 3 years ago
Alternatives and similar repositories for MTFormer
Users who are interested in MTFormer are comparing it to the repositories listed below:
- GigaTrain: An Efficient and Scalable Training Framework for AI Models ☆252 · Updated last week
- (IJCV 2024 & ACM MM 2021 Oral) Multi-Source Fusion and Automatic Predictor Selection for Zero-Shot Video Object Segmentation ☆119 · Updated 3 years ago
- PySegMetrics (PSM): A Python-based Simple yet Efficient Evaluation Toolbox for Segmentation-like tasks ☆123 · Updated last year
- ☆206 · Updated 6 months ago
- ☆293 · Updated last month
- [NeurIPS 2025] More Than Generation: Unifying Generation and Depth Estimation via Text-to-Image Diffusion Models ☆214 · Updated last month
- Inspiring the Next Generation of Segment Anything Models: Comprehensively Evaluate SAM and SAM 2 with Diverse Prompts Towards Context-Dep… ☆574 · Updated 3 months ago
- The summary of code and paper for unified model towards context-dependent (CD) concept segmentation. ☆119 · Updated 3 months ago
- [Nature Communications 2025] Towards Expert-level Autonomous Carotid Ultrasonography with Large-scale Learning-based Robotic System ☆276 · Updated last month
- Official Pytorch implementation for ICML 2025 paper "Large Continual Instruction Assistant" ☆65 · Updated 4 months ago
- A curated collection of AI+X papers published in Nature / Science / Cell / Lancet / Radiology and their flagship sub-journals ☆136 · Updated 2 months ago
- (ECCV 2024) Open-Vocabulary Camouflaged Object Segmentation ☆205 · Updated 3 months ago
- (TIP 2022) Joint Learning of Salient Object Detection, Depth Estimation and Contour Extraction ☆109 · Updated 8 months ago
- GigaDatasets: A Unified and Lightweight Framework for Data Processing, Curation, and Visualization ☆88 · Updated last month
- [NeurIPS 2025] NAUTILUS: A Large Multimodal Model for Underwater Scene Understanding ☆341 · Updated last month
- PyTorch implementation for "Unlearning the Noisy Correspondence Makes CLIP More Robust (ICCV 2025)" ☆68 · Updated 2 months ago
- a multiscale multimodal large language models for radiology report generation (RRG) tasks ☆270 · Updated 3 months ago
- (CVPR 2024 & arXiv 2025) Power Battery Detection ☆310 · Updated 2 months ago
- [CVPR 2025 Highlight] Official Implementation of SURGEON: Memory-Adaptive Fully Test-Time Adaptation via Dynamic Activation Sparsity ☆113 · Updated 6 months ago
- ☆199 · Updated last month
- [NeurIPS 2025 (D&B)] Rethinking Evaluation of Infrared Small Target Detection ☆249 · Updated 2 months ago
- ☆67 · Updated 4 months ago
- This is the pytorch implementation for AAAI2022 paper "Hierarchical Image Generation via Transformer-Based Sequential Patch Selection" ☆84 · Updated 3 years ago
- Official repository of MMGenBench ☆120 · Updated 9 months ago
- DPO-Shift: Shifting the Distribution of Direct Preference Optimization ☆60 · Updated 9 months ago
- DeepThinkVLA: Enhancing Reasoning Capability of Vision-Language-Action Models ☆359 · Updated this week
- Test-Time Augmentation library for Pytorch ☆66 · Updated last year
- GigaModels: A Comprehensive Repository and Platform for Multi-modal, Generative, and Perceptual Models ☆159 · Updated last week
- [AAAI 2026 Oral] Cook and Clean Together: Teaching Embodied Agents for Parallel Task Execution ☆354 · Updated last week
- ☆385 · Updated 4 months ago