[ICLR 2026] Official implementation of "Patch-as-Decodable-Token: Towards Unified Multi-Modal Vision Tasks in MLLMs"
☆160Oct 31, 2025Updated 8 months ago
Alternatives and similar repositories for PaDT
Users that are interested in PaDT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [IEEE TMI 2024] PASS: Prompt tuning for both styles and semantic shapes☆20Feb 12, 2025Updated last year
- [IGARSS 2024] Code for "CLIP-Guided Source-Free Object Detection in Aerial Images"☆28Dec 2, 2024Updated last year
- [S&P'24] Test-Time Poisoning Attacks Against Test-Time Adaptation Models☆21Feb 18, 2025Updated last year
- N-EPIC-Kitchens: The event-based camera extension of the large-scale EPIC-Kitchens dataset.☆23May 10, 2022Updated 4 years ago
- 🔮 UniPixel: Unified Object Referring and Segmentation for Pixel-Level Visual Reasoning (NeurIPS 2025)☆245Jan 4, 2026Updated 6 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- [CVPR2026] Generalizable Knowledge Distillation from Vision Foundation Models for Semantic Segmentation☆46Mar 12, 2026Updated 3 months ago
- IROS☆17Aug 10, 2025Updated 10 months ago
- GeoVLM-R1: Reinforcement Fine-Tuning for Improved Remote Sensing Reasoning☆30Mar 27, 2026Updated 3 months ago
- IEEE TMI paper: A multi-step modality fusion network for identifying the histologic subtypes of metastatic cervical lymphadenopathy☆10Nov 23, 2022Updated 3 years ago
- Distribution Aware Tuning☆16Aug 29, 2024Updated last year
- [IEEE TMI 2025] MIRROR: Multi-Modal Pathological Self-Supervised Representation Learning via Modality Alignment and Retention☆19Dec 15, 2025Updated 6 months ago
- ☆11Oct 2, 2024Updated last year
- [ICML22] Balancing Discriminability and Transferability for Source-Free Domain Adaptation☆11Oct 23, 2023Updated 2 years ago
- [NeurIPS 2022] Revisiting Realistic Test-Time Training: Sequential Inference and Adaptation by Anchored Clustering☆49Oct 6, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- RGBD2: Generative Scene Synthesis via Incremental View Inpainting using RGBD Diffusion Models☆100Mar 17, 2023Updated 3 years ago
- Code accompanying Ego-Exo: Transferring Visual Representations from Third-person to First-person Videos (CVPR 2021)☆40Jun 8, 2021Updated 5 years ago
- Discover the repository for "Advancing Generalizable Tumor Segmentation with Anomaly-Aware Open-Vocabulary Attention Maps and Frozen Foun…☆21Mar 22, 2025Updated last year
- [TGRS 2025] Code for "PointSAM: Pointly-Supervised Segment Anything Model for Remote Sensing Images"☆70Nov 4, 2025Updated 8 months ago
- Semantic-decoupled Spatial Partition Guided Point-supervised Oriented Object Detection☆13Updated this week
- [ICML' 24] Unsupervised Domain Adaptation for Anatomical Structure Detection in Ultrasound Images.☆11Jul 12, 2024Updated last year
- (Accepted by AAAI2025) official code of AIF-SFDA: Autonomous Information Filter-driven Source-Free Domain Adaptation for Medical Image Se…☆17Jan 7, 2025Updated last year
- ☆10Oct 26, 2023Updated 2 years ago
- A curated list of researches in object-centric learning☆11Oct 14, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆10May 10, 2024Updated 2 years ago
- 【IEEE TPAMI 2025】Uncertainty-aware Medical Diagnostic Phrase Identification and Grounding☆34Mar 17, 2026Updated 3 months ago
- DA-AIM: Exploiting Instance-based Mixed Sampling via Auxiliary Source Domain Supervision for Domain-adaptive Action Detection☆12Oct 6, 2022Updated 3 years ago
- GPT Table Semantic Parsing with complex & non-intuitive structure.☆17Jul 16, 2025Updated 11 months ago
- A General-purpose Person Re-identification Task with Instructions☆201Apr 1, 2024Updated 2 years ago
- MuCR is a benchmark designed to evaluate Multimodal Large Language Models' (MLLMs) ability to discern causal links across modalities☆20May 27, 2025Updated last year
- UMB: Understanding Model Behavior for Open-World object Detection (NeurIPS 2024)☆12May 26, 2024Updated 2 years ago
- [MICCAI 2024] Implicit Representation Embraces Challenging Attributes of Pulmonary Airway Tree Structures☆14Nov 13, 2024Updated last year
- Provides current Voreen Sources (with modifications) by Uni Münster to build voreen for PC, server or lrz cluster, including workspaces a…☆15Mar 2, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [CVPR2024 Hightlight] No Time to Train: Empowering Non-Parametric Networks for Few-shot 3D Scene Segmentation☆119Apr 20, 2024Updated 2 years ago
- Persistent homology calculation for 1D (scalar time series), 2D (image), and 3D, 4D (voxel) arrays☆70May 13, 2026Updated last month
- Controllable-LPMoE: Adapting to Challenging Object Segmentation via Dynamic Local Priors from Mixture-of-Experts (ICCV, 2025)☆25Dec 8, 2025Updated 6 months ago
- TIGeR: Tool-Integrated Geometric Reasoning in Vision-Language Models for Robotics☆21Nov 18, 2025Updated 7 months ago
- 原神锄地自动传送、快捡、qm、连续冲刺辅助脚本☆15Nov 11, 2024Updated last year
- ☆25Oct 30, 2024Updated last year
- Awesome_CV的中文版本,clone本项目到overleaf即可轻松愉快编写自己的CV☆17May 24, 2024Updated 2 years ago