[ICLR 2026] Official implementation of "Patch-as-Decodable-Token: Towards Unified Multi-Modal Vision Tasks in MLLMs"
☆158Oct 31, 2025Updated 6 months ago
Alternatives and similar repositories for PaDT
Users that are interested in PaDT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [IGARSS 2024] Code for "CLIP-Guided Source-Free Object Detection in Aerial Images"☆27Dec 2, 2024Updated last year
- [AAAI 2024] Towards Real-World Test-Time Adaptation: Tri-Net Self-Training with Balanced Normalization☆29Apr 8, 2025Updated last year
- N-EPIC-Kitchens: The event-based camera extension of the large-scale EPIC-Kitchens dataset.☆23May 10, 2022Updated 4 years ago
- 🔮 UniPixel: Unified Object Referring and Segmentation for Pixel-Level Visual Reasoning (NeurIPS 2025)☆241Jan 4, 2026Updated 4 months ago
- Official implementation for the paper "Unpaired Multi-domain Attribute Translation of 3D Facial Shapes with a Square and Symmetric Geomet…☆15Jan 3, 2024Updated 2 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- The official code repository for the ECCV 2024 accepted paper "Representing Topological Self-Similarity Using Fractal Feature Maps for Ac…☆29Jul 9, 2024Updated last year
- A curated list of resources on Document Layout Analysis☆12Aug 7, 2025Updated 9 months ago
- Official Pytorch implementation for "AttentionHand: Text-driven Controllable Hand Image Generation for 3D Hand Reconstruction in the Wild…☆11May 11, 2026Updated 2 weeks ago
- FS-DFM: Fast and Accurate Long Text Generation with Few-Step Diffusion Language Models. FS-DFM accepted for ICLR 2026☆42Jan 6, 2026Updated 4 months ago
- VLM-FO1: Bridging the Gap Between High-Level Reasoning and Fine-Grained Perception in VLMs☆308Mar 12, 2026Updated 2 months ago
- ☆14Jun 6, 2023Updated 2 years ago
- GeoVLM-R1: Reinforcement Fine-Tuning for Improved Remote Sensing Reasoning☆28Mar 27, 2026Updated last month
- IEEE TMI paper: A multi-step modality fusion network for identifying the histologic subtypes of metastatic cervical lymphadenopathy☆10Nov 23, 2022Updated 3 years ago
- [ICML22] Balancing Discriminability and Transferability for Source-Free Domain Adaptation☆11Oct 23, 2023Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [NeurIPS 2022] Revisiting Realistic Test-Time Training: Sequential Inference and Adaptation by Anchored Clustering☆49Oct 6, 2023Updated 2 years ago
- ☆10May 16, 2023Updated 3 years ago
- Code accompanying Ego-Exo: Transferring Visual Representations from Third-person to First-person Videos (CVPR 2021)☆39Jun 8, 2021Updated 4 years ago
- Semantic-decoupled Spatial Partition Guided Point-supervised Oriented Object Detection☆13Jun 17, 2025Updated 11 months ago
- [AAAI 2025] Official PyTorch implementation of "ZoRI: Towards Discriminative Zero-Shot Remote Sensing Instance Segmentation"☆41Aug 26, 2025Updated 8 months ago
- (Accepted by AAAI2025) official code of AIF-SFDA: Autonomous Information Filter-driven Source-Free Domain Adaptation for Medical Image Se…☆16Jan 7, 2025Updated last year
- TPAMI:Frequency-aware Feature Fusion for Dense Image Prediction☆21Dec 25, 2025Updated 5 months ago
- ☆10Oct 26, 2023Updated 2 years ago
- text-only training or language-free training for multimodal tasks (image/audio/video caption, retrieval, text2image)☆12Oct 15, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A curated list of researches in object-centric learning☆11Oct 14, 2024Updated last year
- ☆10Nov 4, 2024Updated last year
- Offical Code of MICCAI'25 Best-Paper-Shortlist paper "MedGround-R1: Advancing Medical Image Grounding via Spatial-Semantic Rewarded Group…☆40Sep 28, 2025Updated 7 months ago
- DA-AIM: Exploiting Instance-based Mixed Sampling via Auxiliary Source Domain Supervision for Domain-adaptive Action Detection☆12Oct 6, 2022Updated 3 years ago
- A General-purpose Person Re-identification Task with Instructions☆202Apr 1, 2024Updated 2 years ago
- MuCR is a benchmark designed to evaluate Multimodal Large Language Models' (MLLMs) ability to discern causal links across modalities☆20May 27, 2025Updated 11 months ago
- Provides current Voreen Sources (with modifications) by Uni Münster to build voreen for PC, server or lrz cluster, including workspaces a…☆15Mar 2, 2024Updated 2 years ago
- The official implement of CTRNet++.☆15Dec 30, 2024Updated last year
- Controllable-LPMoE: Adapting to Challenging Object Segmentation via Dynamic Local Priors from Mixture-of-Experts (ICCV, 2025)☆24Dec 8, 2025Updated 5 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [ICME 2025] Official implementation for "VectorPainter: Advanced Stylized Vector Graphics Synthesis Using Stroke-Style Priors" https://ar…☆21Jan 29, 2026Updated 3 months ago
- The official repository of C-CoTTA: Controllable Continual Test-Time Adaptation☆11Jun 17, 2024Updated last year
- 原神锄地自动传送、快捡、qm、连续冲刺辅助脚本☆15Nov 11, 2024Updated last year
- ☆26Oct 30, 2024Updated last year
- ☆16May 8, 2025Updated last year
- ☆24Nov 29, 2024Updated last year
- Awesome_CV的中文版本,clone本项目到overleaf即可轻松愉快编写自己的CV☆17May 24, 2024Updated 2 years ago