☆136May 26, 2025Updated 9 months ago
Alternatives and similar repositories for D3M
Users that are interested in D3M are comparing it to the libraries listed below
Sorting:
- Breaking Boundary Between Pre-training and Fine-tuning with Hybrid Prompting for Knowledge-Based VQA☆140Mar 10, 2024Updated 2 years ago
- ☆138Sep 24, 2024Updated last year
- ☆139May 31, 2023Updated 2 years ago
- ☆135Apr 9, 2025Updated 11 months ago
- This repository is the official implementation of the paper "Understanding Few-Shot Learning: Measuring Task Relatedness and Adaptation D…☆143Oct 17, 2023Updated 2 years ago
- PreLAR: World Model Pre-training with Learnable Action Representation, ECCV 2024☆169Apr 15, 2025Updated 11 months ago
- This repository contains the reference source code for the paper ["Scalable Modular Network: A Framework for Adaptive Learning via Agreem…☆135Mar 6, 2024Updated 2 years ago
- ☆134Feb 28, 2024Updated 2 years ago
- official codes for our WACV 2024 paper (Interpretable Object Recognition by Semantic Prototype Analysis)☆141Oct 29, 2025Updated 4 months ago
- Codes for the WACV 2023 paper: "Semantic Guided Latent Parts Embedding for Few-Shot Learning"☆143Jan 28, 2023Updated 3 years ago
- JoPano: Unified Panorama Generation via Joint Modeling☆24Mar 6, 2026Updated 2 weeks ago
- Collection of works from VIPL-AVSU☆50Mar 13, 2026Updated last week
- A better shell☆15Feb 28, 2026Updated 2 weeks ago
- Jodi: Unification of Visual Generation and Understanding via Joint Modeling☆90Mar 6, 2026Updated 2 weeks ago
- ☆51Aug 22, 2025Updated 6 months ago
- Instance-level Facial Attributes Editing (CVIU 2021)☆15Jul 19, 2022Updated 3 years ago
- The official implementation of ChatTraffic.☆51Jan 14, 2025Updated last year
- Visual speech recognition with face inputs: code and models for F&G 2020 paper "Can We Read Speech Beyond the Lips? Rethinking RoI Select…☆18Apr 12, 2021Updated 4 years ago
- Gene Expression Prediction from Histology Images via Hypergraph Neural Networks☆15May 19, 2025Updated 10 months ago
- [ECCV'24] T2IShield: Defending Against Backdoors on Text-to-Image Diffusion Models☆17Dec 21, 2025Updated 2 months ago
- ☆12Feb 14, 2019Updated 7 years ago
- [ICML'24] Open-Vocabulary Calibration for Fine-tuned CLIP☆18Jun 14, 2024Updated last year
- ☆13Sep 14, 2022Updated 3 years ago
- [ICLR'23] Effective Self-supervised Pre-training on Low-compute networks without Distillation☆18Oct 9, 2024Updated last year
- ☆14Jul 6, 2025Updated 8 months ago
- ICCV 2023 (Oral) Open-domain Visual Entity Recognition Towards Recognizing Millions of Wikipedia Entities☆43Jun 7, 2025Updated 9 months ago
- [CoRL 2025] CogniPlan: Uncertainty-Guided Path Planning with Conditional Generative Layout Prediction - Public code and model☆45Jan 30, 2026Updated last month
- RLHF for Video Diffusion Models☆26Jul 30, 2025Updated 7 months ago
- The codebase for ABAW4 challenge of ECCV2022 workshop.☆21Jun 18, 2023Updated 2 years ago
- [ICLR 2026] RefAny3D: 3D Asset-Referenced Diffusion Models for Image Generation☆32Mar 10, 2026Updated last week
- The PyTorch Code and Model In "Learn an Effective Lip Reading Model without Pains", (https://arxiv.org/abs/2011.07557), which reaches the…☆165Sep 12, 2025Updated 6 months ago
- Cell localization and counting: 1) Exponential Distance Transform Maps for Cell Localization; 2) Multi-scale Hypergraph-based Feature Ali…☆23Apr 10, 2024Updated last year
- Context-Guided Prompt Learning and Attention Refinement for Zero-Shot Anomaly Detection☆33Aug 9, 2025Updated 7 months ago
- Code for the paper "Detecting Any Human-Object Interaction Relationship: Universal HOI Detector with Spatial Prompt Learning on Foundatio…☆28Nov 8, 2023Updated 2 years ago
- ☆143Jun 28, 2024Updated last year
- Audio-Visual Speech Recognition☆21Jul 7, 2025Updated 8 months ago
- Offical implementation of "Spike2Former: Efficient Spiking Transformer for High-performance Image Segmentation" (AAAI2025 Oral)☆36Jan 14, 2026Updated 2 months ago
- [World-Model-Survey-2024] Paper list and projects for World Model☆15Oct 31, 2024Updated last year
- The speaker-labeled information of LRW dataset, which is the outcome of the paper "Speaker-adaptive Lip Reading with User-dependent Paddi…☆10Oct 12, 2023Updated 2 years ago