extremebird / Hydra
☆22 · Updated last year
Related projects
Alternatives and complementary repositories for Hydra
- This repository is the official implementation of our Autoregressive Pretraining with Mamba in Vision ☆62 · Updated 4 months ago
- A PyTorch implementation of the paper "Revisiting Non-Autoregressive Transformers for Efficient Image Synthesis" ☆33 · Updated 4 months ago
- ☆48 · Updated 4 months ago
- [BMVC 2024] PlainMamba: Improving Non-hierarchical Mamba in Visual Recognition ☆69 · Updated 2 months ago
- 🔥 [CVPR 2024] Official implementation of "See, Say, and Segment: Teaching LMMs to Overcome False Premises (SESAME)" ☆26 · Updated 4 months ago
- More dimensions = More fun ☆21 · Updated 3 months ago
- [ICLR 2023] Masked Frequency Modeling for Self-Supervised Visual Pre-Training ☆68 · Updated last year
- The official implementation of "Adapter is All You Need for Tuning Visual Tasks". ☆71 · Updated 2 months ago
- A Simple Latent Diffusion Approach for Panoptic Segmentation and Mask Inpainting [ECCV 2024] ☆55 · Updated 9 months ago
- [NeurIPS2024 Spotlight] The official implementation of GrootVL: Tree Topology is All You Need in State Space Model ☆81 · Updated 4 months ago
- This repository contains the pytorch code for our ISBI 2024 paper "ConvLoRA and AdaBN Based Domain Adaptation via Self-Training". ☆52 · Updated 3 weeks ago
- Video Diffusion State Space Models ☆19 · Updated 7 months ago
- ☆53 · Updated last year
- [CVPR'24] Multimodal Pathway: Improve Transformers with Irrelevant Data from Other Modalities ☆93 · Updated 7 months ago
- PyTorch Implementation of "Your ViT is Secretly a Hybrid Discriminative-Generative Diffusion Model" ☆48 · Updated last year
- Collect papers about Mamba (a selective state space model). ☆13 · Updated 3 months ago
- ☆41 · Updated 7 months ago
- Official implementation for paper "Knowledge Diffusion for Distillation", NeurIPS 2023 ☆76 · Updated 9 months ago
- Official repository of paper "Subobject-level Image Tokenization" ☆62 · Updated 6 months ago
- [CVPR'23] Hard Patches Mining for Masked Image Modeling ☆87 · Updated 10 months ago
- [ECCV2024] ClearCLIP: Decomposing CLIP Representations for Dense Vision-Language Inference ☆62 · Updated 2 months ago
- Learning 1D Causal Visual Representation with De-focus Attention Networks ☆29 · Updated 5 months ago
- [NeurIPS'23] DropPos: Pre-Training Vision Transformers by Reconstructing Dropped Positions ☆60 · Updated 6 months ago
- Official implementation of the paper "Masked Autoencoders are Efficient Class Incremental Learners" ☆36 · Updated 5 months ago
- [CVPR 2024] Official implementation of "Adapters Strike Back" ☆32 · Updated 3 months ago
- ☆32 · Updated 11 months ago
- Official PyTorch implementation for "Diffusion Models and Semi-Supervised Learners Benefit Mutually with Few Labels" ☆78 · Updated 9 months ago
- Code of our CVPR2024 paper - DiffusionMTL: Learning Multi-Task Denoising Diffusion Model from Partially Annotated Data ☆41 · Updated 7 months ago
- Official implementation of TagAlign ☆32 · Updated 7 months ago