extremebird / HydraLinks
☆29Updated 2 years ago
Alternatives and similar repositories for Hydra
Users that are interested in Hydra are comparing it to the libraries listed below
Sorting:
- [CVPR2025] Breaking the Low-Rank Dilemma of Linear Attention☆39Updated 11 months ago
- [ECCV 2024] Official implementation of the paper "Towards Latent Masked Image Modeling for Self-Supervised Visual Representation Learning…☆29Updated 11 months ago
- FFNet: MetaMixer-based Efficient Convolutional Mixer Design☆31Updated 11 months ago
- [ICLR2025] This repository is the official implementation of our Autoregressive Pretraining with Mamba in Vision☆90Updated 8 months ago
- ☆58Updated 2 years ago
- [AAAI 2025] Official pytorch implementation of "Diffusion Model Patching via Mixture-of-Prompts"☆13Updated last year
- Collect papers about Mamba (a selective state space model).☆14Updated last year
- List of papers related to State Space Models (Mamba) in Vision.☆37Updated last year
- SimCMF: A Simple Cross-modal Fine-tuning Strategy from Vision Foundation Models to Any Imaging Modality☆35Updated last year
- [NeurIPS2024 Spotlight] The official implementation of MambaTree: Tree Topology is All You Need in State Space Model☆105Updated last year
- Official Implementation (Pytorch) of "DDMI: Domain-Agnostic Latent Diffusion Models for Synthesizing High-Quality Implicit Neural Represe…☆27Updated last year
- Official repository of InLine attention (NeurIPS 2024)☆58Updated last year
- [CVPR 2024] Multimodal Pathway: Improve Transformers with Irrelevant Data from Other Modalities☆101Updated last year
- Code of our CVPR2024 paper - DiffusionMTL: Learning Multi-Task Denoising Diffusion Model from Partially Annotated Data☆59Updated last year
- WeGeFT: Weight‑Generative Fine‑Tuning for Multi‑Faceted Efficient Adaptation of Large Models☆22Updated 7 months ago
- [CVPR 2024 Highlight] ImageNet-D☆46Updated last year
- (NeurIPS 2025) Vision Foundation Models as Effective Visual Tokenizers for Autoregressive Image Generation☆64Updated 3 months ago
- ☆48Updated last year
- ☆33Updated last year
- More dimensions = More fun☆26Updated last year
- Official PyTorch implementation of The Linear Attention Resurrection in Vision Transformer☆15Updated last year
- Official pytorch implementation of "Towards Practical Plug-and-Play Diffusion Models" in CVPR2023☆22Updated 2 years ago
- Adapters Strike Back (CVPR 2024)☆44Updated last year
- (ECCV 2024) Can OOD Object Detectors Learn from Foundation Models?☆25Updated last year
- [CVPR 2024] This is the official implementation of "ExtDM: Distribution Extrapolation Diffusion Model for Video Prediction"☆56Updated 7 months ago
- [CVPR 2023] Castling-ViT: Compressing Self-Attention via Switching Towards Linear-Angular Attention During Vision Transformer Inference☆30Updated last year
- [ECCV 2024] Official repository for "DataDream: Few-shot Guided Dataset Generation"☆50Updated last year
- Official code repo of PIN: Positional Insert Unlocks Object Localisation Abilities in VLMs☆26Updated last year
- Code for "How far can we go with ImageNet for Text-to-Image generation?" paper☆95Updated 2 months ago
- Code for the paper "Vamba: Understanding Hour-Long Videos with Hybrid Mamba-Transformers" [ICCV 2025]☆100Updated 6 months ago