☆56Sep 28, 2023Updated 2 years ago
Alternatives and similar repositories for mixed-resolution-vit
Users that are interested in mixed-resolution-vit are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- CVPR2023: Vector Quantization with Self-Attention for Quality-Independent Representation Learning.☆14May 17, 2024Updated 2 years ago
- ☆26Aug 9, 2025Updated 9 months ago
- ☆16Aug 7, 2024Updated last year
- [NAACL 2022] TreeMix: Compositional Constituency-based Data Augmentation for Natural Language Understanding☆10Jul 15, 2023Updated 2 years ago
- Unconditional music synthesis using a diffusion model in the STFT domain☆12May 31, 2022Updated 3 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Official implementation for Wavelet Feature Maps Compression for Image-to-Image CNNs, NeurIPS 2022.☆37Oct 12, 2022Updated 3 years ago
- ☆11Jun 22, 2025Updated 11 months ago
- "From ViT Features to Training-free Video Object Segmentation via Streaming-data Mixture Models" [Uziel, Dinari, and Freifeld, NeurIPS 20…☆13Jan 16, 2024Updated 2 years ago
- (MICCAI 2024 Early Acc)Advancing UWF-SLO Vessel Segmentation with Source-Free Active Domain Adaptation and a Novel Multi-Center Dataset☆21Dec 25, 2024Updated last year
- Kaggle ultrasound nerve segmentation using Keras☆23Jan 22, 2017Updated 9 years ago
- [EMNLP'2023 Findings] MoqaGPT, for zero-shot multimodal question answering with LLMs☆13Dec 28, 2024Updated last year
- This is the implementation of CounterCurate, the data curation pipeline of both physical and semantic counterfactual image-caption pairs.☆19Jun 27, 2024Updated last year
- Consistent Amortized Clustering via Generative Flow Networks☆12Feb 27, 2025Updated last year
- ☆10Feb 12, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [CVPR 2025 Highlight] Change3D: Revisiting Change Detection and Captioning from A Video Modeling Perspective.☆86Jul 24, 2025Updated 10 months ago
- KDD 2025: MetamatBench: Integrating Heterogeneous Data, Computational Tools, and Visual Interface for Metamaterial Discovery☆19Jun 14, 2025Updated 11 months ago
- towhee+elasticsearch实现本地以图搜图☆11Apr 23, 2023Updated 3 years ago
- An official code release of the paper RGB no more: Minimally Decoded JPEG Vision Transformers☆57Jul 11, 2023Updated 2 years ago
- Code for Learned Thresholds Token Merging and Pruning for Vision Transformers (LTMP). A technique to reduce the size of Vision Transforme…☆17Nov 24, 2024Updated last year
- TimePoint: Accelerated Time Series Alignment via Self-Supervised Keypoint and Descriptor Learning. ICML 2025.☆29Jul 14, 2025Updated 10 months ago
- Moving-camera background model (a CVPR '20 paper)☆16Oct 22, 2020Updated 5 years ago
- ☆24Jun 13, 2022Updated 3 years ago
- a tiny project to test the effectiveness of video QA through RAG techniques and multimodal LLMs☆15Jun 2, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Experimental LDM uses of Paella's architecture☆34Jan 26, 2023Updated 3 years ago
- [NeurIPS24] VisMin: Visual Minimal-Change Understanding☆19Mar 3, 2025Updated last year
- Bilingual Singing Voice Synthesis☆18Mar 25, 2024Updated 2 years ago
- Utilities for PyTorch distributed☆25Feb 27, 2025Updated last year
- Official code for the paper "A Plug-and-Play Image Registration Network"☆11Mar 19, 2024Updated 2 years ago
- A Spitting Image: Modular Superpixel Tokenization in Vision Transformers☆22Sep 12, 2025Updated 8 months ago
- Piano roll made in React☆15Jul 9, 2023Updated 2 years ago
- 🔊 A comprehensive list of open-source datasets for voice and sound computing (50+ datasets).☆19Apr 1, 2021Updated 5 years ago
- Official PyTorch Implementation of Exploring Stochastic Autoregressive Image Modeling for Visual Representation, Accepted by AAAI 2023.☆16Jul 3, 2023Updated 2 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- "Revisiting DP-Means: Fast Scalable Algorithms via Parallelism and Delayed Cluster Creation" [Dinari and Freifeld, UAI 2022]☆21Jul 20, 2024Updated last year
- ☆13Mar 26, 2025Updated last year
- ☆13Sep 26, 2023Updated 2 years ago
- [ICCV 2023] Code for Prune Spatio-temporal Tokens by Semantic-aware Temporal Accumulation☆23Dec 12, 2023Updated 2 years ago
- ☆12Nov 16, 2020Updated 5 years ago
- The official repository of our paper "Reinforcing Video Reasoning with Focused Thinking"☆35Jun 12, 2025Updated 11 months ago
- LaVIT: Empower the Large Language Model to Understand and Generate Visual Content☆604Oct 6, 2024Updated last year