PyTorch reimplementation of FlexiViT: One Model for All Patch Sizes
☆67May 5, 2024Updated 2 years ago
Alternatives and similar repositories for flexivit
Users that are interested in flexivit are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This is an official implementation for "Making Vision Transformers Efficient from A Token Sparsification View".☆34Feb 17, 2025Updated last year
- PyTorch implementation of Gaussian word embeddings☆19Apr 7, 2018Updated 8 years ago
- We're Not Using Videos Effectively (TMLR 2024)☆17Feb 4, 2024Updated 2 years ago
- ☆15Dec 15, 2015Updated 10 years ago
- (ICCV 2023) Vision Transformer Adapters for Generalizable Multitask Learning☆20Apr 25, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- SegEval is a Python library that provides tools for evaluating semantic segmentation models☆14Jun 29, 2023Updated 2 years ago
- [ICML 2024] Sparse Model Inversion: Efficient Inversion of Vision Transformers with Less Hallucination☆14Apr 29, 2025Updated last year
- Repository for the paper: Aligning LLMs to Ask Good Questions A Case Study in Clinical Reasoning☆18Feb 21, 2025Updated last year
- Self-Weighted Contrastive Learning among Multiple Views for Mitigating Representation Degeneration☆10Oct 23, 2023Updated 2 years ago
- A token pruning method that accelerates ViTs for various tasks while maintaining high performance.☆28Jul 21, 2025Updated 10 months ago
- [IJCAI'2022] FOGS: First-Order Gradient Supervision with Learning-based Graph for Traffic Flow Forecasting☆27Aug 25, 2023Updated 2 years ago
- When do we not need larger vision models?☆419Feb 8, 2025Updated last year
- The code will come soon.☆16Sep 12, 2025Updated 9 months ago
- ☆17Feb 23, 2025Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Deep Learning for Video Retrieval by Natural Language☆11Oct 20, 2019Updated 6 years ago
- Twin Contrastive Learning with Noisy Labels (CVPR 2023)☆74Aug 4, 2023Updated 2 years ago
- Learning 1D Causal Visual Representation with De-focus Attention Networks☆35Jun 7, 2024Updated 2 years ago
- My implementation of "Patch n’ Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution"☆272Apr 13, 2026Updated 2 months ago
- [NeurIPS2023]Lightweight Vision Transformer with Bidirectional Interaction☆27Oct 27, 2023Updated 2 years ago
- ☆17Jun 28, 2024Updated last year
- [TIP 2023] Co-Learning Meets Stitch-Up for Noisy Multi-label Visual Recognition.☆13Aug 19, 2023Updated 2 years ago
- This is the official implementation of the IndexNet.☆11May 23, 2023Updated 3 years ago
- Transformer model based on Kolmogorov–Arnold Network(KAN), which is an alternative of Multi-Layer Perceptron(MLP)☆29May 19, 2026Updated last month
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [EMNLP'23] The official GitHub page for ''Evaluating Object Hallucination in Large Vision-Language Models''☆117Aug 21, 2025Updated 9 months ago
- Abstract. Person search is a challenging problem with various real- world applications, that aims at joint person detection and re-identi…☆13Feb 28, 2024Updated 2 years ago
- ☆48Aug 7, 2023Updated 2 years ago
- About PyTorch implementation for ‘’Robust Multi-View Clustering with Noisy Correspondence‘’ (TKDE 2024)☆11Aug 2, 2024Updated last year
- PaperHelper: Knowledge-Based LLM QA Paper Reading Assistant with Reliable References☆21Jun 13, 2024Updated 2 years ago
- Repository for ACL2020 paper "Refer360° A Referring Expression Recognition Dataset in 360°Images"☆14Jun 26, 2021Updated 4 years ago
- Huggingface implementation of MVDream for easy import☆16Mar 31, 2025Updated last year
- ☆11Sep 1, 2024Updated last year
- INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal Large Language Model☆42Aug 4, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- codes for paper "AttCAT: Explaining Transformers via Attentive Class Activation Tokens"☆12May 13, 2024Updated 2 years ago
- Continual Barlow Twins (Barlow Twins with EWC implementation)☆14May 17, 2022Updated 4 years ago
- [ECCV 2024] Official PyTorch implementation of RoPE-ViT "Rotary Position Embedding for Vision Transformer"☆466Oct 29, 2025Updated 7 months ago
- “Open terminals”, “load CSVs”, “start hacking”☆16May 2, 2017Updated 9 years ago
- ODEON is a task-agnostic framework for deep learning applied to remote sensing☆22May 25, 2026Updated 3 weeks ago
- Multi-Layer Network Model leverages identification of spatial domains from spatial transcriptomics data☆13Aug 25, 2024Updated last year
- The official project website of "3D Human Pose Lifting with Grid Convolution" (GridConv for short, oral in AAAI 2023)☆34Dec 19, 2023Updated 2 years ago