PyTorch reimplementation of FlexiViT: One Model for All Patch Sizes
☆67 · May 5, 2024 · Updated last year
Alternatives and similar repositories for flexivit
Users that are interested in flexivit are comparing it to the libraries listed below
- PyTorch implementation of Gaussian word embeddings ☆19 · Apr 7, 2018 · Updated 7 years ago
- Deep Learning for Video Retrieval by Natural Language ☆11 · Oct 20, 2019 · Updated 6 years ago
- Elastic Workplace Search Official Python Client ☆10 · Aug 8, 2024 · Updated last year
- [ICLR'23] Effective Self-supervised Pre-training on Low-compute networks without Distillation ☆18 · Oct 9, 2024 · Updated last year
- This is the official implementation of Elaborative Rehearsal for Zero-shot Action Recognition (ICCV2021) ☆36 · Apr 9, 2022 · Updated 3 years ago
- SAM-CLIP module for use with Autodistill. ☆17 · Nov 21, 2023 · Updated 2 years ago
- Repository for ACL2020 paper "Refer360° A Referring Expression Recognition Dataset in 360° Images" ☆13 · Jun 26, 2021 · Updated 4 years ago
- [EMNLP'23] The official GitHub page for ''Evaluating Object Hallucination in Large Vision-Language Models'' ☆108 · Aug 21, 2025 · Updated 6 months ago
- Huggingface implementation of MVDream for easy import ☆16 · Mar 31, 2025 · Updated 11 months ago
- We're Not Using Videos Effectively (TMLR 2024) ☆17 · Feb 4, 2024 · Updated 2 years ago
- A token pruning method that accelerates ViTs for various tasks while maintaining high performance. ☆27 · Jul 21, 2025 · Updated 7 months ago
- “Open terminals”, “load CSVs”, “start hacking” ☆16 · May 2, 2017 · Updated 8 years ago
- ☆18 · Jun 1, 2021 · Updated 4 years ago
- This repo provides some examples (mostly solving PDE) to compare KAN and MLP ☆19 · May 3, 2024 · Updated last year
- Awesome Chinese Corpus Datasets and Models. ☆18 · Oct 28, 2019 · Updated 6 years ago
- Codes for arXiv paper "Semi-supervised Few-shot Atomic Action Recognition". ☆18 · Jan 2, 2021 · Updated 5 years ago
- When do we not need larger vision models? ☆413 · Feb 8, 2025 · Updated last year
- Code for ICCV2021: Discovering Human Interactions with Large-Vocabulary Objects via Query and Multi-Scale Detection ☆28 · Oct 12, 2021 · Updated 4 years ago
- TaiSu (太素): a large-scale Chinese multimodal dataset (a hundred-million-scale Chinese vision-language pre-training dataset) ☆191 · Nov 17, 2023 · Updated 2 years ago
- Transformer model for the Amazon Topical-Chat Corpus. Baselines for DSTC9 Track 3. ☆19 · Jul 9, 2020 · Updated 5 years ago
- PyTorch Implementation of NeurIPS 2020 paper "Learning Sparse Prototypes for Text Generation" ☆22 · Jul 8, 2021 · Updated 4 years ago
- Pytorch Implementation of Deepmind's SIMA: "Scaling Instructable Agents Across Many Simulated Worlds" ☆29 · Jun 17, 2024 · Updated last year
- [NeurIPS2023] Lightweight Vision Transformer with Bidirectional Interaction ☆27 · Oct 27, 2023 · Updated 2 years ago
- ☆24 · Sep 25, 2024 · Updated last year
- baseline mode for the ObjectNet competition ☆18 · Jan 13, 2021 · Updated 5 years ago
- Code for experiments for "ConvNet vs Transformer, Supervised vs CLIP: Beyond ImageNet Accuracy" ☆102 · Sep 11, 2024 · Updated last year
- Finetune CPM-1 ☆24 · Jun 20, 2021 · Updated 4 years ago
- An implementation of DIP-VAE from the paper "Variational Inference of Disentangled Latent Concepts from Unlabelled Observations" by Kumar… ☆26 · Apr 20, 2018 · Updated 7 years ago
- Notes on Deep Reinforcement Learning for Natural Language Processing papers ☆30 · Jul 17, 2017 · Updated 8 years ago
- Code and Data for the paper Investigating Evaluation of Open-Domain Dialogue Systems With Human Generated Multiple References SIGdial 201… ☆28 · Mar 6, 2020 · Updated 6 years ago
- An open source implementation of the paper: "Sequential Diagnosis with Language Models" From Microsoft Built with Swarms Framework ☆52 · Oct 13, 2025 · Updated 4 months ago
- Using business-level retrieval system (BM25) with Python in just a few lines. ☆31 · Feb 3, 2023 · Updated 3 years ago
- Explore how to get a VQ-VAE models efficiently! ☆68 · Jul 24, 2025 · Updated 7 months ago
- [IJCAI'2022] FOGS: First-Order Gradient Supervision with Learning-based Graph for Traffic Flow Forecasting ☆28 · Aug 25, 2023 · Updated 2 years ago
- Transformer model based on Kolmogorov–Arnold Network (KAN), which is an alternative of Multi-Layer Perceptron (MLP) ☆29 · Jan 24, 2026 · Updated last month
- ☆32 · Mar 7, 2022 · Updated 3 years ago
- Code and model for the AI City Challenge (CVPR 2022) Track 3 Action Detection (Naturalistic Driving Action Recognition) ☆28 · Jul 22, 2023 · Updated 2 years ago
- Masked Autoencoder meets GANs ☆31 · Dec 27, 2023 · Updated 2 years ago
- We are creating a challenging new benchmark MultiReQA: A Cross-Domain Evaluation for Retrieval Question Answering Models. Retrieval quest… ☆31 · Jul 9, 2020 · Updated 5 years ago