[ECCV 2022] Implementation of the paper "Locality Guidance for Improving Vision Transformers on Tiny Datasets"
☆82Jul 21, 2022Updated 3 years ago
Alternatives and similar repositories for tiny-transformers
Users that are interested in tiny-transformers are comparing it to the libraries listed below
Sorting:
- [CVPR 2023] Out-of-Distributed Semantic Pruning for Robust Semi-Supervised Learning☆22Jun 11, 2023Updated 2 years ago
- Fuzzy Positive Learning (CVPR2023)☆15Jul 25, 2024Updated last year
- Official pytorch implementation of NeurIPS 2022 paper, TokenMixup☆48Nov 22, 2022Updated 3 years ago
- Code of our Neurips2020 paper "Auto Learning Attention", coming soon☆22Apr 14, 2021Updated 4 years ago
- Official implementation for paper "Relational Surrogate Loss Learning", ICLR 2022☆37Nov 25, 2022Updated 3 years ago
- [NeurIPS 2024] Search for Efficient LLMs☆16Jan 16, 2025Updated last year
- Unified Multi-modal IAA Baseline and Benchmark☆93Sep 27, 2024Updated last year
- GPT-4V(ision) as A Social Media Analysis Engine☆38Dec 20, 2024Updated last year
- LLMBind: A Unified Modality-Task Integration Framework☆19Jun 16, 2024Updated last year
- Official implementation of paper "Knowledge Distillation from A Stronger Teacher", NeurIPS 2022☆155Dec 28, 2022Updated 3 years ago
- code release of research paper "Exploring Long-Sequence Masked Autoencoders"☆100Oct 14, 2022Updated 3 years ago
- (ECCV2022) EAGAN: EAGAN: Efficient Two-stage Evolutionary Architecture Search for GANs☆12Sep 15, 2022Updated 3 years ago
- ☆10Oct 7, 2019Updated 6 years ago
- codes for ICML2021 paper iDARTS: Differentiable Architecture Search with Stochastic Implicit Gradients☆10May 27, 2021Updated 4 years ago
- MADAv2: Advanced Multi-Anchor Based Active Domain Adaptation Segmentation☆25Jul 8, 2023Updated 2 years ago
- Precision Search through Multi-Style Inputs☆73Jul 30, 2025Updated 7 months ago
- ☆15Apr 13, 2023Updated 2 years ago
- ☆14Dec 25, 2020Updated 5 years ago
- [NeurIPS'22] Projector Ensemble Feature Distillation☆30Jan 4, 2024Updated 2 years ago
- Implementation of HAT https://arxiv.org/pdf/2204.00993☆51Mar 23, 2024Updated last year
- [ECCV 2022]Code for paper "DaViT: Dual Attention Vision Transformer"☆374Feb 13, 2024Updated 2 years ago
- [ECCV 2022] AMixer: Adaptive Weight Mixing for Self-attention Free Vision Transformers☆29Nov 14, 2022Updated 3 years ago
- [NeurIPS'22] What Makes a "Good" Data Augmentation in Knowledge Distillation -- A Statistical Perspective☆37Dec 15, 2022Updated 3 years ago
- Codes for DATA: Differentiable ArchiTecture Approximation.☆11Jul 22, 2021Updated 4 years ago
- ☆12May 22, 2022Updated 3 years ago
- ☆13Jun 28, 2021Updated 4 years ago
- ☆13Jun 8, 2021Updated 4 years ago
- UniTAB: Unifying Text and Box Outputs for Grounded VL Modeling, ECCV 2022 (Oral Presentation)☆89Jun 12, 2023Updated 2 years ago
- LLM Reasoning Benchmark & Chain-of-Thoughts Dataset for Chemistry☆45Oct 9, 2025Updated 4 months ago
- Source code for NeurIPS 2022 paper SoLar☆30Dec 20, 2023Updated 2 years ago
- 陆续开源医疗行业的深度学习模型及数据集☆13Dec 30, 2021Updated 4 years ago
- ICML2019 Accepted Paper. Overcoming Multi-Model Forgetting☆14Jun 5, 2019Updated 6 years ago
- V1: Toward Multimodal Reasoning by Designing Auxiliary Task☆36Apr 14, 2025Updated 10 months ago
- 'NKD and USKD' (ICCV 2023) and 'ViTKD' (CVPRW 2024)☆242Oct 10, 2023Updated 2 years ago
- [ICLR'22 Oral] Implementation of "CycleMLP: A MLP-like Architecture for Dense Prediction"☆291Apr 25, 2022Updated 3 years ago
- BESA is a differentiable weight pruning technique for large language models.☆17Mar 4, 2024Updated last year
- Data-free knowledge distillation using Gaussian noise (NeurIPS paper)☆15Mar 24, 2023Updated 2 years ago
- Benchmarks for Macro Neural Architecture Search; used and described in the paper "Local Search is a Remarkably Strong Baseline for Neural…☆12Jul 25, 2024Updated last year
- Masked Generative Distillation (ECCV 2022)☆240Nov 9, 2022Updated 3 years ago