DeepAuto-AI / hip-attention
Training-free Post-training Efficient Sub-quadratic Complexity Attention. Implemented with OpenAI Triton.
☆14Updated this week
Related projects: ⓘ
- [ECCV2024][ICCV2023] Official PyTorch implementation of SeiT++ and SeiT☆51Updated last month
- These papers will provide unique insightful concepts that will broaden your perspective on neural networks and deep learning☆44Updated last year
- This is a PyTorch implementation of the paperViP A Differentially Private Foundation Model for Computer Vision☆38Updated last year
- [ICLR 2024] The Need for Speed: Pruning Transformers with One Recipe☆20Updated 2 weeks ago
- Official PyTorch implementation of MaskSub "Masking Augmentation for Supervised Learning"☆34Updated 6 months ago
- ☆36Updated last year
- ☆17Updated last year
- Learning Features with Parameter-Free Layers, ICLR 2022☆85Updated last year
- [ICLR 2023] RC-MAE☆51Updated 9 months ago
- ☆44Updated 4 months ago
- PyTorch implementation of FFN : "Fourier Features Let Networks Learn High Frequency Functions in Low Dimensional Domains (NeurIPS2020)"☆16Updated last year
- Switch EMA: A Free Lunch for Better Flatness and Sharpness☆19Updated 7 months ago
- Model Stock: All we need is just a few fine-tuned models☆75Updated 5 months ago
- DropIT: Dropping Intermediate Tensors for Memory-Efficient DNN Training (ICLR 2023)☆29Updated last year
- ☆10Updated 2 years ago
- ☆48Updated 11 months ago
- Code for the paper "What Makes Better Augmentation Strategies? Augment Difficult but Not too Different" (ICLR 22)☆13Updated last year
- ☆48Updated 8 months ago
- Official Pytorch Implementation of Unsupervised Representation Learning for Binary Networks by Joint Classifier Training (CVPR 2022)☆10Updated 2 years ago
- [ACL 2023] PuMer: Pruning and Merging Tokens for Efficient Vision Language Models☆25Updated last year
- Simple implementation of muP, based on Spectral Condition for Feature Learning. The implementation is SGD only, dont use it for Adam☆66Updated last month
- Github repository for Zero Shot Visual Storytelling☆15Updated 2 years ago
- [ICLR-2023] Rarity Score : A New Metric to Evaluate the Uncommonness of Synthesized Images☆59Updated 2 years ago
- [NeurIPS 2022 Spotlight] This is the official PyTorch implementation of "EcoFormer: Energy-Saving Attention with Linear Complexity"☆66Updated last year
- 이화여대 강의자료☆25Updated 7 months ago
- Course Website for "AI618: Generative Model and Unsupervised Learning"☆35Updated last year
- ImageNet-12k subset of ImageNet-21k (fall11)☆19Updated last year
- Official repository for Automated Learning Rate Scheduler for Large-Batch Training (8th ICML Workshop on AutoML)☆39Updated 2 years ago
- [Neurips 2023] Official pytorch implementation of "Addressing Negative Transfer in Diffusion Models"☆14Updated 2 months ago
- ☆22Updated last week