RangeKing / tuning_playbook_zh-CNLinks
A playbook for systematically maximizing the performance of deep learning models.
☆25Updated last year
Alternatives and similar repositories for tuning_playbook_zh-CN
Users that are interested in tuning_playbook_zh-CN are comparing it to the libraries listed below
Sorting:
- My implementation of the original transformer model (Vaswani et al.). I've additionally included the playground.py file for visualizing o…☆43Updated 6 months ago
- ☆23Updated 2 years ago
- Lion and Adam optimization comparison☆61Updated 2 years ago
- A Tight-fisted Optimizer☆48Updated 2 years ago
- Awesome Colab Projects Collection☆27Updated last year
- Learning Efficient Vision Transformers via Fine-Grained Manifold Distillation. NeurIPS 2022.☆32Updated 2 years ago
- 2024 FinVolution Global Data Science Competition-9th baseline☆18Updated last year
- Moved to https://github.com/NUS-HPC-AI-Lab/InfoBatch☆6Updated last year
- ☆27Updated 2 years ago
- [NeurIPS'22] Projector Ensemble Feature Distillation☆29Updated last year
- The implementation of mixup and mainfold mixup method with standard models(PreActRes, WideRes, Dense) in Cifar10, Cifar100 and SVHN datas…☆48Updated 3 years ago
- deep learning template code☆66Updated last year
- Implementation of dynamic temporal pooling (DTP) for time series classification☆39Updated 3 years ago
- pytorch单精度、半精度、混合精度、单卡、多卡(DP / DDP)、FSDP、DeepSpeed模型训练代码,并对比不同方法的训练速度以及GPU内存的使用☆102Updated last year
- 蜻蜓点论文 Think不Clear, 论文解读视频上传B站, youtube, 西瓜视频(同步到抖音)☆244Updated last year
- This is code of paper "ScalableViT: Rethinking the Context-oriented Generalization of Vision Transformer"☆26Updated last year
- The official implementation of the paper "Reducing Fine-Tuning Memory Overhead by Approximate and Memory-Sharing Backpropagation"☆20Updated 6 months ago
- This is the official code for paper: Token Summarisation for Efficient Vision Transformers via Graph-based Token Propagation☆29Updated last year
- Implementation of vision transformer. ⭐⭐⭐☆33Updated 3 years ago
- Test different pooling method used in CNN for Computer Vision Task☆35Updated 4 years ago
- PyTorch code and checkpoints release for VanillaKD: https://arxiv.org/abs/2305.15781☆76Updated last year
- Workshop on Foundation Model 1st foundation model challenge Track1 codebase (Open TransMind v1.0)☆18Updated 2 years ago
- Official implementation for paper "LightViT: Towards Light-Weight Convolution-Free Vision Transformers"☆140Updated 2 years ago
- ☆42Updated 5 months ago
- A list of papers, codes and applications on multi-task learning.☆72Updated 3 months ago
- This project is an unofficial summary of the resources related to VALSE and its annual seminar. Its main purpose is to more facilitate yo…☆19Updated last year
- ☆52Updated 2 years ago
- Codes For Sharing☆39Updated 4 years ago
- State Space Models☆67Updated last year
- # Unified Normalization (ACM MM'22) By Qiming Yang, Kai Zhang, Chaoxiang Lan, Zhi Yang, Zheyang Li, Wenming Tan, Jun Xiao, and Shiliang P…☆34Updated 2 years ago