RangeKing / tuning_playbook_zh-CNLinks
A playbook for systematically maximizing the performance of deep learning models.
☆25Updated last year
Alternatives and similar repositories for tuning_playbook_zh-CN
Users that are interested in tuning_playbook_zh-CN are comparing it to the libraries listed below
Sorting:
- 2024 FinVolution Global Data Science Competition-9th baseline☆19Updated last year
- Yet another PyTorch Trainer and some core components for deep learning.☆224Updated last year
- 蜻蜓点论文 Think不Clear, 论文解读视频上传B站, youtube, 西瓜视频(同步到抖音)☆246Updated last year
- The implementation of mixup and mainfold mixup method with standard models(PreActRes, WideRes, Dense) in Cifar10, Cifar100 and SVHN datas…☆48Updated 3 years ago
- The pure and clear PyTorch Distributed Training Framework.☆274Updated last year
- More light-weight pytorch experiment management library!☆68Updated 2 years ago
- ☆44Updated 7 months ago
- Lion and Adam optimization comparison☆63Updated 2 years ago
- pytorch单精度、半精度、混合精度、单卡、多卡(DP / DDP)、FSDP、DeepSpeed模型训练代码,并对比不同方法的训练速度以及GPU内存的使用☆117Updated last year
- Simple tutorials on Pytorch DDP training☆281Updated 3 years ago
- The official GitHub page for the survey paper "Discrete Tokenization for Multimodal LLMs: A Comprehensive Survey". And this paper is unde…☆55Updated 3 weeks ago
- About Code release for "Flowformer: Linearizing Transformers with Conservation Flows" (ICML 2022), https://arxiv.org/pdf/2202.06258.pdf☆326Updated last year
- ☆85Updated 2 years ago
- ☆168Updated this week
- PyTorch code and checkpoints release for VanillaKD: https://arxiv.org/abs/2305.15781☆75Updated last year
- A bag of tricks to speed up your deep learning process☆161Updated last year
- State Space Models☆70Updated last year
- Learning Efficient Vision Transformers via Fine-Grained Manifold Distillation. NeurIPS 2022.☆32Updated 2 years ago
- [ICLR 2022] Official implementation of cosformer-attention in cosFormer: Rethinking Softmax in Attention☆196Updated 2 years ago
- ☆215Updated 6 months ago
- Implement the Pareto Optimizer and pcgrad to make a self-adaptive loss for multi-task☆47Updated 3 years ago
- Awesome Colab Projects Collection☆27Updated last year
- deep learning template code☆66Updated last year
- DeepSpeed Tutorial☆101Updated last year
- A collection of SOTA Image Classification Models in PyTorch☆163Updated 3 years ago
- A Tight-fisted Optimizer☆50Updated 2 years ago
- ☆40Updated last year
- ☆23Updated 2 years ago
- ☆82Updated 3 months ago
- Implementation of Conv-based and Vit-based networks designed for CIFAR.☆70Updated 2 years ago