RangeKing / tuning_playbook_zh-CN
A playbook for systematically maximizing the performance of deep learning models.
☆25Updated last year
Alternatives and similar repositories for tuning_playbook_zh-CN
Users that are interested in tuning_playbook_zh-CN are comparing it to the libraries listed below
- 2024 FinVolution Global Data Science Competition-9th baseline☆19Updated last year
- Yet another PyTorch Trainer and some core components for deep learning.☆222Updated last year
- 蜻蜓点论文 (Think不Clear): paper-explainer videos uploaded to Bilibili, YouTube, and Xigua Video (synced to Douyin)☆245Updated last year
- The implementation of the mixup and manifold mixup methods with standard models (PreActRes, WideRes, Dense) in Cifar10, Cifar100 and SVHN datas…☆48Updated 3 years ago
- ☆213Updated 5 months ago
- Code release for "Flowformer: Linearizing Transformers with Conservation Flows" (ICML 2022), https://arxiv.org/pdf/2202.06258.pdf☆325Updated last year
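- PyTorch model training code for single precision, half precision, mixed precision, single GPU, multi-GPU (DP / DDP), FSDP, and DeepSpeed, comparing the training speed and GPU memory usage of the different methods☆115Updated last year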
- ☆44Updated 6 months ago
- ☆23Updated 2 years ago
- Awesome Colab Projects Collection☆27Updated last year
- The pure and clear PyTorch Distributed Training Framework.☆275Updated last year
- PaddlePaddle Code Convert Toolkit. A deep learning code conversion tool for PaddlePaddle (飞桨)☆109Updated last week
- Lion and Adam optimization comparison☆63Updated 2 years ago
- Simple tutorials on PyTorch DDP training☆281Updated 2 years ago
- Learning Efficient Vision Transformers via Fine-Grained Manifold Distillation. NeurIPS 2022.☆32Updated 2 years ago
- My implementation of the original transformer model (Vaswani et al.). I've additionally included the playground.py file for visualizing o…☆43Updated 8 months ago
- Unofficial Implementation of MLP-Mixer, gMLP, resMLP, Vision Permutator, S2MLP, S2MLPv2, RaftMLP, HireMLP, ConvMLP, AS-MLP, SparseMLP, Co…☆170Updated 3 years ago
- Official implementation of paper "Knowledge Distillation from A Stronger Teacher", NeurIPS 2022☆147Updated 2 years ago
- ☆81Updated 3 months ago
- deep learning template code☆66Updated last year
- PyTorch implementation of the Differential-Transformer architecture for sequence modeling, specifically tailored as a decoder-only model …☆73Updated 9 months ago
- Keras implementation of Finite Scalar Quantization☆79Updated last year
- State Space Models☆70Updated last year
- iFormer: Inception Transformer☆248Updated 2 years ago
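- Introductory Chinese-language tutorial for PyTorch Lightning; please credit the source when reposting. (Originally written for fun; working through the MNIST example before getting started is recommended.)☆221Updated 4 years ago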
- A bag of tricks to speed up your deep learning process☆160Updated last year
- A light-weight script for maintaining a LOT of machine learning experiments.☆92Updated 2 years ago
- ☆167Updated this week
- The official GitHub page for the survey paper "Discrete Tokenization for Multimodal LLMs: A Comprehensive Survey". And this paper is unde…☆35Updated last week
- Training ImageNet / CIFAR models with SOTA strategies and techniques such as ViT, KD, Rep, etc.☆82Updated last year