RangeKing / tuning_playbook_zh-CNLinks
A playbook for systematically maximizing the performance of deep learning models.
☆25Updated last year
Alternatives and similar repositories for tuning_playbook_zh-CN
Users that are interested in tuning_playbook_zh-CN are comparing it to the libraries listed below
Sorting:
- Yet another PyTorch Trainer and some core components for deep learning.☆222Updated last year
- A bag of tricks to speed up your deep learning process☆163Updated last year
- 蜻蜓点论文 Think不Clear, 论文解读视频上传B站, youtube, 西瓜视频(同步到抖音)☆251Updated 2 years ago
- pytorch单精度、半精度、混合精度、单卡、多卡(DP / DDP)、FSDP、DeepSpeed模型训练代码,并对比不同方法的训练速度以及GPU内存的使用☆129Updated last year
- The implementation of mixup and mainfold mixup method with standard models(PreActRes, WideRes, Dense) in Cifar10, Cifar100 and SVHN datas…☆48Updated 4 years ago
- 2024 FinVolution Global Data Science Competition-9th baseline☆20Updated last year
- The official GitHub page for the survey paper "Discrete Tokenization for Multimodal LLMs: A Comprehensive Survey". And this paper is unde…☆77Updated 5 months ago
- Simple tutorials on Pytorch DDP training☆286Updated 3 years ago
- ☆23Updated 3 years ago
- About Code release for "Flowformer: Linearizing Transformers with Conservation Flows" (ICML 2022), https://arxiv.org/pdf/2202.06258.pdf☆331Updated last year
- ATEC2023——赛道一: 大模型的知识引入Rank7方案分享☆26Updated last year
- Pytorch Lightning入门中文教程,转载请注明来源。(当初是写着玩的,建议看完MNIST这个例子再上手)☆231Updated 5 years ago
- 1st solution for the Webly-supervised Fine-grained Recognition competition in https://www.cvmart.net/race/10412/base☆43Updated 2 years ago
- ☆173Updated last week
- More light-weight pytorch experiment management library!☆70Updated 2 years ago
- ☆222Updated 11 months ago
- Faster version of AugShuffleNet without channel shuffle, computes partially, crossovers swiftly☆11Updated 11 months ago
- Implementation of "Attention Is Off By One" by Evan Miller☆198Updated 2 years ago
- PaddlePaddle Code Convert Toolkit. 『飞桨』深度学习代码转换工具☆121Updated last week
- PyTorch implementation of the Differential-Transformer architecture for sequence modeling, specifically tailored as a decoder-only model …☆85Updated last year
- The official repo for CVPR2023 highlight paper "Gradient Norm Aware Minimization Seeks First-Order Flatness and Improves Generalization".☆84Updated 2 years ago
- DeepSpeed Tutorial☆105Updated last year
- Awesome Colab Projects Collection☆29Updated 2 years ago
- The pure and clear PyTorch Distributed Training Framework.☆275Updated 2 years ago
- An awesome gpu tasks scheduler. 轻量好用的GPU机群任务调度工具。觉得有用可以点个star☆196Updated 3 years ago
- ☆85Updated 2 years ago
- [CVPR 2023 Highlight] This is the official implementation of "Stitchable Neural Networks".☆250Updated 2 years ago
- PyTorch code and checkpoints release for VanillaKD: https://arxiv.org/abs/2305.15781☆76Updated 2 years ago
- Lion and Adam optimization comparison☆64Updated 2 years ago
- deep learning template code☆66Updated last month