RangeKing / tuning_playbook_zh-CNLinks
A playbook for systematically maximizing the performance of deep learning models.
☆25Updated last year
Alternatives and similar repositories for tuning_playbook_zh-CN
Users that are interested in tuning_playbook_zh-CN are comparing it to the libraries listed below
Sorting:
- The official GitHub page for the survey paper "Discrete Tokenization for Multimodal LLMs: A Comprehensive Survey". And this paper is unde…☆74Updated 4 months ago
- Awesome Colab Projects Collection☆29Updated last year
- Yet another PyTorch Trainer and some core components for deep learning.☆223Updated last year
- My implementation of the original transformer model (Vaswani et al.). I've additionally included the playground.py file for visualizing o…☆44Updated last year
- deep learning template code☆65Updated last year
- About Code release for "Flowformer: Linearizing Transformers with Conservation Flows" (ICML 2022), https://arxiv.org/pdf/2202.06258.pdf☆328Updated last year
- pytorch单精度、半精度、混合精度、单卡、多卡(DP / DDP)、FSDP、DeepSpeed模型训练代码,并对比不同方法的训练速度以及GPU内存的使用☆127Updated last year
- 2024 FinVolution Global Data Science Competition-9th baseline☆20Updated last year
- More light-weight pytorch experiment management library!☆69Updated 2 years ago
- The implementation of mixup and mainfold mixup method with standard models(PreActRes, WideRes, Dense) in Cifar10, Cifar100 and SVHN datas…☆48Updated 4 years ago
- ☆42Updated 10 months ago
- A Tight-fisted Optimizer☆50Updated 2 years ago
- ☆171Updated this week
- Lion and Adam optimization comparison☆64Updated 2 years ago
- Simple tutorials on Pytorch DDP training☆285Updated 3 years ago
- A bag of tricks to speed up your deep learning process☆162Updated last year
- 蜻蜓点论文 Think不Clear, 论文解读视频上传B站, youtube, 西瓜视频(同步到抖音)☆251Updated 2 years ago
- ☆220Updated 10 months ago
- Keras implement of Finite Scalar Quantization☆83Updated 2 years ago
- ☆23Updated 3 years ago
- The pure and clear PyTorch Distributed Training Framework.☆274Updated last year
- Unofficial Implementation of MLP-Mixer, gMLP, resMLP, Vision Permutator, S2MLP, S2MLPv2, RaftMLP, HireMLP, ConvMLP, AS-MLP, SparseMLP, Co…☆170Updated 3 years ago
- ☆365Updated 2 years ago
- A template for rapid deployment of PyTorch models.☆67Updated 3 years ago
- PyTorch implementation of the Differential-Transformer architecture for sequence modeling, specifically tailored as a decoder-only model …☆83Updated last year
- ☆85Updated 7 months ago
- Deep Learning Theory and Practice☆26Updated 2 years ago
- [ICLR 2022] Official implementation of cosformer-attention in cosFormer: Rethinking Softmax in Attention☆196Updated 3 years ago
- Faster version of AugShuffleNet without channel shuffle, computes partially, crossovers swiftly☆11Updated 9 months ago
- Pytorch Lightning入门中文教程,转载请注明来源。(当初是写着玩的,建议看完MNIST这个例子再上手)☆229Updated 5 years ago