RangeKing / tuning_playbook_zh-CN
A playbook for systematically maximizing the performance of deep learning models.
☆25Updated 5 months ago
Related projects ⓘ
Alternatives and complementary repositories for tuning_playbook_zh-CN
- 2024 FinVolution Global Data Science Competition-9th baseline☆18Updated 6 months ago
- pytorch单精度、半精度、混合精度、单卡、多卡(DP / DDP)、FSDP、DeepSpeed模型训练代码,并对比不同方法的训练速度以及GPU内存的使用☆78Updated 8 months ago
- Yet another PyTorch Trainer and some core components for deep learning.☆206Updated 6 months ago
- Implementation of vision transformer. ⭐⭐⭐☆29Updated 3 years ago
- Awesome Colab Projects Collection☆25Updated 10 months ago
- About Code release for "Flowformer: Linearizing Transformers with Conservation Flows" (ICML 2022), https://arxiv.org/pdf/2202.06258.pdf☆305Updated 4 months ago
- auto scrawl for arrive data☆12Updated 2 years ago
- ☆22Updated 2 years ago
- ☆64Updated 2 years ago
- 蜻蜓点论文 Think不Clear, 论文解读视频上传B站, youtube, 西瓜视频(同步到抖音)☆240Updated last year
- ☆15Updated 2 years ago
- [ICLR 2022] Official implementation of cosformer-attention in cosFormer: Rethinking Softmax in Attention☆180Updated last year
- Moved to https://github.com/NUS-HPC-AI-Lab/InfoBatch☆6Updated 10 months ago
- ☆50Updated last year
- The implementation of mixup and mainfold mixup method with standard models(PreActRes, WideRes, Dense) in Cifar10, Cifar100 and SVHN datas…☆42Updated 2 years ago
- Test different pooling method used in CNN for Computer Vision Task☆35Updated 3 years ago
- Training ImageNet / CIFAR models with sota strategies and fancy techniques such as ViT, KD, Rep, etc.☆81Updated 8 months ago
- PyTorch code and checkpoints release for VanillaKD: https://arxiv.org/abs/2305.15781☆71Updated last year
- Lion and Adam optimization comparison☆56Updated last year
- # Unified Normalization (ACM MM'22) By Qiming Yang, Kai Zhang, Chaoxiang Lan, Zhi Yang, Zheyang Li, Wenming Tan, Jun Xiao, and Shiliang P…☆34Updated last year
- AIGCDetectBaseline☆11Updated 4 months ago
- Keras implement of Finite Scalar Quantization☆64Updated last year
- Implementation of Denoising Diffusion Probabilistic Model in MindSpore☆32Updated last year
- Learning Efficient Vision Transformers via Fine-Grained Manifold Distillation. NeurIPS 2022.☆29Updated 2 years ago
- The official repo for CVPR2023 highlight paper "Gradient Norm Aware Minimization Seeks First-Order Flatness and Improves Generalization".☆77Updated last year
- 我的AI学习笔记。包括b站up主deep_thoughts的PyTorch课程笔记和相关代码;北邮深度学习与数字视频PPT代码。☆19Updated 5 months ago
- The pure and clear PyTorch Distributed Training Framework.☆275Updated 10 months ago
- State Space Models☆63Updated 6 months ago
- More light-weight pytorch experiment management library!☆63Updated last year
- A repository for DenseSSMs☆88Updated 7 months ago