foocker / deeplearningtheory
☆250Updated 9 months ago
Alternatives and similar repositories for deeplearningtheory:
Users who are interested in deeplearningtheory are comparing it to the repositories listed below
- ☆598Updated 8 months ago
- Learn large models through diagrams☆260Updated 6 months ago
- Yet another PyTorch Trainer and some core components for deep learning.☆211Updated 9 months ago
- A bag of tricks to speed up your deep learning process☆160Updated 9 months ago
- [ICML 2022, Oral] The PyTorch Implementation of Adaptive Inertia Methods. The algorithms are based on our paper: "Adaptive Inertia: Dise…☆145Updated 2 years ago
- Several simple examples for popular neural network toolkits calling custom CUDA operators.☆1,399Updated 3 years ago
- The pure and clear PyTorch Distributed Training Framework.☆276Updated last year
- Models and examples built with OneFlow☆96Updated 4 months ago
- PaddlePaddle Code Convert Toolkit. A deep learning code conversion tool for PaddlePaddle (飞桨)☆95Updated this week
- ☆100Updated 3 years ago
- PyTorch Project Specification.☆674Updated 3 years ago
- Cool Papers - Immersive Paper Discovery☆469Updated last week
- Implement custom operators in PyTorch with CUDA/C++☆53Updated 2 years ago
- A lightweight deep learning library☆382Updated last month
- An implementation of Transformer, BERT, GPT, and diffusion models for learning purposes☆151Updated 4 months ago
- A simple deep learning framework in pure Python for learning purposes☆434Updated last week
- Triton documentation in Simplified Chinese☆54Updated last month
- This is a list of peer-reviewed representative papers on deep learning dynamics (optimization dynamics of neural networks). The success o…☆262Updated 10 months ago
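- PyTorch model training code for single precision, half precision, mixed precision, single GPU, multi-GPU (DP / DDP), FSDP, and DeepSpeed, with a comparison of the training speed and GPU memory usage of the different methods☆87Updated 11 months ago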
- ☆120Updated last year
- Tutorials for writing high-performance GPU operators in AI frameworks.☆128Updated last year
- Code base and slides for ECE408: Applied Parallel Programming on GPU.☆120Updated 3 years ago
- Simple tutorials on PyTorch DDP training☆273Updated 2 years ago
- real Transformer TeraFLOPS on various GPUs☆896Updated last year
- A convenient script for grabbing available GPUs☆300Updated last month
- How to use wandb?☆611Updated last year
- An awesome GPU task scheduler. A lightweight, easy-to-use task scheduling tool for GPU clusters. If you find it useful, give it a star☆170Updated 2 years ago
- ☆76Updated last year
- A Note for Machine Learning Algorithms☆87Updated 2 years ago
- A self-learning tutorial for CUDA high-performance programming.☆369Updated 2 months ago