foocker / deeplearningtheory
☆255Updated 2 weeks ago
Alternatives and similar repositories for deeplearningtheory:
Users that are interested in deeplearningtheory are comparing it to the libraries listed below
- [ICML 2022, Oral] The PyTorch Implementation of Adaptive Inertia Methods. The algorithms are based on our paper: "Adaptive Inertia: Dise…☆148Updated 2 years ago
- 看图学大模型☆279Updated 7 months ago
- This is a list of peer-reviewed representative papers on deep learning dynamics (optimization dynamics of neural networks). The success o…☆268Updated 11 months ago
- DeepSpeed教程 & 示例注释 & 学习笔记 (大模型高效训练)☆151Updated last year
- pytorch单精度、半精度、混合精度、单卡、多卡(DP / DDP)、FSDP、DeepSpeed模型训练代码,并对比不同方法的训练速度以及GPU内存的使用☆92Updated last year
- Yet another PyTorch Trainer and some core components for deep learning.☆214Updated 10 months ago
- EasyLiterature is an open-sourced, Python-based command line tool for automatic literature management.☆268Updated 7 months ago
- ☆606Updated 9 months ago
- How to use wandb?☆625Updated last year
- Implement custom operators in PyTorch with cuda/c++☆56Updated 2 years ago
- A Telegram bot to recommend arXiv papers☆259Updated last month
- 一款便捷的抢占显卡脚本☆310Updated 2 months ago
- ☆120Updated last year
- Lossless Training Speed Up by Unbiased Dynamic Data Pruning☆331Updated 6 months ago
- Several simple examples for popular neural network toolkits calling custom CUDA operators.☆1,417Updated 3 years ago
- A collection of phenomenons observed during the scaling of big foundation models, which may be developed into consensus, principles, or l…☆277Updated last year
- LiBai(李白): A Toolbox for Large-Scale Distributed Parallel Training☆397Updated 2 months ago
- The pure and clear PyTorch Distributed Training Framework.☆276Updated last year
- ☆182Updated 5 months ago
- Simple tutorials on Pytorch DDP training☆275Updated 2 years ago
- Cool Papers - Immersive Paper Discovery☆498Updated last week
- Code release for book "Efficient Training in PyTorch"☆50Updated 5 months ago
- Models and examples built with OneFlow☆96Updated 5 months ago
- an implementation of transformer, bert, gpt, and diffusion models for learning purposes☆151Updated 5 months ago
- 主要记录大语言大模型(LLMs) 算法(应用)工程师多模态相关知识☆161Updated 10 months ago
- real Transformer TeraFLOPS on various GPUs☆898Updated last year
- A light-weight script for maintaining a LOT of machine learning experiments.☆91Updated 2 years ago
- An awesome gpu tasks scheduler. 轻量好用的GPU机群任务调度工具。觉得有用可以点个star☆177Updated 2 years ago
- Tutorials for writing high-performance GPU operators in AI frameworks.☆129Updated last year
- cnn☆133Updated 5 years ago