tingshua-yts / BetterDLLinks
☆37Updated 2 years ago
Alternatives and similar repositories for BetterDL
Users that are interested in BetterDL are comparing it to the libraries listed below
Sorting:
- The road to hack SysML and become an system expert☆500Updated last year
- how to learn PyTorch and OneFlow☆459Updated last year
- Inference code for LLaMA models☆127Updated 2 years ago
- ☆143Updated last year
- ☆139Updated last year
- [USENIX ATC '24] Accelerating the Training of Large Language Models using Efficient Activation Rematerialization and Optimal Hybrid Paral…☆66Updated last year
- A simple deep learning framework that supports automatic differentiation and GPU acceleration.☆59Updated 2 years ago
- A tutorial for CUDA&PyTorch☆161Updated 9 months ago
- llm theoretical performance analysis tools and support params, flops, memory and latency analysis.☆111Updated 4 months ago
- Simple Dynamic Batching Inference☆145Updated 3 years ago
- ☆129Updated 10 months ago
- Transformer related optimization, including BERT, GPT☆59Updated 2 years ago
- Models and examples built with OneFlow☆100Updated last year
- Tutorials for writing high-performance GPU operators in AI frameworks.☆134Updated 2 years ago
- ☆152Updated 10 months ago
- learning how CUDA works☆334Updated 8 months ago
- A light llama-like llm inference framework based on the triton kernel.☆161Updated last month
- A high-performance distributed deep learning system targeting large-scale and automated distributed training.☆326Updated 3 months ago
- ☆49Updated 5 years ago
- UltraScale Playbook 中文版☆87Updated 7 months ago
- Implement custom operators in PyTorch with cuda/c++☆73Updated 2 years ago
- ☆619Updated last year
- LiBai(李白): A Toolbox for Large-Scale Distributed Parallel Training☆407Updated 3 months ago
- ☆511Updated 2 months ago
- Pipeline-Parallel Lecture: Simplest Dualpipe Implementation.☆27Updated last month
- A pupil in the computer world.(Felix Fu)☆245Updated last year
- ☆43Updated 3 years ago
- A lightweight deep learning library☆391Updated 5 months ago
- DeepLearning Framework Performance Profiling Toolkit☆294Updated 3 years ago
- ☆64Updated last week