wangguojim / LargeScale
☆18 · Updated 6 months ago
Related projects
Alternatives and complementary repositories for LargeScale
- Inference framework for MoE layers based on TensorRT with Python binding ☆41 · Updated 3 years ago
- Transformer related optimization, including BERT, GPT ☆17 · Updated last year
- Odysseus: Playground of LLM Sequence Parallelism ☆57 · Updated 5 months ago
- ☆74 · Updated 11 months ago
- Ongoing research training transformer language models at scale, including: BERT & GPT-2 ☆69 · Updated last year
- Distributed DataLoader For Pytorch Based On Ray ☆24 · Updated 3 years ago
- Summary of system papers/frameworks/codes/tools on training or serving large model ☆56 · Updated 11 months ago
- Distributed IO-aware Attention algorithm ☆17 · Updated 3 months ago
- ☆23 · Updated last year