deepseek-ai / profile-data
Analyze computation-communication overlap in V3/R1.
☆1,128 Updated 9 months ago
Alternatives and similar repositories for profile-data
Users who are interested in profile-data are comparing it to the libraries listed below.
- Expert Parallelism Load Balancer ☆1,322 Updated 9 months ago
- A bidirectional pipeline parallelism algorithm for computation-communication overlap in DeepSeek V3/R1 training. ☆2,892 Updated 9 months ago
- DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling