deepseek-ai / profile-dataLinks
Analyze computation-communication overlap in V3/R1.
☆1,124Updated 8 months ago
Alternatives and similar repositories for profile-data
Users that are interested in profile-data are comparing it to the libraries listed below
Sorting:
- Expert Parallelism Load Balancer☆1,315Updated 8 months ago
- A bidirectional pipeline parallelism algorithm for computation-communication overlap in DeepSeek V3/R1 training.☆2,884Updated 8 months ago
- DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling