calculon-ai / calculon
☆129Updated last year
Alternatives and similar repositories for calculon:
Users that are interested in calculon are comparing it to the libraries listed below
- Synthesizer for optimal collective communication algorithms☆104Updated 11 months ago
- LLM serving cluster simulator☆93Updated 10 months ago
- Microsoft Collective Communication Library☆340Updated last year
- ASTRA-sim2.0: Modeling Hierarchical Networks and Disaggregated Systems for Large-model Training at Scale☆327Updated 2 weeks ago
- A baseline repository of Auto-Parallelism in Training Neural Networks☆143Updated 2 years ago
- ☆75Updated 2 years ago
- LLM Inference analyzer for different hardware platforms☆54Updated this week
- Repository for MLCommons Chakra schema and tools☆89Updated 2 weeks ago
- Curated collection of papers in machine learning systems☆256Updated last week
- Automatic Mapping Generation, Verification, and Exploration for ISA-based Spatial Accelerators☆107Updated 2 years ago
- TACCL: Guiding Collective Algorithm Synthesis using Communication Sketches☆71Updated last year
- ☆121Updated 8 months ago
- An Efficient Pipelined Data Parallel Approach for Training Large Model☆74Updated 4 years ago
- Repository for MLCommons Chakra schema and tools☆39Updated last year
- ☆79Updated 3 months ago
- ☆132Updated 2 months ago
- Paella: Low-latency Model Serving with Virtualized GPU Scheduling☆58Updated 10 months ago
- AlpaServe: Statistical Multiplexing with Model Parallelism for Deep Learning Serving (OSDI 23)☆82Updated last year
- Automatic Schedule Exploration and Optimization Framework for Tensor Computations☆175Updated 2 years ago
- This repository is established to store personal notes and annotated papers during daily research.☆112Updated last week
- NCCL Profiling Kit☆127Updated 8 months ago
- ☆100Updated last week
- A Vectorized N:M Format for Unleashing the Power of Sparse Tensor Cores☆50Updated last year
- REEF is a GPU-accelerated DNN inference serving system that enables instant kernel preemption and biased concurrent execution in GPU sche…☆92Updated 2 years ago
- ☆36Updated last year
- Magicube is a high-performance library for quantized sparse matrix operations (SpMM and SDDMM) of deep learning on Tensor Cores.☆86Updated 2 years ago
- Chimera: Efficiently Training Large-Scale Neural Networks with Bidirectional Pipelines.☆60Updated last year
- ☆90Updated 10 months ago
- ☆23Updated 2 years ago
- ☆21Updated 2 years ago