baco-authors / baco
☆16Updated last year
Alternatives and similar repositories for baco:
Users that are interested in baco are comparing it to the libraries listed below
- ☆29Updated 3 years ago
- ☆14Updated 2 years ago
- Differentiable Combinatorial Scheduling at Scale (ICML'24). Mingju Liu, Yingjie Li, Jiaqi Yin, Zhiru Zhang, Cunxi Yu.☆19Updated 4 months ago
- ☆14Updated last year
- ☆19Updated this week
- Code base for OOPSLA'24 paper: UniSparse: An Intermediate Language for General Sparse Format Customization☆30Updated 4 months ago
- ☆30Updated 2 years ago
- A unified programming framework for high and portable performance across FPGAs and GPUs☆11Updated this week
- one-shot-tuner☆8Updated 2 years ago
- Sparse kernels for GNNs based on TVM☆16Updated 4 years ago
- SMT-LIB benchmarks for shape computations from deep learning models in PyTorch☆18Updated 2 years ago
- Code released to accompany the ISCA paper: "T4: Compiling Sequential Code for Effective Speculative Parallelization in Hardware"☆28Updated 3 years ago
- Supplemental materials for The ASPLOS 2025 / EuroSys 2025 Contest on Intra-Operator Parallelism for Distributed Deep Learning☆23Updated 3 months ago
- GPU Performance Advisor☆64Updated 2 years ago
- A PIM instrumentation, compilation, execution, simulation, and evaluation repository for BLIMP-style architectures.☆18Updated 2 years ago
- A source-to-source compiler for optimizing CUDA dynamic parallelism by aggregating launches☆15Updated 5 years ago
- ☆40Updated this week
- A retargetable and extensible synthesis-based compiler for modern hardware architectures☆10Updated this week
- Implementation of FusedMM method for IPDPS 2021 paper titled "FusedMM: A Unified SDDMM-SpMM Kernel for Graph Embedding and Graph Neural N…☆30Updated 2 years ago
- ☆9Updated 3 years ago
- A graph linear algebra overlay☆52Updated last year
- Artifact for PPoPP20 "Understanding and Bridging the Gaps in Current GNN Performance Optimizations"☆39Updated 3 years ago
- cuASR: CUDA Algebra for Semirings☆35Updated 2 years ago
- Data-Centric MLIR dialect☆40Updated last year
- Graphiler is a compiler stack built on top of DGL and TorchScript which compiles GNNs defined using user-defined functions (UDFs) into ef…☆61Updated 2 years ago
- Singular Binarized Neural Network based on GPU Bit Operations (see our SC-19 paper)☆15Updated 4 years ago
- Hop-Wise Graph Attention for Scalable and Generalizable Learning on Circuits☆26Updated 7 months ago
- HeteroCL-MLIR dialect for accelerator design☆41Updated 6 months ago
- ☆18Updated last month
- ☆42Updated 11 months ago