A Data-Centric Compiler for Machine Learning
☆85 Dec 14, 2025 Updated 2 months ago
Alternatives and similar repositories for daceml
Users who are interested in daceml are comparing it to the libraries listed below.
- DaCe - Data Centric Parallel Programming ☆576 Updated this week
- ☆17 Sep 15, 2021 Updated 4 years ago
- ☆14 Nov 7, 2025 Updated 3 months ago
- Data-Centric MLIR dialect ☆46 Oct 16, 2023 Updated 2 years ago
- C++/MPI proxies for distributed training of deep neural networks. ☆15 Jun 18, 2022 Updated 3 years ago
- PyTorch compilation tutorial covering TorchScript, torch.fx, and Slapo ☆17 Mar 13, 2023 Updated 2 years ago
- ☆41 Oct 9, 2025 Updated 4 months ago
- Sparsity support for PyTorch ☆38 Mar 22, 2025 Updated 11 months ago
- Triton version of GQA flash attention, based on the tutorial ☆12 Aug 4, 2024 Updated last year
- Hardware Accelerated MWPM decoder for Quantum Error Correction ☆18 Mar 23, 2025 Updated 11 months ago
- ☆10 Mar 2, 2024 Updated 2 years ago
- Distributed Communication-Optimal LU-factorization Algorithm ☆12 Aug 1, 2021 Updated 4 years ago
- Fortran source code analysis tool ☆11 Nov 11, 2020 Updated 5 years ago
- ☆11 Apr 6, 2024 Updated last year
- Compiler toolchain to enable generation of high-level DSLs for geophysical fluid dynamics models ☆29 Mar 22, 2023 Updated 2 years ago
- Research and development for optimizing transformers ☆131 Feb 16, 2021 Updated 5 years ago
- The (open-source part of) code to reproduce "BPPSA: Scaling Back-propagation by Parallel Scan Algorithm". ☆13 Jun 7, 2021 Updated 4 years ago
- A flexible (Python-based) quantum program compiler ☆15 Feb 27, 2026 Updated last week
- NPBench - A Benchmarking Suite for High-Performance NumPy ☆91 Jan 28, 2026 Updated last month
- A common package to provide example files (e.g., ROOT) for testing and developing packages against. ☆13 Feb 25, 2026 Updated last week
- Ok-Topk is a scheme for distributed training with sparse gradients. Ok-Topk integrates a novel sparse allreduce algorithm (less than 6k c… ☆27 Dec 10, 2022 Updated 3 years ago
- Graph-indexed Pandas DataFrames for analyzing hierarchical performance data ☆34 Jan 30, 2026 Updated last month
- dirty toolkit ☆20 Nov 1, 2020 Updated 5 years ago
- bosonic quantum circuits ☆18 Jul 13, 2025 Updated 7 months ago
- An Attention Superoptimizer ☆22 Jan 20, 2025 Updated last year
- This repository has moved, please visit https://github.com/ai2cm/pace for the latest development of fv3core. ☆13 Dec 21, 2022 Updated 3 years ago
- Streaming Message Interface: High-Performance Distributed Memory Programming on Reconfigurable Hardware ☆15 Mar 1, 2022 Updated 4 years ago
- Medical ML Benchmark ☆11 May 16, 2023 Updated 2 years ago
- ☆13 Mar 6, 2023 Updated 3 years ago
- Automated DNN generation for fuzz testing and more ☆143 Jan 14, 2025 Updated last year
- Tool for the deployment and analysis of TinyML applications on TFLM and MicroTVM backends ☆33 Updated this week
- Implementation of the FV3GFS / SHiELD atmospheric model in Python ☆38 Apr 23, 2024 Updated last year
- Magicube is a high-performance library for quantized sparse matrix operations (SpMM and SDDMM) of deep learning on Tensor Cores. ☆91 Nov 23, 2022 Updated 3 years ago
- ☆14 Mar 10, 2024 Updated last year
- MLIR tools and dialect for GraphBLAS ☆18 Mar 30, 2022 Updated 3 years ago
- A Deep-Reinforcement-Learning-Based Scheduler for FPGA HLS ☆15 Feb 27, 2021 Updated 5 years ago
- ImageNet training code of Res2Net ☆15 Nov 2, 2020 Updated 5 years ago
- Switch-based Training Acceleration for Machine Learning (SwitchML) ☆16 Apr 13, 2021 Updated 4 years ago
- A backend-dispatchable version of NumPy. ☆19 Feb 27, 2021 Updated 5 years ago