google / tslLinks
☆98Updated last week
Alternatives and similar repositories for tsl
Users that are interested in tsl are comparing it to the libraries listed below
Sorting:
- A lightweight memory allocator for hardware-accelerated machine learning☆157Updated 4 months ago
- ☆16Updated 2 years ago
- ☆95Updated this week
- Open source cross-platform compiler for compute-intensive loops used in AI algorithms, from Microsoft Research☆109Updated last year
- ☆312Updated 3 weeks ago
- oneAPI Collective Communications Library (oneCCL)☆241Updated this week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆119Updated this week
- A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")☆346Updated this week
- MLIR-based partitioning system☆115Updated this week
- ☆78Updated this week
- Bazel wrapper around the pybind11 repository☆107Updated 2 weeks ago
- Pybind11 bindings for the Abseil C++ Common Libraries☆27Updated 2 weeks ago
- Unified Collective Communication Library☆263Updated last week
- Machine learning for machine code.☆91Updated 3 weeks ago
- ☆420Updated this week
- RAPIDS Memory Manager☆603Updated this week
- ☆58Updated last week
- Monorepo for the OpenCilk compiler. Forked from llvm/llvm-project and based on Tapir/LLVM.☆114Updated last week
- LLM training in simple, raw C/CUDA☆103Updated last year
- oneAPI Specification source files☆206Updated last week
- A GPU-driven system framework for scalable AI applications☆117Updated 6 months ago
- ☆62Updated last year
- ☆61Updated this week
- ☆30Updated 3 weeks ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆68Updated this week
- Starlark implementation of bazel rules for CUDA.☆112Updated this week
- ☆70Updated 4 months ago
- Next generation LAPACK implementation for ROCm platform☆108Updated this week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆26Updated this week
- Experiments and prototypes associated with IREE or MLIR☆54Updated last year