google / tslLinks
☆98Updated this week
Alternatives and similar repositories for tsl
Users that are interested in tsl are comparing it to the libraries listed below
Sorting:
- Open source cross-platform compiler for compute-intensive loops used in AI algorithms, from Microsoft Research☆109Updated last year
- Pybind11 bindings for the Abseil C++ Common Libraries☆27Updated this week
- ☆16Updated 2 years ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆119Updated this week
- A lightweight memory allocator for hardware-accelerated machine learning☆169Updated 5 months ago
- A GPU-driven system framework for scalable AI applications☆118Updated 7 months ago
- ☆83Updated this week
- ☆61Updated last week
- AMD’s C++ library for accelerating tensor primitives☆44Updated this week
- oneAPI Collective Communications Library (oneCCL)☆245Updated 2 weeks ago
- ☆58Updated this week
- A Visual Studio Code extension for building and debugging CUDA applications.☆90Updated last week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆68Updated this week
- Unified Collective Communication Library☆275Updated last week
- A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")☆353Updated this week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆26Updated this week
- SYCL Reference Manual☆28Updated last year
- [DEPRECATED] Moved to ROCm/rocm-systems repo☆84Updated this week
- MLIR-based partitioning system☆132Updated this week
- Machine learning for machine code.☆90Updated last month
- Next generation LAPACK implementation for ROCm platform☆111Updated this week
- Bandwidth test for ROCm☆65Updated last week
- oneAPI Specification source files☆208Updated last week
- LLM training in simple, raw C/CUDA☆104Updated last year
- TransferBench is a utility capable of benchmarking simultaneous copies between user-specified devices (CPUs/GPUs)☆46Updated last week
- Stores documents and resources used by the OpenXLA developer community☆129Updated last year
- ☆313Updated 2 months ago
- SYCL-ML is a C++ library, implementing classical machine learning algorithms using SYCL.☆66Updated 5 years ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆148Updated last week
- Tests and benchmarks for cudnn (and in the future, other nvidia libraries)☆53Updated 4 years ago