google / tslLinks
☆102Updated this week
Alternatives and similar repositories for tsl
Users that are interested in tsl are comparing it to the libraries listed below
Sorting:
- A lightweight memory allocator for hardware-accelerated machine learning☆179Updated 3 months ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆124Updated 2 weeks ago
- Open source cross-platform compiler for compute-intensive loops used in AI algorithms, from Microsoft Research☆115Updated 2 years ago
- oneAPI Collective Communications Library (oneCCL)☆252Updated 3 weeks ago
- A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")☆368Updated this week
- MLIR-based partitioning system☆157Updated this week
- ☆16Updated 2 years ago
- ☆320Updated 3 weeks ago
- oneAPI Specification source files☆209Updated 3 weeks ago
- SYCL Reference Manual☆29Updated last year
- A Visual Studio Code extension for building and debugging CUDA applications.☆95Updated last week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆26Updated this week
- AMD’s C++ library for accelerating tensor primitives☆47Updated 3 weeks ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆69Updated this week
- ☆130Updated 2 weeks ago
- A GPU-driven system framework for scalable AI applications☆123Updated 11 months ago
- Tests and benchmarks for cudnn (and in the future, other nvidia libraries)☆53Updated 5 years ago
- ☆423Updated last week
- RAPIDS Memory Manager☆670Updated this week
- Unified Collective Communication Library☆286Updated this week
- ☆85Updated last week
- ☆59Updated 3 weeks ago
- Stores documents and resources used by the OpenXLA developer community☆131Updated last year
- Intel® Extension for MLIR. A staging ground for MLIR dialects and tools for Intel devices using the MLIR toolchain.☆147Updated 3 weeks ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆64Updated this week
- Machine learning for machine code.☆94Updated 2 months ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆133Updated this week
- ☆63Updated last month
- ☆44Updated this week
- LLM training in simple, raw C/CUDA☆109Updated last year