haifeng-jin / keras-benchmarksLinks
☆12Updated last year
Alternatives and similar repositories for keras-benchmarks
Users that are interested in keras-benchmarks are comparing it to the libraries listed below
Sorting:
- This is a port of Mistral-7B model in JAX☆32Updated last year
- A stand-alone implementation of several NumPy dtype extensions used in machine learning.☆281Updated last week
- A user-friendly tool chain that enables the seamless execution of ONNX models using JAX as the backend.☆116Updated this week
- Personal solutions to the Triton Puzzles☆19Updated last year
- Cuda extensions for PyTorch☆11Updated 2 months ago
- JAX-Toolbox☆323Updated this week
- Einsum-like high-level array sharding API for JAX☆35Updated last year
- Computing the greatest common divisor with transformers, source code for the paper https//arxiv.org/abs/2308.15594☆15Updated last year
- ☆21Updated 4 months ago
- ☆52Updated 11 months ago
- Collection of scripts to build PyTorch and the domain libraries from source.☆12Updated last month
- Proof-of-concept of global switching between numpy/jax/pytorch in a library.☆18Updated last year
- EquiTriton is a project that seeks to implement high-performance kernels for commonly used building blocks in equivariant neural networks…☆62Updated last week
- High-Performance SGEMM on CUDA devices☆97Updated 5 months ago
- TorchFix - a linter for PyTorch-using code with autofix support☆143Updated 5 months ago
- JaxPP is a library for JAX that enables flexible MPMD pipeline parallelism for large-scale LLM training☆51Updated last week
- Experiment of using Tangent to autodiff triton☆79Updated last year
- ☆48Updated last month
- Memory Optimizations for Deep Learning (ICML 2023)☆98Updated last year
- LLM training in simple, raw C/CUDA☆99Updated last year
- Stores documents and resources used by the OpenXLA developer community☆126Updated 11 months ago
- Material for the SC22 Deep Learning at Scale Tutorial☆41Updated 2 years ago
- This repository hosts code that supports the testing infrastructure for the PyTorch organization. For example, this repo hosts the logic …☆96Updated this week
- MLIR-based partitioning system☆105Updated this week
- A parallel framework for training deep neural networks☆62Updated 4 months ago
- An experimental CPU backend for Triton (https//github.com/openai/triton)☆43Updated 4 months ago
- A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")☆343Updated this week
- Notebooks for the "Deep Learning with JAX" book☆151Updated last month
- Neural Networks for JAX☆84Updated 9 months ago
- A hands-on introduction to tuning GPU kernels using Kernel Tuner https://github.com/KernelTuner/kernel_tuner/☆31Updated 3 months ago