haifeng-jin / keras-benchmarks

☆12

Alternatives and similar repositories for keras-benchmarks

Users who are interested in keras-benchmarks are comparing it to the repositories listed below.

AakashKumarNain / mistral_jax
This is a port of the Mistral-7B model to JAX
☆32 Updated last year
jax-ml / ml_dtypes
A stand-alone implementation of several NumPy dtype extensions used in machine learning.
☆281 Updated last week
google / jaxonnxruntime
A user-friendly tool chain that enables the seamless execution of ONNX models using JAX as the backend.
☆116 Updated this week
alexzhang13 / Triton-Puzzles-Solutions
Personal solutions to the Triton Puzzles
☆19 Updated last year
drisspg / driss_torch
CUDA extensions for PyTorch
☆11 Updated 2 months ago
NVIDIA / JAX-Toolbox
JAX-Toolbox
☆323 Updated this week
yixiaoer / einshard
Einsum-like high-level array sharding API for JAX
☆35 Updated last year
facebookresearch / GCD
Computing the greatest common divisor with transformers, source code for the paper https://arxiv.org/abs/2308.15594
☆15 Updated last year
lianakoleva / no-libtorch-compile
☆21 Updated 4 months ago
iree-org / iree-jax
☆52 Updated 11 months ago
Quansight / torch-build
Collection of scripts to build PyTorch and the domain libraries from source.
☆12 Updated last month
srush / anynp
Proof-of-concept of global switching between numpy/jax/pytorch in a library.
☆18 Updated last year
IntelLabs / EquiTriton
EquiTriton is a project that seeks to implement high-performance kernels for commonly used building blocks in equivariant neural networks…
☆62 Updated last week
salykova / sgemm.cu
High-Performance SGEMM on CUDA devices
☆97 Updated 5 months ago
pytorch-labs / torchfix
TorchFix - a linter for PyTorch-using code with autofix support
☆143 Updated 5 months ago
NVIDIA / jaxpp
JaxPP is a library for JAX that enables flexible MPMD pipeline parallelism for large-scale LLM training
☆51 Updated last week
srush / triton-autodiff
An experiment in using Tangent to autodiff Triton
☆79 Updated last year
intel / intel-extension-for-openxla
☆48 Updated last month
facebookresearch / MODel_opt
Memory Optimizations for Deep Learning (ICML 2023)
☆98 Updated last year
gevtushenko / llm.c
LLM training in simple, raw C/CUDA
☆99 Updated last year
openxla / community
Stores documents and resources used by the OpenXLA developer community
☆126 Updated 11 months ago
NERSC / sc22-dl-tutorial
Material for the SC22 Deep Learning at Scale Tutorial
☆41 Updated 2 years ago
pytorch / test-infra
This repository hosts code that supports the testing infrastructure for the PyTorch organization. For example, this repo hosts the logic …
☆96 Updated this week
openxla / shardy
MLIR-based partitioning system
☆105 Updated this week
axonn-ai / axonn
A parallel framework for training deep neural networks
☆62 Updated 4 months ago
pytorch-labs / triton-cpu
An experimental CPU backend for Triton (https://github.com/openai/triton)
☆43 Updated 4 months ago
NVIDIA / Fuser
A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")
☆343 Updated this week
che-shr-cat / Deep_Learning_with_JAX
Notebooks for the "Deep Learning with JAX" book
☆151 Updated last month
cgarciae / nnx
Neural Networks for JAX
☆84 Updated 9 months ago
KernelTuner / kernel_tuner_tutorial
A hands-on introduction to tuning GPU kernels using Kernel Tuner https://github.com/KernelTuner/kernel_tuner/
☆31 Updated 3 months ago