haifeng-jin / keras-benchmarksLinks
☆12Updated last year
Alternatives and similar repositories for keras-benchmarks
Users that are interested in keras-benchmarks are comparing it to the libraries listed below
Sorting:
- A stand-alone implementation of several NumPy dtype extensions used in machine learning.☆301Updated this week
- This is a port of Mistral-7B model in JAX☆32Updated last year
- Cuda extensions for PyTorch☆11Updated 5 months ago
- Collection of scripts to build PyTorch and the domain libraries from source.☆12Updated last month
- Notes and artifacts from the ONNX steering committee☆26Updated this week
- Additional multi-backend functionality for Keras 3.☆16Updated last year
- ☆21Updated 7 months ago
- A user-friendly tool chain that enables the seamless execution of ONNX models using JAX as the backend.☆122Updated 3 weeks ago
- JAX-Toolbox☆348Updated this week
- This repository hosts code that supports the testing infrastructure for the PyTorch organization. For example, this repo hosts the logic …☆102Updated this week
- ☆52Updated last year
- LLM training in simple, raw C/CUDA☆105Updated last year
- TorchFix - a linter for PyTorch-using code with autofix support☆148Updated last month
- Neural Networks for JAX☆84Updated last year
- High-Performance SGEMM on CUDA devices☆107Updated 8 months ago
- jax-triton contains integrations between JAX and OpenAI Triton☆426Updated this week
- Experiment of using Tangent to autodiff triton☆80Updated last year
- Benchmarks of different devices I have come across☆33Updated last month
- A FlashAttention implementation for JAX with support for efficient document mask computation and context parallelism.☆144Updated 6 months ago
- ☆115Updated last month
- Memory Optimizations for Deep Learning (ICML 2023)☆108Updated last year
- Tokamax: A GPU and TPU kernel library.☆87Updated this week
- ☆15Updated last week
- Parallel framework for training and fine-tuning deep neural networks☆65Updated 6 months ago
- JMP is a Mixed Precision library for JAX.☆207Updated 8 months ago
- Material for the SC22 Deep Learning at Scale Tutorial☆41Updated 2 years ago
- ☆189Updated 2 weeks ago
- Repository of machine learning benchmarks☆42Updated this week
- A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")☆357Updated this week
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆16Updated last week