☆576Jul 11, 2024Updated last year
Alternatives and similar repositories for HighPerfLLMs2024
Users that are interested in HighPerfLLMs2024 are comparing it to the libraries listed below.
- A simple, performant and scalable Jax LLM!☆2,255Updated this week
- Minimal yet performant LLM examples in pure JAX☆251Apr 10, 2026Updated 3 weeks ago
- ☆308Jul 15, 2024Updated last year
- Accelerate, Optimize performance with streamlined training and serving options with JAX.☆359Updated this week
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax☆704Jan 26, 2026Updated 3 months ago
- JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs wel…☆431Jan 5, 2026Updated 3 months ago
- ☆353Apr 13, 2026Updated 2 weeks ago
- Pax is a Jax-based machine learning framework for training large scale models. Pax allows for advanced and fully configurable experimenta…☆549Apr 23, 2026Updated last week
- Home for "How To Scale Your Model", a short blog-style textbook about scaling LLMs on TPUs☆935Mar 15, 2026Updated last month
- Package of Pathways-on-Cloud utilities☆27Apr 24, 2026Updated last week
- A simple library for scaling up JAX programs☆146Nov 4, 2025Updated 5 months ago
- A simplified and automated orchestration workflow to perform ML end-to-end (E2E) model tests and benchmarking on Cloud VMs across differe…☆61Updated this week
- Train very large language models in Jax.☆209Oct 21, 2023Updated 2 years ago
- JAX-Toolbox☆404Updated this week
- Minimal but scalable implementation of large language models in JAX☆35Nov 28, 2025Updated 5 months ago
- ☆342Updated this week
- seqax = sequence modeling + JAX☆188Jul 23, 2025Updated 9 months ago
- ☆28Jun 3, 2024Updated last year
- xpk (Accelerated Processing Kit, pronounced x-p-k,) is a software tool to help Cloud developers to orchestrate training jobs on accelerat…☆180Apr 21, 2026Updated last week
- torchprime is a reference model implementation for PyTorch on TPU.☆47Mar 3, 2026Updated last month
- ☆94Updated this week
- Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers.☆19Jul 24, 2025Updated 9 months ago
- ☆93Jul 5, 2024Updated last year
- jax-triton contains integrations between JAX and OpenAI Triton☆447Apr 23, 2026Updated last week
- Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl…☆2,520Aug 13, 2024Updated last year
- JAX Synergistic Memory Inspector☆187Jul 16, 2024Updated last year
- Tokamax: A GPU and TPU kernel library.☆205Apr 23, 2026Updated last week
- JAX bindings for Flash Attention v2☆105Feb 28, 2026Updated 2 months ago
- ☆35Updated this week
- GPU programming related news and material links☆2,114Mar 8, 2026Updated last month
- A flexible and efficient implementation of Flash Attention 2.0 for JAX, supporting multiple backends (GPU/TPU/CPU) and platforms (Triton/…☆34Mar 4, 2025Updated last year
- What would you do with 1000 H100s...☆1,169Jan 10, 2024Updated 2 years ago
- A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton.☆600Aug 12, 2025Updated 8 months ago
- A zero-to-one guide on scaling modern transformers with n-dimensional parallelism.☆122Dec 29, 2025Updated 4 months ago
- ☆291Apr 20, 2026Updated last week
- Minimalistic 4D-parallelism distributed training framework for education purpose☆2,159Aug 26, 2025Updated 8 months ago
- Tidy autoregressive inference in JAX☆15Sep 1, 2025Updated 8 months ago
- A JAX research toolkit for building, editing, and visualizing neural networks.☆1,881Jun 22, 2025Updated 10 months ago
- Helpful tools and examples for working with flex-attention☆1,179Apr 13, 2026Updated 2 weeks ago