sgl-project / sglang-jax
JAX backend for SGL
☆71 · Updated this week
Alternatives and similar repositories for sglang-jax
Users interested in sglang-jax are comparing it to the libraries listed below.
- ByteCheckpoint: A Unified Checkpointing Library for LFMs ☆249 · Updated 3 months ago
- ☆95 · Updated 6 months ago
- Allow torch tensor memory to be released and resumed later ☆144 · Updated this week
- Genai-bench is a powerful benchmark tool designed for comprehensive token-level performance evaluation of large language model (LLM) serv… ☆215 · Updated last week
- Bridge Megatron-Core to Hugging Face/Reinforcement Learning ☆129 · Updated this week
- PyTorch bindings for CUTLASS grouped GEMM. ☆124 · Updated 4 months ago
- DeepXTrace is a lightweight tool for precisely diagnosing slow ranks in DeepEP-based environments. ☆58 · Updated 2 weeks ago
- DeeperGEMM: crazy optimized version ☆71 · Updated 5 months ago
- Train speculative decoding models effortlessly and port them smoothly to SGLang serving. ☆417 · Updated this week
- Utility scripts for PyTorch (e.g. a memory profiler that understands more low-level allocations such as NCCL) ☆55 · Updated last month
- ☆300 · Updated last week
- ☆148 · Updated 7 months ago
- Make SGLang go brrr ☆33 · Updated last week
- ☆64 · Updated 5 months ago
- [ICLR 2025] Breaking Throughput-Latency Trade-off for Long Sequences with Speculative Decoding ☆128 · Updated 10 months ago
- A simple calculation for LLM MFU. ☆46 · Updated last month
- Virtualized Elastic KV Cache for Dynamic GPU Sharing and Beyond ☆99 · Updated last week
- Code for the paper [ICLR 2025 Oral] FlexPrefill: A Context-Aware Sparse Attention Mechanism for Efficient Long-Sequence Inference ☆142 · Updated 4 months ago
- [OSDI '24] Serving LLM-based Applications Efficiently with Semantic Variable ☆184 · Updated last year
- [NeurIPS 2024] Efficient LLM Scheduling by Learning to Rank ☆59 · Updated 11 months ago
- Perplexity GPU Kernels ☆482 · Updated 3 weeks ago
- Pipeline Parallelism Emulation and Visualization ☆67 · Updated 3 months ago
- 🔥 LLM-powered GPU kernel synthesis: train models to convert PyTorch ops into optimized Triton kernels via SFT+RL. Multi-turn compilation… ☆79 · Updated last week
- ☆43 · Updated last year
- An experimental communicating attention kernel based on DeepEP. ☆34 · Updated 2 months ago
- ☆78 · Updated 5 months ago
- [NeurIPS 2025] Scaling Speculative Decoding with Lookahead Reasoning ☆44 · Updated 2 weeks ago
- Official repository for DistFlashAttn: Distributed Memory-efficient Attention for Long-context LLMs Training ☆215 · Updated last year
- Best practices for training DeepSeek, Mixtral, Qwen, and other MoE models using Megatron Core ☆102 · Updated 3 weeks ago
- A toolchain built around Megatron-LM for distributed training ☆67 · Updated this week
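One entry above is a simple calculator for LLM MFU (Model FLOPs Utilization). The standard back-of-the-envelope formula such tools use can be sketched as follows; the function name and example numbers here are illustrative, not taken from that repository:

```python
def llm_mfu(model_params: float, tokens_per_sec: float, peak_flops_per_sec: float) -> float:
    """Approximate training MFU (illustrative sketch, not from the listed repo).

    Uses the common rule of thumb that one training step costs about
    6 * N FLOPs per token for a dense model with N parameters
    (2N forward + 4N backward).
    """
    achieved_flops_per_sec = 6 * model_params * tokens_per_sec
    return achieved_flops_per_sec / peak_flops_per_sec

# Example: a 7B dense model training at 4000 tokens/s per GPU on a GPU
# with 312 TFLOP/s of BF16 peak throughput (hypothetical numbers).
mfu = llm_mfu(7e9, 4000, 312e12)  # → ~0.54
```

Inference-side MFU is often computed the same way with 2 * N FLOPs per generated token, since there is no backward pass.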