triton-lang / triton
Development repository for the Triton language and compiler
☆13,443 Updated this week
Related projects
Alternatives and complementary repositories for triton
- Fast and memory-efficient exact attention ☆14,279 Updated this week
- Ongoing research training transformer models at scale ☆10,595 Updated this week
- Transformer related optimization, including BERT, GPT ☆5,890 Updated 7 months ago
- AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (N… ☆4,561 Updated 3 weeks ago
- A machine learning compiler for GPUs, CPUs, and ML accelerators ☆2,710 Updated this week
- Hackable and optimized Transformers building blocks, supporting a composable construction. ☆8,660 Updated this week
- 🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (i… ☆7,958 Updated this week
- Accessible large language models via k-bit quantization for PyTorch. ☆6,299 Updated this week
- Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more ☆30,532 Updated this week
- Flax is a neural network library for JAX that is designed for flexibility. ☆6,142 Updated this week
- CUDA Templates for Linear Algebra Subroutines ☆5,679 Updated this week
- Open standard for machine learning interoperability ☆17,949 Updated this week
- Open deep learning compiler stack for cpu, gpu and specialized accelerators ☆11,798 Updated this week
- Tensor library for machine learning ☆11,233 Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆30,423 Updated this week
- The Triton Inference Server provides an optimized cloud and edge inferencing solution. ☆8,348 Updated this week
- PyTorch extensions for high performance and large scale training. ☆3,195 Updated last week
- NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source compone… ☆10,820 Updated 2 weeks ago
- Train transformer language models with reinforcement learning. ☆10,086 Updated this week
- tiktoken is a fast BPE tokeniser for use with OpenAI's models. ☆12,427 Updated last month
- A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training ☆20,199 Updated 3 months ago
- Large Language Model Text Generation Inference ☆9,122 Updated this week
- A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Auto… ☆12,150 Updated this week
- Inference Llama 2 in one file of pure C ☆17,476 Updated 3 months ago