mikex86 / tritonc
Standalone command-line tool for compiling Triton kernels
☆15 Updated 2 months ago
Related projects
Alternatives and complementary repositories for tritonc
- Make triton easier ☆41 Updated 5 months ago
- ☆17 Updated last month
- Explore training for quantized models ☆10 Updated 2 weeks ago
- A place to store reusable transformer components of my own creation or found on the interwebs ☆44 Updated 2 weeks ago
- Experiment of using Tangent to autodiff triton ☆72 Updated 10 months ago
- ☆15 Updated 8 months ago
- FlexAttention w/ FlashAttention3 Support ☆27 Updated last month
- Jax like function transformation engine but micro, microjax ☆26 Updated 3 weeks ago
- RWKV model implementation ☆38 Updated last year
- Latent Large Language Models ☆16 Updated 3 months ago
- RWKV-7: Surpassing GPT ☆47 Updated last week
- Hacks for PyTorch ☆17 Updated last year
- Rust bindings for CTranslate2 ☆13 Updated last year
- ☆18 Updated 7 months ago
- Experiments with BitNet inference on CPU ☆50 Updated 7 months ago
- A tracing JIT compiler for PyTorch ☆12 Updated 2 years ago
- Compression for Foundation Models ☆19 Updated 3 weeks ago
- ☆36 Updated 2 years ago
- Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization, with PyTorch/CUDA ☆35 Updated 8 months ago
- TORCH_LOGS parser for PT2 ☆22 Updated this week
- ☆26 Updated last year
- train with kittens! ☆49 Updated 3 weeks ago
- benchmarking some transformer deployments ☆26 Updated last year
- Utilities for Training Very Large Models ☆56 Updated last month
- Transformer with Mu-Parameterization, implemented in Jax/Flax. Supports FSDP on TPU pods. ☆29 Updated 3 weeks ago
- ☆20 Updated 2 months ago
- Demo of the unit_scaling library, showing how a model can be easily adapted to train in FP8. ☆35 Updated 4 months ago
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer ☆37 Updated 7 months ago