wiegerw / nerva
C++ and Python libraries for neural networks.
☆14 Updated 5 months ago
Alternatives and similar repositories for nerva
Users who are interested in nerva are comparing it to the libraries listed below:
- Experiment of using Tangent to autodiff triton ☆78 Updated last year
- Triton Implementation of HyperAttention Algorithm ☆47 Updated last year
- ☆52 Updated 6 months ago
- ☆13 Updated last week
- ☆13 Updated last year
- Prototype routines for GPU quantization written using PyTorch. ☆20 Updated last month
- Implementation of Hyena Hierarchy in JAX ☆10 Updated last year
- Official repository of Sparse ISO-FLOP Transformations for Maximizing Training Efficiency ☆25 Updated 8 months ago
- Make triton easier ☆47 Updated 9 months ago
- CUDA and Triton implementations of Flash Attention with SoftmaxN. ☆68 Updated 10 months ago
- ☆21 Updated last month
- ☆43 Updated last year
- FlexAttention w/ FlashAttention3 Support ☆26 Updated 5 months ago
- Official repository for the paper "Approximating Two-Layer Feedforward Networks for Efficient Transformers" ☆36 Updated last year
- Demo of the unit_scaling library, showing how a model can be easily adapted to train in FP8. ☆45 Updated 8 months ago
- ☆49 Updated last year
- Code for "Accelerating Training with Neuron Interaction and Nowcasting Networks" [to appear at ICLR 2025] ☆18 Updated 3 weeks ago
- Code for the paper: https://arxiv.org/pdf/2309.06979.pdf ☆19 Updated 8 months ago
- JAX/Flax implementation of the Hyena Hierarchy ☆34 Updated last year
- JORA: JAX Tensor-Parallel LoRA Library (ACL 2024) ☆33 Updated 11 months ago
- A Data-Centric Compiler for Machine Learning ☆82 Updated last year
- Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters ☆125 Updated 4 months ago
- A MAD laboratory to improve AI architecture designs 🧪 ☆108 Updated 3 months ago
- PyTorch centric eager mode debugger ☆46 Updated 3 months ago
- NanoGPT (124M) quality in 2.67B tokens ☆28 Updated last month
- Codes accompanying the paper "LaProp: a Better Way to Combine Momentum with Adaptive Gradient" ☆28 Updated 4 years ago
- Jax like function transformation engine but micro, microjax ☆30 Updated 5 months ago
- Repository for CPU Kernel Generation for LLM Inference ☆25 Updated last year
- Supplementary material for our paper "Compute Trends Across Three Eras of Machine Learning". ☆40 Updated 3 years ago
- Explore training for quantized models ☆17 Updated 2 months ago