hyhieu / easy_pybind
☆33 Updated 3 months ago
Related projects:
- ☆68 Updated 2 months ago
- ☆53 Updated 8 months ago
- ☆82 Updated 6 months ago
- ☆124 Updated 7 months ago
- Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters ☆94 Updated 2 weeks ago
- Scalable neural net training via automatic normalization in the modular norm. ☆108 Updated last month
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training ☆110 Updated 5 months ago
- Experiment of using Tangent to autodiff triton ☆66 Updated 7 months ago
- JAX bindings for Flash Attention v2 ☆76 Updated 2 months ago
- Solve puzzles. Learn CUDA. ☆53 Updated 9 months ago
- Explorations into the recently proposed Taylor Series Linear Attention ☆85 Updated last month
- WIP ☆76 Updated last month
- σ-GPT: A New Approach to Autoregressive Models ☆53 Updated last month
- Clarity: A Minimalist Website Template for AI Research ☆36 Updated 3 weeks ago
- ring-attention experiments ☆89 Updated 5 months ago
- CUDA implementation of autoregressive linear attention, with all the latest research findings ☆43 Updated last year
- ☆48 Updated 3 months ago
- Gpu benchmark ☆35 Updated 2 weeks ago
- ☆124 Updated last week
- Visualizations of the theory behind diffusion models. ☆63 Updated 5 months ago
- Collection of autoregressive model implementation ☆62 Updated 2 weeks ago
- PyTorch half precision gemm lib w/ fused optional bias + optional relu/gelu