qrsch / doubutsu
☆24Updated 11 months ago
Alternatives and similar repositories for doubutsu
Users that are interested in doubutsu are comparing it to the libraries listed below
- Collection of autoregressive model implementations☆85Updated 2 months ago
- A really tiny autograd engine☆94Updated last month
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆101Updated 4 months ago
- DeMo: Decoupled Momentum Optimization☆189Updated 7 months ago
- PTX-Tutorial Written Purely By AIs (Deep Research of OpenAI and Claude 3.7)☆66Updated 3 months ago
- NanoGPT-speedrunning for the poor T4 enjoyers☆68Updated 2 months ago
- Simple Transformer in Jax☆138Updated last year
- Cerule - A Tiny Mighty Vision Model☆66Updated 10 months ago
- Focused on fast experimentation and simplicity☆76Updated 6 months ago
- ☆49Updated last year
- in this repository, i'm going to implement increasingly complex llm inference optimizations☆63Updated last month
- Google TPU optimizations for transformers models☆114Updated 5 months ago
- ☆79Updated last year
- ☆134Updated 10 months ago
- working implementation of DeepSeek MLA☆42Updated 6 months ago
- Implementation of the Llama architecture with RLHF + Q-learning☆165Updated 5 months ago
- PCCL (Prime Collective Communications Library) implements fault tolerant collective communications over IP☆96Updated last month
- supporting pytorch FSDP for optimizers☆82Updated 7 months ago
- Fast, Modern, and Low Precision PyTorch Optimizers☆97Updated this week
- Simplex Random Feature attention, in PyTorch☆74Updated last year
- ☆27Updated last year
- WIP☆93Updated 11 months ago
- research impl of Native Sparse Attention (2502.11089)☆54Updated 4 months ago
- SIMD quantization kernels☆73Updated last week
- Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients.☆198Updated 11 months ago
- ☆46Updated 3 months ago
- Ultra low overhead NVIDIA GPU telemetry plugin for telegraf with memory temperature readings.☆62Updated last year
- ☆40Updated last month
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free☆232Updated 8 months ago
- A repository to unravel the language of GPUs, making their kernel conversations easy to understand☆188Updated last month