qrsch / doubutsu
☆23Updated 9 months ago
Alternatives and similar repositories for doubutsu:
Users that are interested in doubutsu are comparing it to the libraries listed below
- A really tiny autograd engine☆92Updated last year
- Ultra low overhead NVIDIA GPU telemetry plugin for telegraf with memory temperature readings.☆62Updated 9 months ago
- Cerule - A Tiny Mighty Vision Model☆67Updated 8 months ago
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆98Updated 2 months ago
- NanoGPT-speedrunning for the poor T4 enjoyers☆63Updated 2 weeks ago
- Collection of autoregressive model implementation☆85Updated last week
- ☆27Updated 9 months ago
- ☆13Updated 10 months ago
- supporting pytorch FSDP for optimizers☆80Updated 4 months ago
- PTX-Tutorial Written Purely By AIs (Deep Research of Openai and Claude 3.7)☆65Updated last month
- Focused on fast experimentation and simplicity☆71Updated 4 months ago
- ☆49Updated last year
- This repo is my attempt at a rough implementation of nanoGPT trained on a dataset of 30,000 unique Twitter usernames☆24Updated last year
- ☆21Updated 5 months ago
- prime-rl is a codebase for decentralized RL training at scale☆85Updated this week
- Simplex Random Feature attention, in PyTorch☆74Updated last year
- Simple Transformer in Jax☆136Updated 10 months ago
- ☆58Updated last year
- research impl of Native Sparse Attention (2502.11089)☆53Updated 2 months ago
- A miniature version of Modal☆20Updated 10 months ago
- Fast, Modern, Memory Efficient, and Low Precision PyTorch Optimizers☆92Updated 9 months ago
- ☆61Updated last year
- ☆78Updated 10 months ago
- WIP☆93Updated 8 months ago
- ☆22Updated last year
- Lego for GRPO☆27Updated last month
- compute, storage, and networking infra at home☆65Updated last year
- MLX port for xjdr's entropix sampler (mimics jax implementation)☆64Updated 6 months ago
- An introduction to LLM Sampling☆77Updated 4 months ago
- look how they massacred my boy☆63Updated 6 months ago