NanoGPT (124M) quality in 2.67B tokens
☆28 · Sep 17, 2025 · Updated 6 months ago
Alternatives and similar repositories for modded-nanogpt
Users that are interested in modded-nanogpt are comparing it to the libraries listed below.
- Minimal Decision Transformer Implementation written in Jax (Flax). ☆17 · Aug 8, 2022 · Updated 3 years ago
- https://hf.co/hexgrad/Kokoro-82M ☆14 · Jan 14, 2026 · Updated 2 months ago
- The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling ☆42 · Dec 29, 2025 · Updated 3 months ago
- Qt-like event loops, signals and slots for communication across threads and processes in Python ☆14 · Mar 26, 2024 · Updated 2 years ago
- ☆17 · May 15, 2024 · Updated last year
- JAX Implementations of Descript Audio Codec and EnCodec ☆34 · Mar 30, 2025 · Updated 11 months ago
- Efficient encoder-decoder architecture for small language models (≤1B parameters) with cross-architecture knowledge distillation and visi… ☆32 · Feb 7, 2025 · Updated last year
- The Cosmos numerical relativity code (with unstructured AMR) ☆21 · Apr 12, 2024 · Updated last year
- ☆91 · Aug 18, 2024 · Updated last year
- ☆21 · Sep 3, 2024 · Updated last year
- look how they massacred my boy ☆63 · Oct 16, 2024 · Updated last year
- ☆95 · Jan 15, 2025 · Updated last year
- Digital Speech Processing in PyTorch. ☆15 · Aug 12, 2022 · Updated 3 years ago
- Short-video content understanding and recommendation competition ☆12 · Feb 18, 2019 · Updated 7 years ago
- Lightblue LLM Eval Framework: tengu, elyza100, ja-mtbench, rakuda ☆18 · Jan 6, 2026 · Updated 2 months ago
- A scheduler independent blocking mechanism ☆19 · Feb 15, 2024 · Updated 2 years ago
- bindings to gnuplot (fork of https://bitbucket.org/ogu/gnuplot-ocaml/) ☆13 · May 6, 2024 · Updated last year
- GAIL implementation using Tensorflow ☆14 · Sep 17, 2019 · Updated 6 years ago
- Explorations into adversarial losses on top of autoregressive loss for language modeling ☆41 · Dec 21, 2025 · Updated 3 months ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL ☆123 · Aug 22, 2024 · Updated last year
- Algorithms and datastructures for phylogenetics ☆14 · Dec 24, 2025 · Updated 3 months ago
- FractionalTransforms.jl: A Julia package aiming at providing fractional order transforms with high performance. ☆16 · Jul 15, 2022 · Updated 3 years ago
- OCaml bindings for the Integer Set Library. ☆13 · Jun 12, 2014 · Updated 11 years ago
- ☆40 · Jul 26, 2024 · Updated last year
- Fast vectorized bitarrays for OCaml ☆16 · Jul 11, 2023 · Updated 2 years ago
- 🪐 The Sebulba architecture to scale reinforcement learning on Cloud TPUs in JAX ☆62 · Oct 23, 2023 · Updated 2 years ago
- MetaQA: Combining Expert Agents for Multi-Skill Question Answering ☆23 · Mar 13, 2022 · Updated 4 years ago
- Make triton easier ☆50 · Jun 12, 2024 · Updated last year
- ☆20 · Dec 14, 2024 · Updated last year
- List of 4000 Chinese characters sorted by historical usage frequency, with Cantonese yale romanization and definition ☆14 · Dec 18, 2022 · Updated 3 years ago
- A VS Code extension to ease log reading and analysis ☆11 · Jan 23, 2024 · Updated 2 years ago
- A lightweight audio codec based on a single quantizer ☆34 · Sep 4, 2025 · Updated 6 months ago
- RWKV-7: Surpassing GPT ☆104 · Nov 17, 2024 · Updated last year
- Co-operative allocation of domains for OCaml ☆15 · Jan 26, 2023 · Updated 3 years ago
- https://x.com/BlinkDL_AI/status/1884768989743882276 ☆28 · May 4, 2025 · Updated 10 months ago
- OCaml library to produce vega-lite visualizations (as json objects) ☆17 · Aug 17, 2022 · Updated 3 years ago
- LLM training in simple, raw C/CUDA ☆112 · May 1, 2024 · Updated last year
- ☆10 · Jun 15, 2023 · Updated 2 years ago
- Build contrasts for models defined with formulaic ☆12 · Updated this week