NanoGPT (124M) quality in 2.67B tokens
☆28Sep 17, 2025Updated 7 months ago
Alternatives and similar repositories for modded-nanogpt
Users that are interested in modded-nanogpt are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- https://hf.co/hexgrad/Kokoro-82M☆14Jan 14, 2026Updated 3 months ago
- Stochastic trace estimation using JAX☆17Aug 20, 2025Updated 8 months ago
- AbBFN2: A flexible antibody foundation model based on Bayesian Flow Networks☆39Jun 4, 2025Updated 11 months ago
- The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling☆42Dec 29, 2025Updated 4 months ago
- An introduction to DSPy☆34Aug 30, 2025Updated 8 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Template for projects in PyTorch powered with PyTorch Lightning, Telegrad and MLflow. Get updates on mobile and streamline PyTorch code f…☆10May 1, 2023Updated 3 years ago
- Efficient encoder-decoder architecture for small language models (≤1B parameters) with cross-architecture knowledge distillation and visi…☆32Feb 7, 2025Updated last year
- ☆92Aug 18, 2024Updated last year
- Numerical Optimisation Library☆17Jul 9, 2023Updated 2 years ago
- look how they massacred my boy☆63Oct 16, 2024Updated last year
- ☆45Jun 19, 2024Updated last year
- ☆95Jan 15, 2025Updated last year
- 短视频内容理解与推荐竞赛☆12Feb 18, 2019Updated 7 years ago
- Digital Speech Processing in PyTorch.☆15Aug 12, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- bindings to gnuplot (fork of https://bitbucket.org/ogu/gnuplot-ocaml/)☆13May 6, 2024Updated 2 years ago
- This code accompanies the paper "Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration."☆38Jul 11, 2025Updated 9 months ago
- GAIL implementation using Tensorflow☆14Sep 17, 2019Updated 6 years ago
- Explorations into adversarial losses on top of autoregressive loss for language modeling☆41Dec 21, 2025Updated 4 months ago
- Dockerized openconnect client. Compatible with Cisco Anyconnect (CSD). Exposes socks5 proxy.☆14Oct 16, 2020Updated 5 years ago
- FractionalTransforms.jl: A Julia package aiming at providing fractional order transforms with high performance.☆16Jul 15, 2022Updated 3 years ago
- OCaml bindings for the Integer Set Library.☆13Jun 12, 2014Updated 11 years ago
- ☆40Jul 26, 2024Updated last year
- Obtain options data from Interactive Brokers (IBKR) API☆10Nov 11, 2022Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- 🪐 The Sebulba architecture to scale reinforcement learning on Cloud TPUs in JAX☆62Oct 23, 2023Updated 2 years ago
- Train to 94% on CIFAR-10 in 4.4 seconds on a single A100☆12Dec 30, 2023Updated 2 years ago
- ☆26Jan 23, 2026Updated 3 months ago
- MetaQA: Combining Expert Agents for Multi-Skill Question Answering☆24Mar 13, 2022Updated 4 years ago
- ☆20Dec 14, 2024Updated last year
- ☆10Jan 10, 2025Updated last year
- List of 4000 Chinese characters sorted by historical usage frequency, with Cantonese yale romanization and definition☆14Dec 18, 2022Updated 3 years ago
- ☆22Aug 21, 2025Updated 8 months ago
- A lightweight audio codec based on a single quantizer☆34Sep 4, 2025Updated 8 months ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- RWKV-7: Surpassing GPT☆104Nov 17, 2024Updated last year
- ☆49Jan 18, 2024Updated 2 years ago
- Chromax is a breeding simulator based on JAX.☆10Jun 6, 2025Updated 11 months ago
- https://x.com/BlinkDL_AI/status/1884768989743882276☆28May 4, 2025Updated last year
- ☆21Jun 4, 2024Updated last year
- Build contrasts for models defined with formulaic☆12Apr 27, 2026Updated last week
- ☆12May 30, 2025Updated 11 months ago