This repo is my attempt at a rough implementation of nanoGPT trained on a dataset of 30,000 unique Twitter usernames
☆23Apr 7, 2024Updated 2 years ago
Alternatives and similar repositories for GPT-Scratch
Users that are interested in GPT-Scratch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 7am delivers daily weather summary to you at 7am☆22Jun 20, 2025Updated 11 months ago
- Lydia: Who's Your Enemy in the Dark Forrest☆13Aug 24, 2025Updated 9 months ago
- [WIP] Transformer to embed Danbooru labelsets☆13Mar 31, 2024Updated 2 years ago
- Simple (fast) transformer inference in PyTorch with torch.compile + lit-llama code☆10Aug 29, 2023Updated 2 years ago
- Programmatically find the storage slots for the balanceOf and allowance mappings for an ERC20 token contract in javascript☆20Nov 16, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- AniPortrait with Gradio: Audio-Driven Synthesis of Photorealistic Portrait Animation☆22Mar 31, 2024Updated 2 years ago
- The Transcendent Progressivism (t/prog) Manifesto: A Vision for Humanity's Future☆41May 27, 2025Updated 11 months ago
- Source code for Activated LoRA☆25Apr 30, 2026Updated 3 weeks ago
- parallelized hyperdimensional tictactoe☆127Aug 25, 2024Updated last year
- High Quality Resources on GPU Programming/Architecture☆592Jul 26, 2024Updated last year
- PyTorch DL Tutorial using Torchsample☆11May 2, 2017Updated 9 years ago
- Collection of reports/articles/publications/etc of mine.☆50May 26, 2022Updated 3 years ago
- An implementation of the transformer architecture onto an Nvidia CUDA kernel☆202Sep 24, 2023Updated 2 years ago
- inference code for mixtral-8x7b-32kseqlen☆104Dec 12, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ❓ List of performance engineering questions for performance engineers.☆20Oct 8, 2020Updated 5 years ago
- Build modern UIs in Jupyter with Python☆12Dec 28, 2022Updated 3 years ago
- An AI character interaction system with emotional modeling and advanced memory management☆17Oct 26, 2024Updated last year
- nvidia-smi xml to json☆15May 29, 2024Updated last year
- Blogging with Emacs and AI☆11Jun 4, 2023Updated 2 years ago
- Annoucing Instructor Cloud☆38Aug 14, 2024Updated last year
- ☆134Nov 24, 2023Updated 2 years ago
- A Qwen .5B reasoning model trained on OpenR1-Math-220k☆14Oct 11, 2025Updated 7 months ago
- ☆12Jul 10, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Confidential inference in enclave for OpenAI grant. Uses k3s and Triton☆16Mar 20, 2025Updated last year
- ☆67Dec 8, 2023Updated 2 years ago
- upload a manim script and generate an animation☆11Mar 10, 2024Updated 2 years ago
- A spatial terminal multiplexer for macOS. Terminals live on an infinite canvas that you can pan, zoom, and arrange freely.☆42Apr 2, 2026Updated last month
- Queue system for dispatching FFmpeg jobs, used for @uwutube, powered by @fastify and @redis☆10Feb 12, 2022Updated 4 years ago
- ☆16Mar 23, 2023Updated 3 years ago
- papers.day☆93Dec 15, 2023Updated 2 years ago
- Official code for Guiding Language Model Math Reasoning with Planning Tokens☆19Feb 29, 2024Updated 2 years ago
- ☆17Apr 20, 2025Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- An ERC721 for resurrecting the dead.☆17Oct 8, 2021Updated 4 years ago
- A GPT with self-similar nested properties☆20Mar 19, 2024Updated 2 years ago
- A collection of puzzles I've created over the years☆14Dec 31, 2020Updated 5 years ago
- Notes for CS294/194-196: Large Language Model Agents (Fall 2024, UC Berkeley), summarizing 12 lectures on LLM fundamentals, reasoning, pl…☆17Jan 7, 2025Updated last year
- utilities to facilitate working with codebases that don't ascribe to normal package management paradigms, e.g. ML research code that can …☆13Nov 26, 2022Updated 3 years ago
- ☆25May 23, 2025Updated last year
- Allows native usage of ModelScope based Text To Video Models in ComfyUI☆27May 23, 2024Updated 2 years ago