Video+code lecture on building nanoGPT from scratch
☆67Jun 14, 2024Updated 2 years ago
Alternatives and similar repositories for build-nanogpt
Users that are interested in build-nanogpt are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Yet another frontend for LLM, written using .NET and WinUI 3☆11Sep 14, 2025Updated 9 months ago
- ☆19Sep 9, 2024Updated last year
- Simple Streamlit UI for Ollama☆22May 13, 2024Updated 2 years ago
- Tcurtsni: Reverse Instruction Chat, ever wonder what your LLM wants to ask you?☆23Jun 25, 2024Updated last year
- A simple library for working with Hugging Face models.☆14Dec 30, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆11Jul 21, 2024Updated last year
- AI Based "Happiness Optimizer"☆12Oct 20, 2024Updated last year
- "a towel is about the most massively useful thing an interstellar AI hitchhiker can have"☆48Oct 9, 2024Updated last year
- entropix style sampling + GUI☆27Oct 30, 2024Updated last year
- Reference implementation of Mistral AI 7B v0.1 model.☆28Dec 25, 2023Updated 2 years ago
- The Web Metadata Extraction Toolkit is designed to streamline the process of extracting, cleaning, and analyzing metadata from websites. …☆18Jul 8, 2024Updated last year
- ☆21Feb 20, 2023Updated 3 years ago
- Thin wrapper around GGML to make life easier☆46Nov 5, 2025Updated 7 months ago
- ☆17Dec 6, 2024Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆12Dec 21, 2024Updated last year
- gpt-2 from scratch in mlx☆432Jun 12, 2024Updated 2 years ago
- Basic rover demo from Raspberry Pi with remote teleop over LiveKit☆18Jul 10, 2025Updated 11 months ago
- RAG application to answer questions about PDF documents using LLMs.☆16Dec 1, 2023Updated 2 years ago
- Pytorch implementation of MIMO, Controllable Character Video Synthesis with Spatial Decomposed Modeling, from Alibaba Intelligence Group☆135Sep 30, 2024Updated last year
- A Javascript library (with Typescript types) to parse metadata of GGML based GGUF files.☆52Jul 30, 2024Updated last year
- Attempt at cog wrapper for nightmareai/real-esrgan for larger images☆16Sep 28, 2023Updated 2 years ago
- finetune your florence2 model easy☆21Jul 27, 2024Updated last year
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆42Jul 18, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- supporting pytorch FSDP for optimizers☆84Dec 8, 2024Updated last year
- larry.ai: A Batteries Included ChatGPT Frontend Framework & HTTP Proxy☆17Jan 16, 2024Updated 2 years ago
- Simple repository for training small reasoning models☆52Feb 17, 2026Updated 3 months ago
- Proteus is an experimental platform that combines the power of Large Language Models with the Genesis physics engine☆25Dec 20, 2024Updated last year
- ☆23May 14, 2026Updated last month
- ☆24Jan 22, 2025Updated last year
- Latent Large Language Models☆19Aug 24, 2024Updated last year
- NanoGPT (124M) in 5 minutes☆15Feb 14, 2025Updated last year
- Never fill a sockaddr_in struct by hand again!☆13Apr 10, 2020Updated 6 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Official implementation of the TTS model Lina-Speech☆178Jan 9, 2025Updated last year
- AI Search engine☆13Sep 24, 2025Updated 8 months ago
- Find better generation parameters for your LLM☆27Jun 9, 2024Updated 2 years ago
- ☆11Feb 9, 2024Updated 2 years ago
- OllaDeck is a purple technology stack for Generative AI (text modality) cybersecurity. It provides a comprehensive set of tools for both …☆17Sep 21, 2024Updated last year
- WIP: Ofen is a toolkit aimed at making transformer models production-ready. API included☆17Oct 2, 2024Updated last year
- An unofficial pytorch implementation of 'Efficient Infinite Context Transformers with Infini-attention'☆56Aug 19, 2024Updated last year