nanogpt turned into a chat model
☆81Aug 30, 2023Updated 2 years ago
Alternatives and similar repositories for nanoChatGPT
Users that are interested in nanoChatGPT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Triton‑style kernel toolkit for MLX plus a small upstream incubator: prototype, benchmark, and upstream fusions for Apple Silicon☆45Mar 31, 2026Updated last month
- ☆26Apr 10, 2026Updated last month
- Simple repository for training small reasoning models☆51Feb 17, 2026Updated 3 months ago
- ☆10Jun 8, 2024Updated last year
- Disambiguation of wikipedia article name☆17Mar 15, 2017Updated 9 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- a tutorial for training a PyTorch transformer from scratch☆26Apr 8, 2024Updated 2 years ago
- ☆11Aug 9, 2022Updated 3 years ago
- Radiantloom Email Assist 7B is an email-assistant large language model fine-tuned from Zephyr-7B-Beta, over a custom-curated dataset of 1…☆14Jan 19, 2024Updated 2 years ago
- Qwen3-0.6B megakernel: 527 tok/s decode on RTX 3090 (3.8x faster than PyTorch)☆105Feb 10, 2026Updated 3 months ago
- ☆26Jan 15, 2026Updated 4 months ago
- A SKOS browser and editor☆12Feb 5, 2020Updated 6 years ago
- A Python reimplementation + extension of "Planning with Large Language Models for Code Generation" (https://arxiv.org/abs/2303.05510)☆17Dec 1, 2023Updated 2 years ago
- Transfer Learning for Stenosis Detection in X-ray Coronary Angiography☆13Jul 3, 2021Updated 4 years ago
- Reference implementation of "Softmax Attention with Constant Cost per Token" (Heinsen, 2024)☆25Jun 6, 2024Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- A simple way to expose static assets as a read-only Linked Data server☆12Jan 2, 2026Updated 4 months ago
- ESP32-C3 Dev Board for Arduino Community☆12Jul 3, 2025Updated 10 months ago
- Create synthetic datasets from scratch using AI-powered generation. Define topics, customize prompts, and generate high-quality reasoning…☆31Mar 18, 2026Updated 2 months ago
- Data type isomorphic to α ∨ β ∨ (α ∧ β)☆14Apr 27, 2022Updated 4 years ago
- 사전에서 대화 예문만 추출한 데이터☆16Apr 24, 2023Updated 3 years ago
- It is almost the best 3B model in the current open source industry, surpassing Dolly v2-3b, open lama-3b, and even outperforming the Eleu…☆15Jul 24, 2023Updated 2 years ago
- microKanren sagittarius/larceny☆11Jun 13, 2015Updated 10 years ago
- Various samples of how to use the Handle.net Registry☆11Aug 20, 2024Updated last year
- ☆34May 20, 2026Updated last week
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A Qwen .5B reasoning model trained on OpenR1-Math-220k☆14Oct 11, 2025Updated 7 months ago
- Fast modular code to create and train cutting edge LLMs☆67May 16, 2024Updated 2 years ago
- Prolog implemented in Python☆12Sep 6, 2024Updated last year
- grep for context, not just text. Local-first CLI for searching documents, notes, memories, and project context.☆27Mar 8, 2026Updated 2 months ago
- Stable Diffusion with Core ML on Apple Silicon☆12Sep 27, 2023Updated 2 years ago
- MacBook Pro keyboard written in SwiftUI.☆12Jan 19, 2021Updated 5 years ago
- Simple implementation of a GPT (training and inference) in PyTorch.☆13Dec 11, 2023Updated 2 years ago
- RWKV in nanoGPT style☆196Jun 9, 2024Updated last year
- A minimal re-implementation of orthogonal fine-tuning (OFT), a diffusion method, for LLMs. Based on nanoGPT and minLoRA.☆14Nov 17, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Visual Studio Code extension for viewing files and folders in the workspace with size and the estimated gzip size.☆16May 23, 2021Updated 5 years ago
- AI for a cure, a combination of Latent-GAN and VAE-JTNN to create 100% valid drug like molecules☆10Mar 16, 2020Updated 6 years ago
- ML from scratch in Jax☆12Aug 20, 2025Updated 9 months ago
- A copy of the DirectX Headers from MinGW-64.☆14Sep 7, 2023Updated 2 years ago
- ☆10Jul 28, 2021Updated 4 years ago
- Educational WIP☆70Feb 16, 2026Updated 3 months ago
- Vite + Mantine + Vanilla extract template☆12May 14, 2026Updated last week