nanoGPT turned into a chat model
☆82Aug 30, 2023Updated 2 years ago
Alternatives and similar repositories for nanoChatGPT
Users that are interested in nanoChatGPT are comparing it to the libraries listed below.
- Triton‑style kernel toolkit for MLX plus a small upstream incubator: prototype, benchmark, and upstream fusions for Apple Silicon☆45Mar 31, 2026Updated 2 months ago
- Simple repository for training small reasoning models☆52Feb 17, 2026Updated 3 months ago
- ☆10Jun 8, 2024Updated 2 years ago
- Particle Syntax Website☆16Apr 12, 2026Updated 2 months ago
- NanoGPT-speedrunning for the poor T4 enjoyers☆74Apr 22, 2025Updated last year
- A crude RLHF layer on top of nanoGPT with Gumbel-Softmax trick☆295Nov 25, 2023Updated 2 years ago
- a tutorial for training a PyTorch transformer from scratch☆26Apr 8, 2024Updated 2 years ago
- ☆11Aug 9, 2022Updated 3 years ago
- Radiantloom Email Assist 7B is an email-assistant large language model fine-tuned from Zephyr-7B-Beta, over a custom-curated dataset of 1…☆14Jan 19, 2024Updated 2 years ago
- Qwen3-0.6B megakernel: 527 tok/s decode on RTX 3090 (3.8x faster than PyTorch)☆110Feb 10, 2026Updated 4 months ago
- GitHub issues as a blog with comment feature, place to publish and/or relay contents, and open discussion forum.☆15Jun 26, 2020Updated 5 years ago
- Omeka S module that maps a site to a domain☆10Feb 1, 2023Updated 3 years ago
- ☆26Jan 15, 2026Updated 5 months ago
- A SKOS browser and editor☆12Feb 5, 2020Updated 6 years ago
- Ollama chat using Google Mesop library☆20Jun 25, 2024Updated last year
- Transfer Learning for Stenosis Detection in X-ray Coronary Angiography☆13Jul 3, 2021Updated 4 years ago
- Experiments with BitNet inference on CPU☆57Apr 1, 2024Updated 2 years ago
- ☆10Apr 21, 2024Updated 2 years ago
- This project aims to provide a highly effective KV cache management framework for LLM inference and improve memory utilization and inference sp…☆61Apr 24, 2026Updated last month
- Dockerfiles for Avalon Media System - http://github.com/avalonmediasystem/avalon☆10Jun 2, 2026Updated 2 weeks ago
- Ollama API client in ECMAScript / JavaScript / ESM.☆11Sep 18, 2023Updated 2 years ago
- Squebi provides an extendable SPARQL interface.☆22May 27, 2015Updated 11 years ago
- Creates a CMM script that can be directly executed on Kaggle from an easy merge script☆14Mar 6, 2026Updated 3 months ago
- Support for research on AI Education (AIED), AI Tutor, Adaptive Learning, Intelligent Tutoring System☆12Aug 8, 2019Updated 6 years ago
- MMSEG simple word segmenter in C++ 11☆18Jul 19, 2014Updated 11 years ago
- Data consisting only of conversational example sentences extracted from a dictionary☆16Apr 24, 2023Updated 3 years ago
- It is almost the best 3B model in the current open source industry, surpassing Dolly v2-3b, open lama-3b, and even outperforming the Eleu…☆15Jul 24, 2023Updated 2 years ago
- A Qwen .5B reasoning model trained on OpenR1-Math-220k☆14Oct 11, 2025Updated 8 months ago
- MacBook Pro keyboard written in SwiftUI.☆12Jan 19, 2021Updated 5 years ago
- RWKV in nanoGPT style☆197Jun 9, 2024Updated 2 years ago
- AI for a cure, a combination of Latent-GAN and VAE-JTNN to create 100% valid drug-like molecules☆10Mar 16, 2020Updated 6 years ago
- A handy plugin for copying requests/responses directly from Burp, some extra magic included.☆13Oct 15, 2021Updated 4 years ago
- Educational WIP☆73Feb 16, 2026Updated 4 months ago
- ☆10Jul 28, 2021Updated 4 years ago
- Vite + Mantine + Vanilla extract template☆12Jun 10, 2026Updated last week
- ☆11Apr 20, 2023Updated 3 years ago
- 2019 Korean Language Competition Grand Prize winner for Korean dependency parsing (Minister of Culture, Sports and Tourism Award)☆15Oct 26, 2022Updated 3 years ago
- Training a reward model for RLHF using RWKV.☆15Jun 5, 2023Updated 3 years ago
- A Model Agnostic function to directly remove specified layers from the LLM☆10May 23, 2024Updated 2 years ago