nanoGPT turned into a chat model
☆81 · Aug 30, 2023 · Updated 2 years ago
Alternatives and similar repositories for nanoChatGPT
Users who are interested in nanoChatGPT are comparing it to the repositories listed below
- Simple repository for training small reasoning models ☆49 · Feb 17, 2026 · Updated 2 weeks ago
- Triton‑style kernel toolkit for MLX plus a small upstream incubator: prototype, benchmark, and upstream fusions for Apple Silicon ☆36 · Updated this week
- ☆25 · Oct 7, 2025 · Updated 4 months ago
- Code for the ACL 2024 paper "PLUG: Leveraging Pivot Language in Cross-Lingual Instruction Tuning" ☆14 · Aug 13, 2025 · Updated 6 months ago
- Create synthetic datasets from scratch using AI-powered generation. Define topics, customize prompts, and generate high-quality reasoning… ☆29 · Updated this week
- Qwen3-0.6B megakernel: 527 tok/s decode on RTX 3090 (3.8x faster than PyTorch) ☆81 · Feb 10, 2026 · Updated 3 weeks ago
- Disambiguation of wikipedia article name ☆17 · Mar 15, 2017 · Updated 8 years ago
- Common tools for data processing ☆22 · Dec 8, 2025 · Updated 2 months ago
- Experiments with BitNet inference on CPU ☆55 · Apr 1, 2024 · Updated last year
- Source code for 2FAS Pass Browser Extension ☆49 · Updated this week
- This is an easy to understand, simplified, broken-down implementation of Diffusion Models written in PyTorch. The architecture is borrowe… ☆27 · Aug 18, 2023 · Updated 2 years ago
- Educational WIP ☆68 · Feb 16, 2026 · Updated 2 weeks ago
- Evaluation framework for document processing models and services. ☆64 · Feb 12, 2026 · Updated 3 weeks ago
- never forget anything again! combine AI and intelligent tooling for a local knowledge base to track catalogue, annotate, and plan for you… ☆37 · May 14, 2024 · Updated last year
- Russian words synonyms and antonyms ☆11 · Dec 7, 2021 · Updated 4 years ago
- ☆12 · Sep 22, 2015 · Updated 10 years ago
- AI for a cure, a combination of Latent-GAN and VAE-JTNN to create 100% valid drug like molecules ☆10 · Mar 16, 2020 · Updated 5 years ago
- Transfer Learning for Stenosis Detection in X-ray Coronary Angiography ☆13 · Jul 3, 2021 · Updated 4 years ago
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024] ☆149 · Oct 27, 2024 · Updated last year
- A Transfer Learning Study of Gas Adsorption in Metal-Organic Frameworks ☆14 · Jul 15, 2020 · Updated 5 years ago
- An environment where you can try out faster-whisper immediately. ☆38 · Nov 21, 2024 · Updated last year
- a blog starter project ☆11 · Oct 29, 2018 · Updated 7 years ago
- LLM Building Blocks for Python Course ☆16 · Nov 17, 2025 · Updated 3 months ago
- A protein language model for learning the SARS-CoV-2 fitness landscape ☆12 · Apr 22, 2025 · Updated 10 months ago
- Data profiling tools for Big Data ☆11 · Nov 17, 2025 · Updated 3 months ago
- Create and modify Word documents with Python ☆12 · Feb 16, 2026 · Updated 2 weeks ago
- AI model for making mazes that extends OpenAIs GPT2 model ☆15 · Dec 21, 2023 · Updated 2 years ago
- Like tcpdump for AWS IAM policies ☆10 · Jul 15, 2019 · Updated 6 years ago
- Analyzing the most strategic words to guess on Wordle, based on letter frequency distributions ☆11 · Feb 20, 2022 · Updated 4 years ago
- Deep Generative Models: Diffusion Models for Molecule Generation ☆10 · Jun 17, 2024 · Updated last year
- A simple web-app for generating glassmorphism UI effect! ☆12 · Aug 5, 2023 · Updated 2 years ago
- Vite + Mantine + Vanilla extract template ☆12 · Feb 16, 2026 · Updated 2 weeks ago
- A Model Agnostic function to directly remove specified layers from the LLM ☆10 · May 23, 2024 · Updated last year
- Conversion of audio files to text using whisper from OpenAI with a simple tkinter GUI ☆10 · Apr 13, 2023 · Updated 2 years ago
- OpenCode GUI extension for VSCode ☆22 · Feb 11, 2026 · Updated 3 weeks ago
- Author Name Disambiguation ☆10 · Sep 10, 2021 · Updated 4 years ago
- yet another reinforcement learning package ☆12 · May 24, 2022 · Updated 3 years ago
- Official codebase for our paper "Do Language Models Use Their Depth Efficiently?" ☆29 · Jun 25, 2025 · Updated 8 months ago
- Radix Primitives Cheatsheet ☆12 · Mar 11, 2022 · Updated 3 years ago