ash-01xor / bpe.c
A simple byte pair encoding (BPE) mechanism for tokenization, written purely in C.
☆146 · Nov 11, 2024 · Updated last year
Alternatives and similar repositories for bpe.c
Users who are interested in bpe.c are comparing it to the repositories listed below.
- UNet diffusion model in pure CUDA ☆661 · Jun 28, 2024 · Updated last year
- Fast bare-bones BPE for modern tokenizer training ☆175 · Jun 23, 2025 · Updated 7 months ago
- Implementation of Diffusion Transformer (DiT) in JAX ☆306 · Jun 11, 2024 · Updated last year
- RuLES: a benchmark for evaluating rule-following in language models ☆248 · Feb 24, 2025 · Updated 11 months ago
- A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton. ☆596 · Aug 12, 2025 · Updated 6 months ago
- Simple MPI implementation for prototyping or learning ☆300 · Aug 6, 2025 · Updated 6 months ago
- coded with and corrected by Google Anti-Gravity ☆13 · Nov 23, 2025 · Updated 2 months ago
- A lightweight library for portable low-level GPU computation using WebGPU. ☆3,944 · Oct 8, 2025 · Updated 4 months ago
- Implementing DeepSeek R1's GRPO algorithm from scratch ☆1,762 · Apr 18, 2025 · Updated 9 months ago
- My favorite C programming practices. ☆2,149 · Jan 19, 2026 · Updated 3 weeks ago
- useful scripts to work with Twitter + Python. Requires the tweepy library. ☆85 · Nov 29, 2012 · Updated 13 years ago
- bare minimum chess program ☆11 · Sep 16, 2020 · Updated 5 years ago
- Minimalistic 4D-parallelism distributed training framework for education purpose ☆2,076 · Aug 26, 2025 · Updated 5 months ago
- A Golang application that demonstrates how to monitor a Golang service using Prometheus and Grafana. This is for Docker's official Deno L… ☆15 · Mar 22, 2025 · Updated 10 months ago
- A benchmark to evaluate language models on questions I've previously asked them to solve. ☆1,042 · Apr 27, 2025 · Updated 9 months ago
- NanoGPT (124M) in 2 minutes ☆4,624 · Updated this week
- gpt-2 from scratch in mlx ☆415 · Jun 12, 2024 · Updated last year
- FlexAttention based, minimal vllm-style inference engine for fast Gemma 2 inference. ☆334 · Nov 2, 2025 · Updated 3 months ago
- A python script to help manage a Gmail inbox by filtering out promotional emails using GPT-3 or GPT-4. ☆458 · Dec 2, 2023 · Updated 2 years ago
- ☆18 · Jul 12, 2025 · Updated 7 months ago
- kenDryte K210 Cloud Build Support ☆11 · Oct 24, 2018 · Updated 7 years ago
- Tile primitives for speedy kernels ☆3,139 · Updated this week
- Simple implementation of a GPT (training and inference) in PyTorch. ☆13 · Dec 11, 2023 · Updated 2 years ago
- ☆16 · Dec 19, 2024 · Updated last year
- ☆14 · Aug 19, 2024 · Updated last year
- Schedule-Free Optimization in PyTorch ☆2,256 · May 21, 2025 · Updated 8 months ago
- Finetuning VITS Efficiently ☆33 · Nov 6, 2023 · Updated 2 years ago
- The code for the paper *The Sensitivity of Counterfactual Fairness to Unmeasured Confounding* @ UAI 2019 ☆14 · Apr 4, 2020 · Updated 5 years ago
- This repo is text to speech with learnable audio encoder without alignment with transcript reference ☆53 · Sep 20, 2025 · Updated 4 months ago
- Course project for EE698R (2020-21 Sem 2). An X-Vector Based Speaker Diarization System with AutoEncoder based clustering method. Also su… ☆16 · Jun 2, 2021 · Updated 4 years ago
- RAG application to answer questions about PDF documents using LLMs. ☆14 · Dec 1, 2023 · Updated 2 years ago
- JavaScript with Batteries Included for Google Glass ☆218 · Jul 10, 2016 · Updated 9 years ago
- SkyScenes: A Synthetic Dataset for Aerial Scene Understanding ☆22 · Sep 25, 2024 · Updated last year
- A PyTorch native platform for training generative AI models ☆5,045 · Feb 8, 2026 · Updated last week
- Custom triton kernels for training Karpathy's nanoGPT. ☆19 · Oct 21, 2024 · Updated last year
- ☆16 · Dec 18, 2023 · Updated 2 years ago
- Open deep learning compiler stack for cpu, gpu and specialized accelerators ☆19 · Updated this week
- Run PyTorch LLMs locally on servers, desktop and mobile ☆3,624 · Sep 10, 2025 · Updated 5 months ago
- implementing dl from scratch using first principles ☆25 · Jan 10, 2026 · Updated last month