Jaykef / micrograd.c
Port of Karpathy's micrograd in pure C. Micrograd is a tiny scalar-valued autograd engine and a neural net library on top of it with a PyTorch-like API.
Related projects:
- Inference of Mamba models in pure C
- A pipeline for LLM knowledge distillation
- The simplest, fastest repository for training/finetuning medium-sized xLSTMs
- Video+code lecture on building nanoGPT from scratch
- GPU benchmark
- 1.58-bit LLM on Apple Silicon using MLX
- An example implementation of RLHF (or, more accurately, RLAIF) built on MLX and HuggingFace
- Fast parallel LLM inference for MLX
- Tiny ASIC implementation of the matrix multiplication unit from "The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits"
- LLM training in simple, raw C/CUDA
- Train your own small BitNet model
- Inference Llama 2 in C++
- Port of Andrej Karpathy's nanoGPT to the Apple MLX framework
- An implementation of Self-Extend, to expand the context window via grouped attention
- 1.58-bit LLaMA model
- Inference code for mixtral-8x7b-32kseqlen
- Testing LLM reasoning abilities with family-relationship quizzes
- Implementation of Mamba in Rust
- A simplified version of Google's Gemma model to be used for learning
- A single repo with all scripts and utils to train/fine-tune the Mamba model with or without FIM
- Low-rank adapter extraction for fine-tuned transformer models
- Eh, simple and works.
- Token Omission Via Attention
- Fast approximate inference on a single GPU with sparsity-aware offloading
- Self-hosted LLM chatbot arena, with yourself as the only judge