maxbbraun/llama4micro
A "large" language model running on a microcontroller
☆507 · Updated last year

Alternatives and similar repositories for llama4micro:
Users interested in llama4micro are comparing it to the repositories listed below.
- LLaVA server (llama.cpp) ☆176 · Updated last year
- Run PaliGemma in real time ☆129 · Updated 8 months ago
- Mistral7B playing DOOM ☆127 · Updated 6 months ago
- Throwaway GPT inference ☆140 · Updated 7 months ago
- llama3.np, a pure NumPy implementation of the Llama 3 model ☆972 · Updated 7 months ago
- 1.58-bit LLM on Apple Silicon using MLX ☆180 · Updated 8 months ago
- TinyChatEngine: On-Device LLM Inference Library ☆797 · Updated 6 months ago
- A modern model graph visualizer and debugger ☆1,109 · Updated this week
- Efficient inference of Transformer models ☆414 · Updated 5 months ago
- The repository for the code of the UltraFastBERT paper ☆514 · Updated 10 months ago
- A small code base for training large models ☆283 · Updated last month
- The Tensor (or Array) ☆420 · Updated 5 months ago
- An implementation of bucketMul LLM inference ☆215 · Updated 6 months ago
- Absolute minimalistic implementation of a GPT-like transformer using only NumPy (<650 lines) ☆250 · Updated last year
- A really tiny autograd engine ☆89 · Updated 9 months ago
- (WIP) A small but powerful, homemade PyTorch from scratch ☆516 · Updated this week
- ☆180 · Updated 5 months ago
- Lightweight inference library for ONNX files, written in C++. It can run Stable Diffusion XL 1.0 on a RPi Zero 2 (or in 298 MB of RAM) but… ☆1,899 · Updated last week
- Open-weights language model from Google DeepMind, based on Griffin ☆614 · Updated 6 months ago
- GGUF implementation in C as a library and a CLI tool ☆251 · Updated 3 weeks ago
- ☆238 · Updated 10 months ago
- Algebraic enhancements for GEMM & AI accelerators ☆263 · Updated this week
- Port of MiniGPT4 in C++ (4-bit, 5-bit, 6-bit, 8-bit, 16-bit CPU inference with GGML) ☆562 · Updated last year
- If tinygrad wasn't small enough for you... ☆677 · Updated 10 months ago
- ☆1,268 · Updated last year
- llama.cpp with the BakLLaVA model describing what it sees ☆380 · Updated last year
- A minimal Tensor Processing Unit (TPU) inspired by Google's TPUv1 ☆126 · Updated 5 months ago
- C++ implementation of BLOOM ☆810 · Updated last year
- CLIP inference in plain C/C++ with no extra dependencies ☆475 · Updated 5 months ago
- Fine-tune Mistral-7B on 3090s, A100s, H100s ☆706 · Updated last year