maxbbraun / llama4micro
A "large" language model running on a microcontroller
☆496Updated 11 months ago
Related projects ⓘ
Alternatives and complementary repositories for llama4micro
- run paligemma in real time☆123Updated 6 months ago
- 1.58 Bit LLM on Apple Silicon using MLX☆148Updated 6 months ago
- Large Language Models (LLMs) applications and tools running on Apple Silicon in real-time with Apple MLX.☆351Updated 3 months ago
- Port of MiniGPT4 in C++ (4bit, 5bit, 6bit, 8bit, 16bit CPU inference with GGML)☆560Updated last year
- Alex Krizhevsky's original code from Google Code☆190Updated 8 years ago
- LLM-powered lossless compression tool☆252Updated 3 months ago
- An mlx project to train a base model on your whatsapp chats using (Q)Lora finetuning☆159Updated 10 months ago
- Open weights language model from Google DeepMind, based on Griffin.☆607Updated 4 months ago
- A modern model graph visualizer and debugger☆1,060Updated this week
- CLIP inference in plain C/C++ with no extra dependencies☆462Updated 3 months ago
- Llama 2 Everywhere (L2E)☆1,513Updated last month
- An implementation of bucketMul LLM inference☆214Updated 4 months ago
- The repository for the code of the UltraFastBERT paper☆514Updated 8 months ago
- llama.cpp with BakLLaVA model describes what does it see☆381Updated last year
- A minimal Tensor Processing Unit (TPU) inspired by Google's TPUv1.☆117Updated 3 months ago
- TinyChatEngine: On-Device LLM Inference Library☆749Updated 4 months ago
- Mistral7B playing DOOM☆122Updated 4 months ago
- gpt-2 from scratch in mlx☆358Updated 5 months ago
- prime is a framework for efficient, globally distributed training of AI models over the internet.☆230Updated this week
- Efficient Inference of Transformer models☆392Updated 3 months ago
- llama3.np is a pure NumPy implementation for Llama 3 model.☆977Updated 5 months ago
- LLaVA server (llama.cpp).☆177Updated last year
- GGUF implementation in C as a library and a tools CLI program☆244Updated 4 months ago
- ☆860Updated 11 months ago
- Neural Autonomous Navigation Observer is a set of very small DNNs for drones to detect a few simple objects☆164Updated 9 months ago
- Following master Karpathy with GPT-2 implementation and training, writing lots of comments cause I have memory of a goldfish☆167Updated 3 months ago
- Inference Vision Transformer (ViT) in plain C/C++ with ggml☆234Updated 7 months ago
- throwaway GPT inference☆139Updated 5 months ago
- llama3.cuda is a pure C/CUDA implementation for Llama 3 model.☆309Updated 5 months ago
- Recipes for shrinking, optimizing, customizing cutting edge vision models. 💜☆897Updated 2 months ago