A "large" language model running on a microcontroller
☆554Dec 9, 2023Updated 2 years ago
Alternatives and similar repositories for llama4micro
Users that are interested in llama4micro are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Source code for Coral Dev Board Micro☆133Mar 25, 2026Updated last month
- Llama 2 Everywhere (L2E)☆1,527Aug 27, 2025Updated 8 months ago
- Inference Llama 2 in one file of pure C☆19,460Aug 6, 2024Updated last year
- TensorFlow Lite for BL602☆12Jun 22, 2021Updated 4 years ago
- ☆1,274Oct 24, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- llama.cpp with BakLLaVA model describes what does it see☆379Nov 8, 2023Updated 2 years ago
- Simple CogVLM client script☆13Dec 20, 2023Updated 2 years ago
- Must-have resource for anyone who wants to experiment with and build on the OpenAI vision API 🔥☆1,685Jan 14, 2025Updated last year
- Running a LLM on the ESP32☆522Sep 4, 2024Updated last year
- ⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Pl…☆2,178Oct 8, 2024Updated last year
- An open source wearable with camera☆622May 12, 2024Updated last year
- Zucker SOC☆16Jun 11, 2025Updated 10 months ago
- ☆41May 10, 2024Updated last year
- smolLM with Entropix sampler on pytorch☆149Oct 31, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- TensorFlow Lite Micro Library for Arduino☆22Jul 5, 2024Updated last year
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.☆8,954May 3, 2024Updated last year
- Highly commented implementations of Transformers in PyTorch☆138Aug 2, 2023Updated 2 years ago
- Zephyr module including a little build system for Lua and usage samples☆11Aug 21, 2025Updated 8 months ago
- QLoRA for Masked Language Modeling☆23Sep 11, 2023Updated 2 years ago
- Just a bunch of benchmark logs for different LLMs☆124Jul 28, 2024Updated last year
- Local ML voice chat using high-end models.☆188Apr 3, 2026Updated 3 weeks ago
- Transcribe and summarize videos using whisper and llms on apple mlx framework☆80Jan 28, 2024Updated 2 years ago
- MLX: An array framework for Apple silicon☆25,814Updated this week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- 100% open source dev kit for EOS S3 MCU+eFPGA SoC supported by fully open source SDK and FPGA Toolchain☆41Mar 24, 2021Updated 5 years ago
- Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.☆10,454Jul 1, 2024Updated last year
- Tensor library for machine learning☆14,560Updated this week
- Distribute and run LLMs with a single file.☆24,349Updated this week
- LLM inference in C/C++☆107,892Updated this week
- Universal LLM Deployment Engine with ML Compilation☆22,557Apr 22, 2026Updated last week
- This repository contains some examples of using borb in google colab. These examples enable you to try out the features of borb without i…☆13Sep 4, 2022Updated 3 years ago
- tiny vision language model☆9,613Apr 20, 2026Updated last week
- blablado is an extensible Assistant that listens to your voice and can execute custom Python functions you provided. It can speak as well…☆69Aug 4, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Fork of Paul's Inferno, Metal shaders for SwiftUI, to experimentally support visionOS☆16Jan 15, 2024Updated 2 years ago
- GGUF implementation in C as a library and a tools CLI program☆312Aug 28, 2025Updated 8 months ago
- Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.☆6,204Aug 22, 2025Updated 8 months ago
- [NeurIPS 2020] MCUNet: Tiny Deep Learning on IoT Devices; [NeurIPS 2021] MCUNetV2: Memory-Efficient Patch-based Inference for Tiny Deep L…☆942Nov 27, 2024Updated last year
- Data extraction with LLM on CPU☆69Nov 14, 2023Updated 2 years ago
- Extract information, summarize, ask questions, and search videos using OpenAI's Vision API 🚀🎦☆61Nov 7, 2023Updated 2 years ago
- AI narrator☆15Nov 24, 2023Updated 2 years ago