A "large" language model running on a microcontroller
☆555 · Dec 9, 2023 · Updated 2 years ago
Alternatives and similar repositories for llama4micro
Users that are interested in llama4micro are comparing it to the libraries listed below.
- Llama 2 Everywhere (L2E) ☆1,526 · Aug 27, 2025 · Updated 10 months ago
- TensorFlow Lite for BL602 ☆12 · Jun 22, 2021 · Updated 5 years ago
- Inference Llama 2 in one file of pure C ☆19,682 · Aug 6, 2024 · Updated last year
- ☆1,276 · Oct 24, 2023 · Updated 2 years ago
- llama.cpp with BakLLaVA model describes what does it see ☆379 · Nov 8, 2023 · Updated 2 years ago
- Simple CogVLM client script ☆13 · Dec 20, 2023 · Updated 2 years ago
- Must-have resource for anyone who wants to experiment with and build on the OpenAI vision API 🔥 ☆1,687 · Jan 14, 2025 · Updated last year
- ⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Pl… ☆2,176 · Oct 8, 2024 · Updated last year
- An open source wearable with camera ☆624 · May 12, 2024 · Updated 2 years ago
- A VS Code Workspace for developing Zephyr Projects ☆11 · Jun 7, 2023 · Updated 3 years ago
- ☆40 · May 10, 2024 · Updated 2 years ago
- smolLM with Entropix sampler on pytorch ☆149 · Oct 31, 2024 · Updated last year
- Chrome extension to watch YouTube videos ad-free ☆17 · Jun 7, 2024 · Updated 2 years ago
- TensorFlow Lite Micro Library for Arduino ☆22 · Jul 5, 2024 · Updated last year
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens. ☆8,999 · May 3, 2024 · Updated 2 years ago
- Highly commented implementations of Transformers in PyTorch ☆139 · Aug 2, 2023 · Updated 2 years ago
- QLoRA for Masked Language Modeling ☆23 · Sep 11, 2023 · Updated 2 years ago
- Just a bunch of benchmark logs for different LLMs ☆130 · Jul 28, 2024 · Updated last year
- Local ML voice chat using high-end models. ☆188 · Jun 4, 2026 · Updated 3 weeks ago
- Transcribe and summarize videos using whisper and llms on apple mlx framework ☆80 · Jan 28, 2024 · Updated 2 years ago
- MLX: An array framework for Apple silicon ☆27,375 · Updated this week
- Zephyr module including a little build system for Lua and usage samples ☆12 · Aug 21, 2025 · Updated 10 months ago
- Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization. ☆10,598 · Jul 1, 2024 · Updated 2 years ago
- the small distributed language model toolkit; fine-tune state-of-the-art LLMs anywhere, rapidly ☆33 · Oct 19, 2024 · Updated last year
- Tensor library for machine learning ☆14,871 · Jun 19, 2026 · Updated last week
- Distribute and run LLMs with a single file. ☆25,105 · Updated this week
- LLM inference in C/C++ ☆118,422 · Updated this week
- Universal LLM Deployment Engine with ML Compilation ☆22,863 · May 11, 2026 · Updated last month
- blablado is an extensible Assistant that listens to your voice and can execute custom Python functions you provided. It can speak as well… ☆69 · Aug 4, 2024 · Updated last year
- tiny vision language model ☆9,810 · Apr 20, 2026 · Updated 2 months ago
- Simple and efficient pytorch-native transformer text generation in <1000 LOC of python. ☆6,224 · Aug 22, 2025 · Updated 10 months ago
- GGUF implementation in C as a library and a tools CLI program ☆342 · May 16, 2026 · Updated last month
- [NeurIPS 2020] MCUNet: Tiny Deep Learning on IoT Devices; [NeurIPS 2021] MCUNetV2: Memory-Efficient Patch-based Inference for Tiny Deep L… ☆951 · Nov 27, 2024 · Updated last year
- Data extraction with LLM on CPU ☆69 · Nov 14, 2023 · Updated 2 years ago
- Extract information, summarize, ask questions, and search videos using OpenAI's Vision API 🚀🎦 ☆61 · Nov 7, 2023 · Updated 2 years ago
- AI narrator ☆15 · Nov 24, 2023 · Updated 2 years ago
- Apache NuttX RTOS on FPGA ☆16 · Feb 20, 2024 · Updated 2 years ago
- streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL ☆2,678 · Jun 22, 2026 · Updated last week
- ☆140 · Feb 20, 2024 · Updated 2 years ago