Run and explore Llama models locally with minimal dependencies on CPU
☆188 · Oct 12, 2024 · Updated last year
Alternatives and similar repositories for run-llama-locally
Users that are interested in run-llama-locally are comparing it to the libraries listed below.
- Minimal LLM inference in Rust ☆1,036 · Oct 24, 2024 · Updated last year
- Docker-based inference engine for AMD GPUs ☆233 · Oct 7, 2024 · Updated last year
- Sequential Logic ☆114 · Apr 5, 2026 · Updated 2 weeks ago
- This is a python implementation for stitching images. ☆231 · Oct 3, 2024 · Updated last year
- SCUDA is a GPU over IP bridge allowing GPUs on remote machines to be attached to CPU-only machines. ☆1,856 · Jan 4, 2026 · Updated 3 months ago
- VortexNet: Neural Computing through Fluid Dynamics ☆49 · Jan 19, 2025 · Updated last year
- Control your Roku with hand gestures using Mediapipe and Python ☆17 · Dec 5, 2024 · Updated last year
- ☆199 · May 5, 2025 · Updated 11 months ago
- Sync your personal dotfiles to an encrypted github ☆18 · Mar 19, 2024 · Updated 2 years ago
- Achieve the llama3 inference step-by-step, grasp the core concepts, master the process derivation, implement the code. ☆630 · Feb 24, 2025 · Updated last year
- Analyze your image in seconds with AI ☆63 · May 28, 2024 · Updated last year
- A project providing a Graphic Walker Pane for use with HoloViz Panel. ☆352 · Dec 3, 2025 · Updated 4 months ago
- A browser-based, WebGL2 implementation of GPT-2 with transform block and attention matrix visualization ☆342 · Oct 24, 2025 · Updated 5 months ago
- A Modern C++ Wrapper for TensorFlow ☆53 · May 8, 2025 · Updated 11 months ago
- OpenCV+YOLO+LLAVA powered video surveillance system ☆792 · Oct 21, 2025 · Updated 5 months ago
- Omni SenseVoice: High-Speed Speech Recognition with words timestamps 🗣️🎯 ☆890 · Dec 10, 2025 · Updated 4 months ago
- LD_PRELOADable library for exploring the glibc heap ☆108 · Mar 6, 2025 · Updated last year
- minimal carball on linux. fun for humans and ai! ☆205 · Oct 29, 2024 · Updated last year
- Felafax is building AI infra for non-NVIDIA GPUs ☆570 · Jan 24, 2025 · Updated last year
- AcSecurity is a Python module designed to scan applications for common security vulnerabilities. It checks for hardcoded secrets, depende… ☆16 · Aug 29, 2025 · Updated 7 months ago
- LLM Analytics ☆709 · Oct 19, 2024 · Updated last year
- Run larger LLMs with longer contexts on Apple Silicon by using differentiated precision for KV cache quantization. KVSplit enables 8-bit … ☆361 · May 21, 2025 · Updated 10 months ago
- Bayesian Optimization as a Coverage Tool for Evaluating LLMs. Accurate evaluation (benchmarking) that's 10 times faster with just a few l… ☆289 · Mar 18, 2026 · Updated last month
- In-browser g-code simulator ☆133 · Feb 28, 2026 · Updated last month
- Self-hosted environment for programming tutorial by LLM ☆298 · Apr 8, 2026 · Updated last week
- An easily-trained baby GPT that can stand in for the real thing. Based on Andrej Karpathy's makemore, but set up to mimic a llama-cpp ser… ☆28 · Dec 12, 2023 · Updated 2 years ago
- Personal Cloud Storage and File Management Solution with Privacy and Security ☆18 · Nov 12, 2024 · Updated last year
- Financial instrument definitions built with Python and Pydantic ☆198 · Feb 14, 2025 · Updated last year
- Work with LLMs on a local environment using containers ☆291 · Apr 11, 2026 · Updated last week
- Simple inbound/outbound packet sniffer ☆31 · Oct 2, 2024 · Updated last year
- Self-hostable TikTok feed for your clips. Make a TikTok feed with your own videos ☆342 · Mar 13, 2026 · Updated last month
- A collection of tamagotchi characters to give AI assistants an identity. ☆58 · Oct 28, 2024 · Updated last year
- Animating R1's thoughts. ☆382 · Feb 17, 2025 · Updated last year
- A simple app for downloading YouTube Shorts transcripts. Built to self-host with Python and Streamlit. Free and open source. ☆32 · Dec 4, 2024 · Updated last year
- Collection of high-performance, CUDA-accelerated fluid dynamics and physics simulators, including SPH, hypersonic flow, and reaction-diff… ☆61 · Feb 23, 2026 · Updated last month
- dead mans switch without reliance on your system, infra, or application ☆329 · Apr 12, 2026 · Updated last week
- A simple app for downloading YouTube videos, transcripts, thumbnails, and channel data all in one place. Built to self-host with Python,… ☆17 · Dec 4, 2024 · Updated last year
- donut is a zero setup required SRT+MPEG-TS -> WebRTC Bridge powered by Pion. ☆360 · Jun 7, 2024 · Updated last year
- Create a clean, easy to read resume in pure Python. ☆82 · Dec 1, 2025 · Updated 4 months ago