A collection of experiments related to LLM inference with llama.cpp/mlx
☆40May 28, 2026Updated this week
Alternatives and similar repositories for llama-sandbox
Users that are interested in llama-sandbox are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Go language bindings for the ggwave C++ library☆14Apr 9, 2025Updated last year
- minimal C implementation of speculative decoding based on llama2.c☆30Jul 15, 2024Updated last year
- ffmpeg+cuvid+tensorrt+multicamera☆12Dec 31, 2024Updated last year
- Stable Diffusion in TensorRT 8.5+☆15Mar 19, 2023Updated 3 years ago
- HunyuanDiT with TensorRT and libtorch☆18May 22, 2024Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Build tools for Enyo 2.6+☆15Nov 26, 2018Updated 7 years ago
- ☆18Dec 7, 2023Updated 2 years ago
- Inference of Mamba, Mamba2 and Mamba3 models in pure C☆201Mar 18, 2026Updated 2 months ago
- Recording models☆12Sep 19, 2023Updated 2 years ago
- 本仓库在OpenVINO推理框架下部署Nanodet检测算法,并重写预处理和后处理部分,具有超高性能!让你在Intel CPU平台上的检测速度起飞! 并基于NNCF和PPQ工具将模型量化(PTQ)至int8精度,推理速度更快!☆16Jun 14, 2023Updated 2 years ago
- The application performs real-time inference on audio from an ALSA capture device☆39Jun 19, 2025Updated 11 months ago
- Repository for ICLR'23 Long-tailed Learning Requires Feature Learning☆10Feb 22, 2023Updated 3 years ago
- TensorRT实现BiSeNetV1与BiSeNetV2部署☆20Apr 14, 2022Updated 4 years ago
- ☆23Apr 10, 2024Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆11Apr 10, 2023Updated 3 years ago
- monodepth running in Android by ncnn☆23Oct 12, 2021Updated 4 years ago
- ☆20Dec 29, 2023Updated 2 years ago
- 📊 an html tap reporter☆20Sep 7, 2023Updated 2 years ago
- High-level, optionally asynchronous Rust bindings to llama.cpp☆245Jun 5, 2024Updated last year
- Proof-of-concept of global switching between numpy/jax/pytorch in a library.☆17Jun 18, 2024Updated last year
- Downsampling array of intervals☆26Dec 11, 2019Updated 6 years ago
- LLM-powered lossless compression tool☆310Jan 2, 2026Updated 4 months ago
- ☆18Jun 16, 2025Updated 11 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Llama3 Streaming Chat Sample☆22Apr 24, 2024Updated 2 years ago
- An implementation of Deepmind's Promptbreeder.☆23Dec 22, 2023Updated 2 years ago
- DINOv2 inference engine written in C/C++ using ggml and OpenCV.☆92May 6, 2025Updated last year
- segment-anything based mnn☆37Dec 13, 2023Updated 2 years ago
- Gaussian blur for ImGui in Dx12☆32Mar 6, 2025Updated last year
- LightNet is an optimized deep learning framework based on the popular darknet platform. It is optimized to create efficient and high-spee…☆38Sep 17, 2023Updated 2 years ago
- [ICML 2022] Official implementation of "Score-Guided Intermediate Layer Optimization: Fast Langevin Mixing for Inverse Problems".☆12Jul 19, 2022Updated 3 years ago
- Bayesian scaling laws for in-context learning.☆15Mar 12, 2025Updated last year
- A well typed by construction kernel language for bidirectional programming☆14Jan 2, 2025Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- This is a joint implementation of AdaShift optimizer, LGANs, and MaxGP.☆14Oct 7, 2020Updated 5 years ago
- YoloV10 for a bare Raspberry Pi 4 or 5☆23Jun 21, 2024Updated last year
- YOLOv12 TensorRT 端到端模型加速推理和INT8量化实现☆13Mar 5, 2025Updated last year
- Binary Neural Network-based COVID-19 Face-Mask Wear and Positioning Predictor on Edge Devices☆12Jul 1, 2021Updated 4 years ago
- yolov7-pose end2end TRT实现☆27Sep 8, 2022Updated 3 years ago
- Using OpenAI's Whisper via whisper.cpp with SFML☆14Dec 2, 2025Updated 5 months ago
- Template repo for simple ImTui apps☆26Jul 25, 2022Updated 3 years ago