A collection of experiments related to LLM inference with llama.cpp/mlx
☆40 · Mar 27, 2026 · Updated this week
Alternatives and similar repositories for llama-sandbox
Users that are interested in llama-sandbox are comparing it to the libraries listed below.
- minimal C implementation of speculative decoding based on llama2.c ☆28 · Jul 15, 2024 · Updated last year
- Stable Diffusion in TensorRT 8.5+ ☆15 · Mar 19, 2023 · Updated 3 years ago
- HunyuanDiT with TensorRT and libtorch ☆18 · May 22, 2024 · Updated last year
- Inference of Mamba, Mamba2 and Mamba3 models in pure C ☆199 · Mar 18, 2026 · Updated last week
- Recording models ☆12 · Sep 19, 2023 · Updated 2 years ago
- Guess the Hacker News titles ☆12 · Mar 24, 2022 · Updated 4 years ago
- Repository for ICLR'23 Long-tailed Learning Requires Feature Learning ☆10 · Feb 22, 2023 · Updated 3 years ago
- BiSeNetV1 and BiSeNetV2 deployment implemented with TensorRT ☆20 · Apr 14, 2022 · Updated 3 years ago
- ☆22 · Apr 10, 2024 · Updated last year
- monodepth running in Android by ncnn ☆23 · Oct 12, 2021 · Updated 4 years ago
- ☆12 · May 2, 2022 · Updated 3 years ago
- Proof-of-concept of global switching between numpy/jax/pytorch in a library. ☆18 · Jun 18, 2024 · Updated last year
- Downsampling array of intervals ☆26 · Dec 11, 2019 · Updated 6 years ago
- A live multiplayer trivia game where users can bid for the subject of the next question ☆29 · Jan 9, 2026 · Updated 2 months ago
- Code for CIKM 2021 best short paper nomination "Modeling Sequences as Distributions with Uncertainty for Sequential Recommendation" https… ☆16 · Jun 11, 2021 · Updated 4 years ago
- ☆33 · Jul 23, 2024 · Updated last year
- Llama3 Streaming Chat Sample ☆22 · Apr 24, 2024 · Updated last year
- DINOv2 inference engine written in C/C++ using ggml and OpenCV. ☆89 · May 6, 2025 · Updated 10 months ago
- segment-anything based mnn ☆36 · Dec 13, 2023 · Updated 2 years ago
- ☆16 · Jul 11, 2025 · Updated 8 months ago
- Gaussian blur for ImGui in Dx12 ☆33 · Mar 6, 2025 · Updated last year
- WebAssembly (Wasm) Build and Bindings for llama.cpp ☆288 · Jul 23, 2024 · Updated last year
- Speech-to-text transcription VST3/ARA plugin ☆57 · Feb 2, 2026 · Updated last month
- A tool convert TensorRT engine/plan to a fake onnx ☆41 · Nov 22, 2022 · Updated 3 years ago
- A well typed by construction kernel language for bidirectional programming ☆14 · Jan 2, 2025 · Updated last year
- This project is intended to build and deploy an SNPE model on Qualcomm Devices, which are having unsupported layers which are not part of… ☆10 · Oct 4, 2021 · Updated 4 years ago
- A friendly Zig launcher and toolchain manager. ☆21 · Dec 2, 2025 · Updated 3 months ago
- Sample that converts the MobileSAM encoder/decoder to ONNX and runs inference ☆12 · Apr 11, 2024 · Updated last year
- This is a joint implementation of AdaShift optimizer, LGANs, and MaxGP. ☆14 · Oct 7, 2020 · Updated 5 years ago
- A tiny, didactical implementation of LLAMA 3 ☆42 · Dec 2, 2024 · Updated last year
- ☆30 · Nov 16, 2024 · Updated last year
- YoloV10 for a bare Raspberry Pi 4 or 5 ☆23 · Jun 21, 2024 · Updated last year
- YOLOv12 TensorRT end-to-end accelerated model inference and INT8 quantization implementation ☆13 · Mar 5, 2025 · Updated last year
- Binary Neural Network-based COVID-19 Face-Mask Wear and Positioning Predictor on Edge Devices ☆12 · Jul 1, 2021 · Updated 4 years ago
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min… ☆26 · Nov 11, 2024 · Updated last year
- 🚧 SimpleXMQ - JavaScript SMP protocol client and agent 🏗 ☆13 · Jan 4, 2022 · Updated 4 years ago
- Nix-friendly fork of: Optimized Stable Diffusion modified to run on lower GPU VRAM ☆10 · Sep 11, 2022 · Updated 3 years ago
- yolov7-pose end-to-end TRT implementation ☆27 · Sep 8, 2022 · Updated 3 years ago
- Contrastive Learning with Model Augmentation ☆18 · Aug 3, 2022 · Updated 3 years ago