A minimalistic C++ Jinja templating engine for LLM chat templates
☆214Sep 22, 2025Updated 8 months ago
Alternatives and similar repositories for minja
Users that are interested in minja are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- GGML implementation of BERT model with Python bindings and quantization.☆57Feb 19, 2024Updated 2 years ago
- Example and helpers for building rust projects under cmake☆16Oct 5, 2018Updated 7 years ago
- Semantic emoji finder. Python/dash UI. Uses sentence transformer embeddings and duckdb☆20Sep 15, 2025Updated 8 months ago
- FMO (Friendli Model Optimizer)☆14Jan 8, 2025Updated last year
- Forced alignment decoder for Whisper.☆16Mar 13, 2024Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Editor with LLM generation tree exploration☆86Feb 12, 2025Updated last year
- Zeta implementation of a reusable and plug in and play feedforward from the paper "Exponentially Faster Language Modeling"☆16Nov 11, 2024Updated last year
- Thin wrapper around GGML to make life easier☆45Nov 5, 2025Updated 6 months ago
- A lovely structopt library for C++! Parse command line arguments by defining a struct! ❤️☆12Apr 24, 2023Updated 3 years ago
- Implementation of the LDP module block in PyTorch and Zeta from the paper: "MobileVLM: A Fast, Strong and Open Vision Language Assistant …☆15Mar 11, 2024Updated 2 years ago
- Build GenServer Based Graphs☆18Aug 22, 2025Updated 9 months ago
- faster inference☆28Jan 20, 2025Updated last year
- For audio visualization and playback in Jupyter notebooks.☆17Nov 25, 2025Updated 6 months ago
- ☆15May 11, 2025Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- [⛔️ DEPRECATED] Friendli: the fastest serving engine for generative AI☆50Jun 25, 2025Updated 11 months ago
- Official implementation of ECCV24 paper: POA☆24Aug 8, 2024Updated last year
- Yet another `llama.cpp` Rust wrapper☆12May 12, 2026Updated last week
- EDN I/O library for Objective-C (MacOS and iOS)☆31Jan 29, 2025Updated last year
- TensorFlow Lite C precompiled library for Windows, Linux and macOS☆15Dec 30, 2024Updated last year
- This repository is designed for deploying and managing server processes that handle embeddings using the Infinity Embedding model or Larg…☆26Mar 6, 2025Updated last year
- A minimalistic Swift implementation of the Jinja templating engine, specifically designed for parsing and rendering ML chat templates.☆128May 16, 2026Updated last week
- A lightweight Python library for running TTS models with a unified API.☆20Feb 18, 2025Updated last year
- An introduction to DSPy☆34Aug 30, 2025Updated 8 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Alpaca Core local daemon☆24May 27, 2025Updated 11 months ago
- NanoGPT (124M) in 5 minutes☆15Feb 14, 2025Updated last year
- OCTAVE protocol - structured AI communication with 3-20x token reduction. MCP server with lenient-to-canonical pipeline and schema valida…☆52Updated this week
- Diffusion model(SD,Flux,Wan,Qwen Image,Z-Image,...) inference in pure C/C++☆6,080Updated this week
- Library that makes easy to display property edit screens for SwiftUI.☆18Mar 31, 2024Updated 2 years ago
- GGUF implementation in C as a library and a tools CLI program☆327May 16, 2026Updated last week
- Inference RWKV v7 in pure C.☆44Oct 10, 2025Updated 7 months ago
- ggml学习笔记,ggml是一个机器学习的推理框架☆18Mar 24, 2024Updated 2 years ago
- A single-layer, streaming codec model providing SOTA audio quality and discrete tokens designed for superior downstream modelability.☆115Jun 4, 2025Updated 11 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆17Jan 1, 2024Updated 2 years ago
- We introduce the LLAMA1 Test Set, a comprehensive open-domain world knowledge QA dataset for evaluating question-answering systems. We pr…☆23Mar 14, 2024Updated 2 years ago
- Low level C++ neural network engine. The engine provides a huge flexibility in creating neural networks. It also gives an ability for per…☆11Jan 9, 2024Updated 2 years ago
- Google Scholar自搜小脚本,每次开启命令行即显示当前citation。Small Script displaying current citation count each time the shell is opened.☆21Mar 3, 2025Updated last year
- A streaming audio reader, processor, and writer built on top of soundfile, and PyAV (bindings for FFmpeg)☆38Mar 31, 2026Updated last month
- Quantize yolov5 using pytorch_quantization.🚀🚀🚀☆15Oct 24, 2023Updated 2 years ago
- A simple frontend page to interact with an OpenAI like API☆16Jan 31, 2025Updated last year