shubham0204 / llama.cpp-simple-chat-interfaceLinks
Build a simple CMD chat interface with llama.cpp and C++
☆9Updated 3 months ago
Alternatives and similar repositories for llama.cpp-simple-chat-interface
Users that are interested in llama.cpp-simple-chat-interface are comparing it to the libraries listed below
Sorting:
- instinct.cpp provides ready to use alternatives to OpenAI Assistant API and built-in utilities for developing AI Agent applications (RAG,…☆49Updated 11 months ago
- GGML implementation of BERT model with Python bindings and quantization.☆55Updated last year
- General purpose GPU compute framework built on Vulkan to support 1000s of cross vendor graphics cards (AMD, Qualcomm, NVIDIA & friends). …☆48Updated 3 months ago
- A low latency, fault tolerant API for accessing LLM's written in C++ using llama.cpp.☆10Updated 2 months ago
- mnn tts demo.☆16Updated 3 weeks ago
- Inference RWKV v5, v6 and v7 with Qualcomm AI Engine Direct SDK☆68Updated last week
- A chat UI for Llama.cpp☆13Updated this week
- Thin wrapper around GGML to make life easier☆34Updated this week
- Recording models☆13Updated last year
- Ask shortgpt for instant and concise answers☆13Updated 2 years ago
- ☆66Updated 2 years ago
- ggml implementation of embedding models including SentenceTransformer and BGE☆58Updated last year
- Rust bindings for CTranslate2☆14Updated last year
- YOLOv10 C++ implementation using OpenVINO for efficient and accurate real-time object detection.☆69Updated 2 months ago
- Inference Llama/Llama2/Llama3 Modes in NumPy☆21Updated last year
- Rust crate for some audio utilities☆23Updated 2 months ago
- Streaming TTS based on Piper with optional RK3588 NPU support☆89Updated last month
- cortex.llamacpp is a high-efficiency C++ inference engine for edge computing. It is a dynamic library that can be loaded by any server a…☆40Updated this week
- Inference slice of marian for bergamot's tiny11 models. Faster to compile, and wield. Fewer model-archs than bergamot-translator.☆11Updated 7 months ago
- Run Vision LLMs, TTS and STT APIs. Website and API for https://text-generator.io☆35Updated this week
- A Toolkit to Help Optimize Onnx Model☆153Updated this week
- Browse, search, and visualize ONNX models.☆30Updated last month
- Inference deployment of the llama3☆11Updated last year
- Light WebUI for lm.rs☆23Updated 7 months ago
- Proof of concept for running moshi/hibiki using webrtc☆19Updated 3 months ago
- An AI Vision Language Model System for extracting structured knowledge graph information(JSON) from images of process diagrams☆20Updated 2 months ago
- TTS support with GGML☆43Updated this week
- ☆11Updated 4 months ago
- Trying to deconstruct RWKV in understandable terms☆14Updated 2 years ago
- ☆123Updated last year