iangitonga / tinyllama.cppLinks
A C++ implementation of tinyllama inference on CPU.
☆12Updated last year
Alternatives and similar repositories for tinyllama.cpp
Users that are interested in tinyllama.cpp are comparing it to the libraries listed below
Sorting:
- instinct.cpp provides ready to use alternatives to OpenAI Assistant API and built-in utilities for developing AI Agent applications (RAG,…☆57Updated last year
- Inference Llama 2 in one file of pure C☆12Updated 2 years ago
- A chat UI for Llama.cpp☆15Updated 2 months ago
- cortex.llamacpp is a high-efficiency C++ inference engine for edge computing. It is a dynamic library that can be loaded by any server a…☆42Updated 7 months ago
- qwen2 and llama3 cpp implementation☆49Updated last year
- sherpa-onnx Go package for Windows☆13Updated this week
- Thin wrapper around GGML to make life easier☆42Updated 3 months ago
- 适用于sophon bm1684x,基于 Langchain 与 ChatGLM 等语言模型的本地知识库问答☆14Updated last year
- Light WebUI for lm.rs☆24Updated last year
- EdgeInfer enables efficient edge intelligence by running small AI models, including embeddings and OnnxModels, on resource-constrained de…☆50Updated last year
- A converter and basic tester for rwkv onnx☆43Updated 2 years ago
- Simple, Fast, Parallel Huggingface GGML model downloader written in python☆24Updated 2 years ago
- Trying to deconstruct RWKV in understandable terms☆14Updated 2 years ago
- Course Project for COMP4471 on RWKV☆17Updated last year
- 33B Chinese LLM, DPO QLORA, 100K context, AirLLM 70B inference with single 4GB GPU☆13Updated last year
- ggml implementation of BERT Embedding☆26Updated 2 years ago
- Inference Llama 2 in one file of pure C++☆87Updated 2 years ago
- Controllable Language Model Interactions in TypeScript☆10Updated last year
- A next-generation dynamic and high-performance language for AI and IOT with natural born distributed computing ability.☆68Updated this week
- pure go for rwkv☆19Updated 2 years ago
- An excel-like spreadsheet component for SQLPage☆16Updated 6 months ago
- A set of visualization engines.☆14Updated this week
- Compare openresty vs nginx + PUC_lua☆16Updated 2 years ago
- Yet Another (LLM) Web UI, made with Gemini☆12Updated last year
- Local Ollama with Qdrant RAG: Embed, index, and enhance models for retrieval-augmented generation. Get started with easy setup for powerf…☆25Updated last year
- UnitEval is a benchmarking and evaluation tools for AutoDev Coder.☆13Updated 2 years ago
- AI Based "Happiness Optimizer"☆12Updated last year
- xllamacpp - a Python wrapper of llama.cpp☆72Updated last week
- AirLLM 70B inference with single 4GB GPU☆17Updated 7 months ago
- ☆15Updated 9 months ago