iangitonga / tinyllama.cpp
A C++ implementation of tinyllama inference on CPU.
☆12Updated last year
Alternatives and similar repositories for tinyllama.cpp
Users who are interested in tinyllama.cpp compare it to the libraries listed below.
- instinct.cpp provides ready to use alternatives to OpenAI Assistant API and built-in utilities for developing AI Agent applications (RAG,…☆57Updated last year
- cortex.llamacpp is a high-efficiency C++ inference engine for edge computing. It is a dynamic library that can be loaded by any server a…☆42Updated 7 months ago
- Trying to deconstruct RWKV in understandable terms☆14Updated 2 years ago
- Thin wrapper around GGML to make life easier☆42Updated 3 months ago
- Inference Llama 2 in one file of pure C☆12Updated 2 years ago
- Local knowledge-base question answering for the Sophon BM1684X, based on Langchain and language models such as ChatGLM☆14Updated last year
- EdgeInfer enables efficient edge intelligence by running small AI models, including embeddings and OnnxModels, on resource-constrained de…☆50Updated last year
- A chat UI for Llama.cpp☆15Updated 2 months ago
- Light WebUI for lm.rs☆24Updated last year
- AirLLM 70B inference with single 4GB GPU☆17Updated 7 months ago
- Yet Another (LLM) Web UI, made with Gemini☆12Updated last year
- Inference Llama 2 in one file of pure C++☆87Updated 2 years ago
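- Builds high-quality training datasets for large language models from video platforms such as YouTube and bilibili and from web pages, using 01.AI (零一万物) large models or small local models via ollama (support for customizable output training-data formats is planned)☆19Updated last year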
- A converter and basic tester for rwkv onnx☆43Updated 2 years ago
- A game of pong made by MetaGPT and ChatGPT Code Interpreter☆14Updated 2 years ago
- Another frontend for Ollama☆30Updated 2 months ago
- Automate building of TeXmacs on windows using MSys2/Mingw-w32☆10Updated 3 years ago
- A set of visualization engines.☆14Updated last week
- ☆17Updated 2 months ago
- Course Project for COMP4471 on RWKV☆17Updated last year
- The Lily programming language ⚜☆10Updated last month
- Realistic 3D trees implemented with OpenGL and VC++ using parametric L-systems from fractal theory; the trees can be transformed by adjusting parameters.☆12Updated 7 years ago
- ☆14Updated 2 years ago
- convert a saved pytorch model to gguf and generate as much corresponding ggml c code as possible☆15Updated 2 years ago
- BlinkDL's RWKV-v4 running in the browser☆48Updated 2 years ago
- Running Microsoft's BitNet via Electron, React & Astro☆52Updated 4 months ago
- 33B Chinese LLM, DPO QLORA, 100K context, AirLLM 70B inference with single 4GB GPU☆13Updated last year
- RWKV (Receptance Weighted Key Value) is an RNN with Transformer-level performance☆41Updated 2 years ago
- An OpenAI-compatible API that integrates LLM, Embedding and Reranker.☆18Updated 5 months ago
- SQL-parser implemented in LISP☆14Updated 3 years ago