jart / llama.cpp
Port of Facebook's LLaMA model in C/C++
☆22Updated 5 months ago
Alternatives and similar repositories for llama.cpp:
Users that are interested in llama.cpp are comparing it to the libraries listed below
- AI Edge Quantizer: flexible post training quantization for LiteRT models.☆23Updated this week
- Tensor library for machine learning☆21Updated last year
- General purpose GPU compute framework built on Vulkan to support 1000s of cross vendor graphics cards (AMD, Qualcomm, NVIDIA & friends). …☆45Updated 4 months ago
- Inference Llama 2 in one file of pure C☆29Updated last year
- ☆18Updated 4 months ago
- ☆16Updated 4 months ago
- Explore training for quantized models☆15Updated last month
- Effort to open-source 10.5 trillion parameter Gemini model.☆17Updated last year
- A chat UI for Llama.cpp☆12Updated last week
- The official Python library for Formulaic☆16Updated 9 months ago
- An open source replication of the stawberry method that leverages Monte Carlo Search with PPO and or DPO☆28Updated last week
- A novel temporal fusion framework for propelling autoregressive model inference☆11Updated this week
- ☆26Updated 11 months ago
- Lightweight OpenAI wrapper using FastAPI. Add rate limits to OpenAI usage, optionally log and store all API calls, and share regulated Op…☆13Updated last year
- Generate python ctypes classes from C headers. Requires LLVM clang☆15Updated 6 months ago
- Inference Llama 2 in C++☆45Updated 9 months ago
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆17Updated 4 months ago
- Experiments with BitNet inference on CPU☆53Updated 10 months ago
- minimalistic AI library that resembles HF's transformers☆12Updated last month
- ⚡ FutureGPT - Application development framework that connects GPT-4 with external data, the internet, other applications and language mod…☆12Updated last year
- ☆65Updated 2 months ago
- Github repo for Peifeng's internship project☆13Updated last year
- MER is a software that identifies and highlights manipulative communication in text from human conversations and AI-generated responses. …☆13Updated 6 months ago
- Inference Llama/Llama2/Llama3 Modes in NumPy☆20Updated last year
- Machine Learning Agility (MLAgility) benchmark and benchmarking tools☆38Updated 2 months ago
- Document Automation Reference Kit☆14Updated 7 months ago
- ☆40Updated last year
- SynapseAI Core is a reference implementation of the SynapseAI API running on Habana Gaudi☆38Updated 2 weeks ago
- ☆32Updated last year