yacineMTB / llama.cpp
Port of Facebook's LLaMA model in C/C++
☆16Updated last year
Alternatives and similar repositories for llama.cpp:
Users that are interested in llama.cpp are comparing it to the libraries listed below
- A repository of prompts and Python scripts for intelligent transformation of raw text into diverse formats.☆30Updated last year
- A synthetic story narration dataset to study small audio LMs.☆31Updated 11 months ago
- Just large language models. Hackable, with as little abstraction as possible. Done for my own purposes, feel free to rip.☆44Updated last year
- Turing machines, Rule 110, and A::B reversal using Claude 3 Opus.☆60Updated 8 months ago
- Simple embedding -> text model trained on a small subset of Wikipedia sentences.☆153Updated last year
- Command-line script for inferencing from models such as MPT-7B-Chat☆101Updated last year
- Run GGML models with Kubernetes.☆173Updated last year
- Tiny inference-only implementation of LLaMA☆91Updated 9 months ago
- Training Models Daily☆17Updated last year
- Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search.☆26Updated last month
- inference code for mixtral-8x7b-32kseqlen☆99Updated last year
- A single notebook for fine-tuning GPT-3.5 turbo☆31Updated 5 months ago
- Port of OpenAI's Whisper model in C/C++☆10Updated last year
- WebGPU LLM inference tuned by hand☆148Updated last year
- ☆40Updated last year
- compute, storage, and networking infra at home☆64Updated 11 months ago
- Command-line script for inferencing from models such as falcon-7b-instruct☆76Updated last year
- a highly efficient compression algorithm for the n1 implant (neuralink's compression challenge)☆46Updated 7 months ago
- Modified Stanford-Alpaca Trainer for Training Replit's Code Model☆40Updated last year
- ☆24Updated last year
- A Simple Discord Bot for the Alpaca LLM☆101Updated last year
- ☆20Updated 2 months ago
- Grounding LLM mathematical reasoning with proof assistants.☆60Updated last year
- Flexible, efficient, and context-aware generation from large unstructured knowledge sources.☆15Updated 8 months ago
- tinygrad port of the RWKV large language model.☆44Updated 7 months ago
- Stream of my favorite papers and links☆39Updated 4 months ago
- Simplex Random Feature attention, in PyTorch☆72Updated last year
- LLM training in simple, raw C/CUDA☆18Updated 8 months ago
- This repository explains and provides examples for "concept anchoring" in GPT4.☆72Updated last year