yacineMTB / llama.cpp

Port of Facebook's LLaMA model in C/C++

☆16

Alternatives and similar repositories for llama.cpp:

Users who are interested in llama.cpp compare it to the repositories listed below.

teknium1 / RawTransform
A repository of prompts and Python scripts for intelligent transformation of raw text into diverse formats.
☆30 · Updated last year
sfcompute / tinynarrations
A synthetic story narration dataset to study small audio LMs.
☆31 · Updated 11 months ago
yacineMTB / just-large-models
Just large language models. Hackable, with as little abstraction as possible. Done for my own purposes, feel free to rip.
☆44 · Updated last year
SpellcraftAI / turing
Turing machines, Rule 110, and A::B reversal using Claude 3 Opus.
☆60 · Updated 8 months ago
MF-FOOM / wikivec2text
Simple embedding -> text model trained on a small subset of Wikipedia sentences.
☆153 · Updated last year
Birch-san / mpt-play
Command-line script for inferencing from models such as MPT-7B-Chat
☆101 · Updated last year
danielgross / ggml-k8s
Run GGML models with Kubernetes.
☆173 · Updated last year
recmo / cria
Tiny inference-only implementation of LLaMA
☆91 · Updated 9 months ago
Algomancer / The-Daily-Train
Training Models Daily
☆17 · Updated last year
JD-P / RetroInstruct
Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search.
☆26 · Updated last month
vikhyat / mixtral-inference
inference code for mixtral-8x7b-32kseqlen
☆99 · Updated last year
RyanLucas3 / poasterGPT
A single notebook for fine-tuning GPT-3.5 turbo
☆31 · Updated 5 months ago
yacineMTB / whisper.cpp
Port of OpenAI's Whisper model in C/C++
☆10 · Updated last year
kayvr / token-hawk
WebGPU LLM inference tuned by hand
☆148 · Updated last year
lachlansneff / sparsellama
☆40 · Updated last year
hitorilabs / navi
compute, storage, and networking infra at home
☆64 · Updated 11 months ago
Birch-san / falcon-play
Command-line script for inferencing from models such as falcon-7b-instruct
☆76 · Updated last year
mikaelhaji / n1-codec
a highly efficient compression algorithm for the n1 implant (neuralink's compression challenge)
☆46 · Updated 7 months ago
teknium1 / stanford_alpaca-replit
Modified Stanford-Alpaca Trainer for Training Replit's Code Model
☆40 · Updated last year
Alignment-Lab-AI / AutoMaticAssistant
☆24 · Updated last year
teknium1 / alpaca-discord
A Simple Discord Bot for the Alpaca LLM
☆101 · Updated last year
doomslide / autoloom
☆20 · Updated 2 months ago
atroyn / math-llm
Grounding LLM mathematical reasoning with proof assistants.
☆60 · Updated last year
lumpenspace / FRAG
Flexible, efficient, and context-aware generation from large unstructured knowledge sources.
☆15 · Updated 8 months ago
wozeparrot / tinyrwkv
tinygrad port of the RWKV large language model.
☆44 · Updated 7 months ago
ishan0102 / rsrch.space
Stream of my favorite papers and links
☆39 · Updated 4 months ago
notarussianteenager / srf-attention
Simplex Random Feature attention, in PyTorch
☆72 · Updated last year
Chillee / llm.c
LLM training in simple, raw C/CUDA
☆18 · Updated 8 months ago
kenshin9000 / ConceptARC-Representations
This repository explains and provides examples for "concept anchoring" in GPT4.
☆72 · Updated last year