daskol / llama.py

Python bindings to llama.cpp

☆27

Related projects

Alternatives and complementary repositories for llama.py

kyegomez / OpenStrawberry
An open-source replication of the Strawberry method that leverages Monte Carlo search with PPO and/or DPO
☆20 Updated last week
advait / c4a0
Alpha-Zero Connect Four NN trained via self-play
☆13 Updated last month
serp-ai / unsloth
5x faster, 60% less memory QLoRA fine-tuning
☆21 Updated 5 months ago
NolanoOrg / InstructLLaMa.cpp
Fast inference of instruct-tuned LLaMa on your personal devices.
☆22 Updated last year
Mihaiii / trivia
A live multiplayer trivia game where users can bid for the subject of the next question
☆22 Updated last week
elgatopanzon / gatogpt
Local LLM inference & management server with built-in OpenAI API
☆31 Updated 6 months ago
the-crypt-keeper / the-muse
Experimental sampler to make LLMs more creative
☆30 Updated last year
shivamsanju / ragswift
🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform
☆36 Updated 9 months ago
silphendio / sliced_llama
Simple LLM inference server
☆17 Updated 4 months ago
eryk-mazus / xoxo
a tiny, exploitable chatbot that can use tools
☆30 Updated last year
laelhalawani / gguf_modeldb
A quick and optimized solution to manage llama-based GGUF quantized models, download GGUF files, retrieve message formatting, add more mo…
☆12 Updated 9 months ago
slashml / awesome-finetuning
☆25 Updated 2 months ago
NousResearch / StripedHyenaTrainer
☆55 Updated 11 months ago
kabachuha / nanoGPKANT
Testing KAN-based text generation GPT models
☆15 Updated 6 months ago
mayank31398 / GPTQ-for-SantaCoder
4-bit quantization of SantaCoder using GPTQ
☆53 Updated last year
gmorenz / llama
Inference code for LLaMA models
☆35 Updated last year
the-crypt-keeper / tiny_starcoder
Python examples using the bigcode/tiny_starcoder_py 159M model to generate code
☆44 Updated last year
donaldafeith / Pytorch_Merge
Merge LLMs that are split into parts
☆25 Updated last year
mzbac / AutoGPTQ-API
Host the GPTQ model using AutoGPTQ as an API that is compatible with text generation UI API.
☆91 Updated last year
Maximilian-Winter / llama_cpp_function_calling
☆31 Updated 10 months ago
euclaise / supertrainer2000
☆49 Updated 7 months ago
NolanoOrg / SpectraSuite
☆43 Updated 3 months ago
catid / oaillama3
Simple setup to self-host the LLaMA3-70B model with an OpenAI API
☆18 Updated 6 months ago
shinomakoi / AI-Messenger
A Qt GUI for large language models
☆24 Updated 10 months ago
1rgs / tokenwiz
A clone of OpenAI's Tokenizer page for HuggingFace Models
☆44 Updated 11 months ago
fabprezja / keras-gpt-copilot
Integrate an LLM copilot within your Keras model development workflow
☆28 Updated last year
samuel-vitorino / lm.rs-webui
Light WebUI for lm.rs
☆21 Updated 3 weeks ago
PuchToTalk / DOOM-MistralAI
Mistral7B playing DOOM
☆27 Updated 7 months ago
fsndzomga / baby_agi_dspy
A version of BabyAGI using DSPy and typed predictors
☆17 Updated 8 months ago
kyegomez / SelfExtend
Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" in PyTorch and Zeta
☆13 Updated this week