daskol / llama.py
Python bindings to llama.cpp
☆27 Updated last year
Related projects
Alternatives and complementary repositories for llama.py
- An open source replication of the strawberry method that leverages Monte Carlo Search with PPO and/or DPO ☆20 Updated last week
- Alpha-Zero Connect Four NN trained via self play ☆13 Updated last month
- 5X faster 60% less memory QLoRA finetuning ☆21 Updated 5 months ago
- Fast inference of Instruct tuned LLaMa on your personal devices. ☆22 Updated last year
- A live multiplayer trivia game where users can bid for the subject of the next question ☆22 Updated last week
- Local LLM inference & management server with built-in OpenAI API ☆31 Updated 6 months ago
- Experimental sampler to make LLMs more creative ☆30 Updated last year
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform ☆36 Updated 9 months ago
- Simple LLM inference server ☆17 Updated 4 months ago
- a tiny, exploitable chatbot that can use tools ☆30 Updated last year
- A quick and optimized solution to manage llama based gguf quantized models, download gguf files, retrieve message formatting, add more mo… ☆12 Updated 9 months ago
- ☆25 Updated 2 months ago
- ☆55 Updated 11 months ago
- Testing KAN-based text generation GPT models ☆15 Updated 6 months ago
- 4 bits quantization of SantaCoder using GPTQ ☆53 Updated last year
- Inference code for LLaMA models ☆35 Updated last year
- Python examples using the bigcode/tiny_starcoder_py 159M model to generate code ☆44 Updated last year
- Merge LLMs that are split into parts ☆25 Updated last year
- Host the GPTQ model using AutoGPTQ as an API that is compatible with text generation UI API. ☆91 Updated last year
- ☆31 Updated 10 months ago
- ☆49 Updated 7 months ago
- ☆43 Updated 3 months ago
- Simple setup to self-host LLaMA3-70B model with an OpenAI API ☆18 Updated 6 months ago
- A QT GUI for large language models ☆24 Updated 10 months ago
- A clone of OpenAI's Tokenizer page for HuggingFace Models ☆44 Updated 11 months ago
- Integrate an LLM copilot within your Keras model development workflow ☆28 Updated last year
- Light WebUI for lm.rs ☆21 Updated 3 weeks ago
- Mistral7B playing DOOM ☆27 Updated 7 months ago
- a version of baby agi using dspy and typed predictors ☆17 Updated 8 months ago
- Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" in PyTorch and Zeta ☆13 Updated this week