daskol / llama.pyLinks
Python bindings to llama.cpp
☆27Updated 2 years ago
Alternatives and similar repositories for llama.py
Users that are interested in llama.py are comparing it to the libraries listed below
Sorting:
- Python examples using the bigcode/tiny_starcoder_py 159M model to generate code☆45Updated 2 years ago
- A pythonic library providing light-weighted interface with LLMs☆131Updated 8 months ago
- a tiny, exploitable chatbot that can use tools☆32Updated 2 years ago
- Host the GPTQ model using AutoGPTQ as an API that is compatible with text generation UI API.☆90Updated 2 years ago
- Unofficial python bindings for the rust llm library. 🐍❤️🦀☆76Updated 2 years ago
- Experimental sampler to make LLMs more creative☆31Updated 2 years ago
- A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.☆64Updated 2 years ago
- A simple experiment on letting two local LLM have a conversation about anything!☆112Updated last year
- Embedding models from Jina AI☆65Updated 2 years ago
- Python bindings for llama.cpp☆68Updated last year
- Drop in replacement for OpenAI, but with Open models.☆156Updated 2 years ago
- A collection of prompts for Llama☆102Updated 2 years ago
- A guidance compatibility layer for llama-cpp-python☆36Updated 2 years ago
- Simple LLM inference server☆20Updated last year
- Port of Microsoft's BioGPT in C/C++ using ggml☆85Updated last year
- ☆12Updated last year
- GPT-2 small trained on phi-like data☆68Updated last year
- codellama on CPU without Docker☆25Updated 2 years ago
- PyGPTPrompt: A CLI tool that manages context windows for AI models, facilitating user interaction and data ingestion for optimized long-t…☆30Updated last year
- Simple, Fast, Parallel Huggingface GGML model downloader written in python☆24Updated 2 years ago
- Some simple scripts that I use day-to-day when working with LLMs and Huggingface Hub☆161Updated 2 years ago
- Port of Facebook's LLaMA model in C/C++☆21Updated 2 years ago
- An OpenAI-like LLaMA inference API☆113Updated 2 years ago
- Co-Coder is a Python package that streamlines error debugging from Open AI chat GPT and Google Bard by providing hints, example code, and…☆46Updated 2 years ago
- No-messing-around sh client for llama.cpp's server☆30Updated last year
- An offline CPU-first low-resource chat application to perform RAG on your corpus of data. Powered by OpenChat and CTranslate2.☆14Updated 8 months ago
- Plug n Play GBNF Compiler for llama.cpp☆28Updated 2 years ago
- Embeddings focused small version of Llama NLP model☆107Updated 2 years ago
- Machine learning library, Distributed training, Deep learning, Reinforcement learning, Models, TensorFlow, PyTorch☆70Updated last week
- Modified Beam Search with periodical restart☆12Updated last year