absadiki / pyllamacpp
Python bindings for llama.cpp
☆64 Updated 10 months ago
Alternatives and similar repositories for pyllamacpp:
Users who are interested in pyllamacpp are comparing it to the libraries listed below:
- An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm. ☆37 Updated last year
- oobabooga/text-generation-webui implementation of wafflecomposite/langchain-ask-pdf-local ☆67 Updated last year
- Deploy your GGML models to HuggingFace Spaces with Docker and Gradio ☆35 Updated last year
- Unofficial Python bindings for the Rust llm library. 🐍❤️🦀 ☆73 Updated last year
- Automated prompting and scoring framework to evaluate LLMs using updated human knowledge prompts ☆112 Updated last year
- A guidance compatibility layer for llama-cpp-python ☆34 Updated last year
- A prompt/context management system ☆167 Updated last year
- GPT-2 small trained on phi-like data ☆65 Updated 11 months ago
- Python examples using the bigcode/tiny_starcoder_py 159M model to generate code ☆44 Updated last year
- Local LLM ReAct Agent with Guidance ☆155 Updated last year
- Falcon LLM ggml framework with CPU and GPU support ☆245 Updated 11 months ago
- 🔓 The open-source autonomous agent LLM initiative 🔓 ☆90 Updated 11 months ago
- The code we currently use to fine-tune models. ☆112 Updated 8 months ago
- Simple and fast server for GPTQ-quantized LLaMA inference ☆24 Updated last year
- LLaMA Server combines the power of LLaMA C++ with the beauty of Chatbot UI. ☆116 Updated last year
- A Qt GUI for large language models ☆40 Updated last year
- Roy: A lightweight, model-agnostic framework for crafting advanced multi-agent systems using large language models. ☆79 Updated last year
- Some simple scripts that I use day-to-day when working with LLMs and Huggingface Hub ☆156 Updated last year
- Harnessing the Memory Power of the Camelids ☆146 Updated last year
- An OpenAI-like LLaMA inference API ☆113 Updated last year
- ☆275 Updated last year
- A guidance language for controlling large language models. ☆44 Updated last year
- BabyAGI-🦙: Enhanced for Llama models (running 100% local) and persistent memory, with smart internet search based on BabyCatAGI and docu… ☆89 Updated last year
- Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA ☆123 Updated last year
- Load local LLMs effortlessly in a Jupyter notebook for testing purposes alongside LangChain or other agents. Contains Oobabooga and Kobol… ☆212 Updated last year
- Model REVOLVER, a human-in-the-loop model mixing system. ☆33 Updated last year
- Python bindings for the C++ port of the GPT4All-J model. ☆38 Updated last year
- Creates a LangChain agent that uses the WebUI's API and Wikipedia to work ☆72 Updated last year
- Just a simple HowTo for https://github.com/johnsmith0031/alpaca_lora_4bit ☆31 Updated last year