absadiki / pyllamacpp
Python bindings for llama.cpp
☆65Updated last year
Alternatives and similar repositories for pyllamacpp
Users that are interested in pyllamacpp are comparing it to the libraries listed below
Sorting:
- Harnessing the Memory Power of the Camelids☆146Updated last year
- Deploy your GGML models to HuggingFace Spaces with Docker and gradio☆36Updated last year
- Automated prompting and scoring framework to evaluate LLMs using updated human knowledge prompts☆111Updated last year
- GPT-2 small trained on phi-like data☆66Updated last year
- Falcon LLM ggml framework with CPU and GPU support☆246Updated last year
- An OpenAI-like LLaMA inference API☆112Updated last year
- Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA☆123Updated last year
- Unofficial python bindings for the rust llm library. 🐍❤️🦀☆75Updated last year
- Local LLM ReAct Agent with Guidance☆158Updated last year
- ☆168Updated last year
- Some simple scripts that I use day-to-day when working with LLMs and Huggingface Hub☆162Updated last year
- Simple and fast server for GPTQ-quantized LLaMA inference☆24Updated last year
- A Qt GUI for large language models☆42Updated last year
- Dynamic parameter modulation for oobabooga's text-generation-webui that adjusts generation parameters to better mirror user affect.☆35Updated last year
- An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.☆37Updated last year
- Creates an Langchain Agent which uses the WebUI's API and Wikipedia to work☆74Updated last year
- LLaMA Server combines the power of LLaMA C++ with the beauty of Chatbot UI.☆123Updated last year
- Patch for MPT-7B which allows using and training a LoRA☆58Updated last year
- ☆277Updated last year
- Instruct-tuning LLaMA on consumer hardware☆66Updated 2 years ago
- A Simple Discord Bot for the Alpaca LLM☆101Updated last year
- A collection of prompts for Llama☆100Updated 2 years ago
- Extend the original llama.cpp repo to support redpajama model.☆117Updated 8 months ago
- The code we currently use to fine-tune models.☆114Updated last year
- A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.☆64Updated last year
- Roy: A lightweight, model-agnostic framework for crafting advanced multi-agent systems using large language models.☆78Updated last year
- Python examples using the bigcode/tiny_starcoder_py 159M model to generate code☆44Updated last year
- 🔓 The open-source autonomous agent LLM initiative 🔓☆91Updated last year
- a tiny, exploitable chatbot that can use tools☆31Updated 2 years ago
- oobaboga -text-generation-webui implementation of wafflecomposite - langchain-ask-pdf-local☆70Updated 2 years ago