absadiki / pyllamacppLinks
Python bindings for llama.cpp
β65Updated last year
Alternatives and similar repositories for pyllamacpp
Users that are interested in pyllamacpp are comparing it to the libraries listed below
Sorting:
- Unofficial python bindings for the rust llm library. πβ€οΈπ¦β75Updated last year
- The code we currently use to fine-tune models.β114Updated last year
- Falcon LLM ggml framework with CPU and GPU supportβ246Updated last year
- A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.β64Updated last year
- Harnessing the Memory Power of the Camelidsβ146Updated last year
- Local LLM ReAct Agent with Guidanceβ158Updated 2 years ago
- Automated prompting and scoring framework to evaluate LLMs using updated human knowledge promptsβ110Updated last year
- β167Updated 2 years ago
- An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.β37Updated last year
- Deploy your GGML models to HuggingFace Spaces with Docker and gradioβ37Updated 2 years ago
- Some simple scripts that I use day-to-day when working with LLMs and Huggingface Hubβ162Updated last year
- oobaboga -text-generation-webui implementation of wafflecomposite - langchain-ask-pdf-localβ71Updated 2 years ago
- Load local LLMs effortlessly in a Jupyter notebook for testing purposes alongside Langchain or other agents. Contains Oobagooga and Kobolβ¦β213Updated 2 years ago
- A guidance compatibility layer for llama-cpp-pythonβ35Updated last year
- Convenient wrapper for fine-tuning and inference of Large Language Models (LLMs) with several quantization techniques (GTPQ, bitsandbytesβ¦β147Updated last year
- Python examples using the bigcode/tiny_starcoder_py 159M model to generate codeβ44Updated 2 years ago
- GPT-2 small trained on phi-like dataβ66Updated last year
- Model REVOLVER, a human in the loop model mixing system.β33Updated last year
- An OpenAI-like LLaMA inference APIβ112Updated last year
- A prompt/context management systemβ170Updated 2 years ago
- This is our own implementation of 'Layer Selective Rank Reduction'β239Updated last year
- β276Updated 2 years ago
- A Qt GUI for large language modelsβ43Updated last year
- A fast batching API to serve LLM modelsβ183Updated last year
- Roy: A lightweight, model-agnostic framework for crafting advanced multi-agent systems using large language models.β78Updated last year
- β16Updated last year
- Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRAβ123Updated 2 years ago
- β199Updated last year
- Instruct-tuning LLaMA on consumer hardwareβ66Updated 2 years ago
- Let's create synthetic textbooks together :)β75Updated last year