awinml / llama-cpp-python-bindings
Run fast LLM Inference using Llama.cpp in Python
☆17Updated last year
Alternatives and similar repositories for llama-cpp-python-bindings:
Users that are interested in llama-cpp-python-bindings are comparing it to the libraries listed below
- Tutorial for DSPy☆22Updated 8 months ago
- ☆24Updated 10 months ago
- ☆20Updated 11 months ago
- 💙 Unstructured Data Connectors for Haystack 2.0☆16Updated last year
- LLaMA implementation for HuggingFace Transformers☆38Updated last year
- Uses a Gradio interface to stream coding related responses from local and cloud based large language models. Pulls context from GitHub Re…☆19Updated 4 months ago
- Measuring RAG solutions throughput and latency☆15Updated 5 months ago
- ☆50Updated last month
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆33Updated 10 months ago
- ☆39Updated last month
- entropix style sampling + GUI☆25Updated 2 months ago
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform☆37Updated 11 months ago
- EMNLP 2024 "Re-reading improves reasoning in large language models". Simply repeating the question to get bidirectional understanding for…☆22Updated last month
- GPT-4 Level Conversational QA Trained In a Few Hours☆58Updated 4 months ago
- AI_Powered_Dev_Search_Engine☆12Updated 10 months ago
- An Implementation of "Orca: Progressive Learning from Complex Explanation Traces of GPT-4"☆44Updated 3 months ago
- Easily convert HuggingFace models to GGUF-format for llama.cpp☆21Updated 5 months ago
- Data preparation code for CrystalCoder 7B LLM☆44Updated 8 months ago
- 🔎 A deep-dive into HyDE for Advanced LLM RAG + 💡 Introducing AutoHyDE, a semi-supervised framework to improve the effectiveness, covera…☆30Updated 9 months ago
- ☆21Updated 10 months ago
- Experimenting text-embeddings-inference server on both CPU and GPU☆18Updated last year
- ToK aka Tree of Knowledge for Large Language Models LLM. It's a novel dataset that inspires knowledge symbolic correlation in simple inpu…☆50Updated last year
- This repository contains a Python implementation that allows you to use gorilla-llm/gorilla-openfunctions-v2 LLM to perform function call…☆16Updated 9 months ago
- minimal LLM scripts for 24GB VRAM GPUs. training, inference, whatever☆35Updated this week
- ☆12Updated last year
- Explore the use of DSPy for extracting features from PDFs 🔎☆37Updated 10 months ago
- tickr-agent is an enterprise-ready, scalable Python library for building swarms of financial agents that conduct comprehensive stock anal…☆37Updated last week
- Github repo for storing LlamaDatasets☆32Updated last year
- UnitEval is a benchmarking and evaluation tools for AutoDev Coder.☆11Updated last year
- HuggingChat like UI in Gradio☆69Updated last year