awinml / llama-cpp-python-bindings
Run fast LLM Inference using Llama.cpp in Python
☆17Updated last year
Alternatives and similar repositories for llama-cpp-python-bindings:
Users who are interested in llama-cpp-python-bindings are comparing it to the libraries listed below.
- ☆52Updated last month
- Tutorial for DSPy☆23Updated 11 months ago
- Zephyr 7B beta RAG Demo inside a Gradio app powered by BGE Embeddings, ChromaDB, and Zephyr 7B Beta LLM.☆34Updated last year
- LLM-Training-API: Including Embeddings & ReRankers, mergekit, LaserRMT☆27Updated last year
- Function Calling Mistral 7B. Learn how to make function calls for open-source LLMs.☆48Updated last year
- ☆20Updated last year
- ☆41Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆49Updated 8 months ago
- Finetune any model on HF in less than 30 seconds☆58Updated 2 months ago
- Universal text classifier for generative models☆22Updated 8 months ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆91Updated 2 months ago
- This repository contains a Python implementation that allows you to use gorilla-llm/gorilla-openfunctions-v2 LLM to perform function call…☆16Updated 11 months ago
- Metadata Enrichment using KeyBERT for advanced and improved RAG.☆10Updated last year
- Python Server for C3 AI app. A project that brings the power of Large Language Models (LLM) and Retrieval-Augmented Generation (RAG) with…☆23Updated last year
- 🔎 A deep-dive into HyDE for Advanced LLM RAG + 💡 Introducing AutoHyDE, a semi-supervised framework to improve the effectiveness, covera…☆32Updated last year
- ☆25Updated last year
- Build your own RAG and run it locally on your laptop: ColBERT + DSPy + Streamlit☆56Updated last year
- Building Knowledge Graph-Driven Chatbot with ChatGPT and ArangoDB☆20Updated last year
- Pandas-LLM☆41Updated last year
- Streamlit application that helps users analyze RFPs using the latest Gemini 2.0 Flash Experimental LLM.☆13Updated 3 months ago
- A repository of Python scripts to scrape code contents of the public repositories of `huggingface`.☆51Updated last year
- Small and Efficient Mathematical Reasoning LLMs☆71Updated last year
- Uses a Gradio interface to stream coding-related responses from local and cloud-based large language models. Pulls context from GitHub Re…☆20Updated 3 weeks ago
- GPT-4 Level Conversational QA Trained In a Few Hours☆59Updated 7 months ago
- Medical Mixture of Experts LLM using Mergekit.☆20Updated last year
- Building Private Healthcare AI Assistant for Clinics Using Qdrant Hybrid Cloud, DSPy and Groq - Llama3☆22Updated 10 months ago
- Simple examples using Argilla tools to build AI☆53Updated 4 months ago
- Contains Google Colab or Jupyter notebooks, as well as other associated files for my Medium blogposts.☆35Updated 10 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆67Updated 5 months ago
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform☆37Updated last year