awinml / llama-cpp-python-bindings
Run fast LLM inference using llama.cpp in Python
☆19 Updated last year
Alternatives and similar repositories for llama-cpp-python-bindings
Users who are interested in llama-cpp-python-bindings are comparing it to the libraries listed below.
- Large Language Model (LLM) Inference API and Chatbot ☆126 Updated last year
- LLM-Training-API: Including Embeddings & ReRankers, mergekit, LaserRMT ☆27 Updated last year
- ☆43 Updated last year
- Tutorial for DSPy ☆25 Updated last year
- Gradio-based tool to run open-source LLMs directly from Hugging Face ☆96 Updated last year
- Function Calling Mistral 7B. Learn how to make function calls with open-source LLMs. ☆48 Updated last year
- Deploy your GGML models to Hugging Face Spaces with Docker and Gradio ☆38 Updated 2 years ago
- ☆55 Updated 2 months ago
- ☆75 Updated last year
- Auto Data is a library designed for quick and effortless creation of datasets tailored for fine-tuning Large Language Models (LLMs). ☆102 Updated last year
- Practical and advanced guide to LLMOps. It provides a solid understanding of large language models’ general concepts, deployment techniqu… ☆76 Updated last year
- YouTube video summarization app built using open-source LLMs and frameworks like Llama 2, Haystack, Whisper, and Streamlit. This app smooth… ☆57 Updated last year
- On-device LLM inference using the MediaPipe LLM Inference API. ☆22 Updated last year
- Simple examples using Argilla tools to build AI ☆56 Updated last year
- A project that brings the power of Large Language Models (LLM) and Retrieval-Augmented Generation (RAG) within reach of everyone, particu… ☆37 Updated last year
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte… ☆78 Updated last year
- ☆29 Updated 2 years ago
- RAGArch is a Streamlit-based application that empowers users to experiment with various components and parameters of Retrieval-Augmented … ☆87 Updated last year
- Examples of RAG using LlamaIndex with local LLMs - Gemma, Mixtral 8x7B, Llama 2, Mistral 7B, Orca 2, Phi-2, Neural 7B ☆131 Updated last year
- Embed anything. ☆27 Updated last year
- ☆30 Updated last year
- This repo contains code covered in the YouTube tutorials. ☆86 Updated 5 months ago
- Widest collection of generative AI use cases in enterprises & startups ☆19 Updated last year
- ☆25 Updated last year
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025) ☆91 Updated 10 months ago
- Own your AI, search the web with it 🌐😎 ☆92 Updated 10 months ago
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models ☆113 Updated 7 months ago
- Using LlamaIndex with Ray for productionizing LLM applications ☆71 Updated 2 years ago
- RAG example using DSPy, Gradio, FastAPI ☆86 Updated last year
- A data-centric AI package for ML/AI. Get the best high-quality data for the best results. Discord: https://discord.gg/t6ADqBKrdZ ☆63 Updated 2 years ago