awinml / llama-cpp-python-bindingsLinks
Run fast LLM Inference using Llama.cpp in Python
☆19Updated 2 years ago
Alternatives and similar repositories for llama-cpp-python-bindings
Users that are interested in llama-cpp-python-bindings are comparing it to the libraries listed below
Sorting:
- LLM-Training-API: Including Embeddings & ReRankers, mergekit, LaserRMT☆27Updated last year
- ☆55Updated 5 months ago
- Simple AI agents / assistants☆51Updated last year
- On-device LLM Inference using Mediapipe LLM Inference API.☆23Updated last year
- Tutorial for DSPy☆26Updated last year
- Gradio based tool to run opensource LLM models directly from Huggingface☆96Updated last year
- Auto Data is a library designed for quick and effortless creation of datasets tailored for fine-tuning Large Language Models (LLMs).☆105Updated last year
- ☆68Updated last year
- Function Calling Mistral 7B. Learn how to make functions call for open source LLMs.☆48Updated last year
- Data extraction with LLM on CPU☆112Updated 2 years ago
- Dynamic Metadata based RAG Framework☆78Updated 2 months ago
- RAG example using DSPy, Gradio, FastAPI☆90Updated last year
- Build a Streamlit Chatbot using Langchain, ColBERT, Ragatouille, and ChromaDB☆122Updated 2 years ago
- A project that brings the power of Large Language Models (LLM) and Retrieval-Augmented Generation (RAG) within reach of everyone, particu…☆38Updated 2 years ago
- Mistral + Haystack: build RAG pipelines that rock 🤘☆106Updated 2 years ago
- Embed anything.☆27Updated last year
- Zephyr 7B beta RAG Demo inside a Gradio app powered by BGE Embeddings, ChromaDB, and Zephyr 7B Beta LLM.☆36Updated 2 years ago
- ☆119Updated last year
- Own your AI, search the web with it🌐😎☆94Updated last year
- RAGArch is a Streamlit-based application that empowers users to experiment with various components and parameters of Retrieval-Augmented …☆87Updated 2 years ago
- Deploy your GGML models to HuggingFace Spaces with Docker and gradio☆38Updated 2 years ago
- A collection of AI related python scripts for things like training, RAG and agents.☆29Updated 10 months ago
- Local first human friendly agents toolkit for the browser and Nodejs☆45Updated this week
- Large Language Model (LLM) Inference API and Chatbot☆128Updated last year
- SLIM Models by LLMWare. A streamlit app showing the capabilities for AI Agents and Function Calls.☆20Updated last year
- ☆29Updated 2 years ago
- Chat with Documents from scratch using LLMs and a vector databse☆18Updated last year
- Widest collection of generative ai usecases in enterprise & startups☆18Updated 2 years ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆92Updated last year
- Recipes and resources for building, deploying, and fine-tuning generative AI with Fireworks.☆134Updated 2 weeks ago