awinml / llama-cpp-python-bindings
Run fast LLM inference using llama.cpp in Python
☆19 Updated 2 years ago
Alternatives and similar repositories for llama-cpp-python-bindings
Users who are interested in llama-cpp-python-bindings compare it to the libraries listed below.
- LLM-Training-API: Including Embeddings & ReRankers, mergekit, LaserRMT ☆27 Updated last year
- Function Calling Mistral 7B. Learn how to make function calls with open-source LLMs. ☆48 Updated last year
- ☆55 Updated 5 months ago
- Zephyr 7B beta RAG Demo inside a Gradio app powered by BGE Embeddings, ChromaDB, and Zephyr 7B Beta LLM. ☆36 Updated 2 years ago
- Tutorial for DSPy ☆26 Updated last year
- Simple examples using Argilla tools to build AI ☆57 Updated last year
- Auto Data is a library designed for quick and effortless creation of datasets tailored for fine-tuning Large Language Models (LLMs). ☆105 Updated last year
- Deploy your GGML models to Hugging Face Spaces with Docker and Gradio ☆38 Updated 2 years ago
- ☆44 Updated last year
- Embed anything. ☆27 Updated last year
- YouTube Video Summarization App built using open-source LLMs and frameworks such as Llama 2, Haystack, Whisper, and Streamlit. This app smooth… ☆58 Updated last year
- A project that brings the power of Large Language Models (LLM) and Retrieval-Augmented Generation (RAG) within reach of everyone, particu… ☆38 Updated 2 years ago
- RAG Tool using Haystack, Mistral, and Chainlit. All open source stack on CPU. ☆24 Updated 2 years ago
- Gradio-based tool to run open-source LLMs directly from Hugging Face ☆96 Updated last year
- GPT-4 Level Conversational QA Trained In a Few Hours ☆65 Updated last year
- Using LlamaIndex with Ray for productionizing LLM applications ☆71 Updated 2 years ago
- ☆68 Updated last year
- Data extraction with LLM on CPU ☆112 Updated 2 years ago
- One Repo To Quickly Build One Docker File for HuggingChat Front and BackEnd ☆26 Updated 2 years ago
- ☆29 Updated 2 years ago
- High-level library for batched embedding generation, blazingly fast web-based RAG, and quantized index processing ⚡ ☆69 Updated 2 months ago
- ☆10 Updated last year
- Large Language Model (LLM) Inference API and Chatbot ☆128 Updated last year
- Run Ollama & GGUF easily with a single command ☆52 Updated last year
- Dynamic Metadata based RAG Framework ☆78 Updated 2 months ago
- Examples of RAG using LlamaIndex with local LLMs - Gemma, Mixtral 8x7B, Llama 2, Mistral 7B, Orca 2, Phi-2, Neural 7B ☆130 Updated last year
- Function Calling Benchmark & Testing ☆92 Updated last year
- ☆20 Updated 2 years ago
- RAG example using DSPy, Gradio, FastAPI ☆90 Updated last year
- Retrieval Augmented Generation (RAG) on audio data with LangChain ☆15 Updated 2 years ago