awinml / llama-cpp-python-bindings
Run fast LLM Inference using Llama.cpp in Python
☆17 Updated last year
Alternatives and similar repositories for llama-cpp-python-bindings
Users who are interested in llama-cpp-python-bindings are comparing it to the libraries listed below.
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute… ☆49 Updated 11 months ago
- Explore the use of DSPy for extracting features from PDFs 🔎 ☆42 Updated last year
- ☆20 Updated last year
- Small Multimodal Vision Model "Imp-v1-3b" trained using Phi-2 and Siglip. ☆17 Updated last year
- Metadata Enrichment using KeyBERT for advanced and improved RAG. ☆10 Updated last year
- LLM-Training-API: Including Embeddings & ReRankers, mergekit, LaserRMT ☆27 Updated last year
- Experimenting text-embeddings-inference server on both CPU and GPU ☆18 Updated last year
- Python Server for C3 AI app. A project that brings the power of Large Language Models (LLM) and Retrieval-Augmented Generation (RAG) with… ☆23 Updated last year
- Finetune any model on HF in less than 30 seconds ☆57 Updated 2 months ago
- Advanced Coding AI Assistant that uses a Gradio interface to stream coding related responses. ChatRAG supports local and API inference an… ☆22 Updated last month
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte… ☆71 Updated 7 months ago
- Using multiple LLMs for ensemble Forecasting ☆16 Updated last year
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers. ☆33 Updated last month
- ☆20 Updated last year
- Tools for merging pretrained large language models. ☆19 Updated last year
- ☆42 Updated last year
- GPT-4 Level Conversational QA Trained In a Few Hours ☆62 Updated 10 months ago
- ☆29 Updated last year
- ☆66 Updated last year
- ☆54 Updated 4 months ago
- Your Python AI Coder! ☆34 Updated last month
- ☆46 Updated 9 months ago
- Tutorial for DSPy ☆23 Updated last year
- BH hackathon ☆14 Updated last year
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite. ☆33 Updated last year
- ☆31 Updated last year
- Medical Mixture of Experts LLM using Mergekit. ☆20 Updated last year
- HuggingChat like UI in Gradio ☆71 Updated 2 years ago
- A fast, local, and secure approach for training LLMs for coding tasks using GRPO with WebAssembly and interpreter feedback. ☆30 Updated 2 months ago
- Building Knowledge Graph-Driven Chatbot with ChatGPT and ArangoDB ☆20 Updated last year