awinml / llama-cpp-python-bindings
Run fast LLM inference using llama.cpp in Python
☆18Updated 10 months ago
Related projects
Alternatives and complementary repositories for llama-cpp-python-bindings
- Tutorial for DSPy☆21Updated 6 months ago
- ☆20Updated 9 months ago
- A generalist agent that can go online and accomplish complex tasks using semantic-kernel and autogen.☆25Updated 11 months ago
- Solve Geometric & Graph Problems with Large Language Models☆28Updated last year
- Zephyr 7B beta RAG Demo inside a Gradio app powered by BGE Embeddings, ChromaDB, and Zephyr 7B Beta LLM.☆35Updated last year
- Explore the use of DSPy for extracting features from PDFs 🔎☆33Updated 8 months ago
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform☆36Updated 9 months ago
- A framework for high-fidelity retrieval augmented generation in industrial knowledge bases. Integrates jargon identification, context rec…☆28Updated 3 months ago
- 🌟EasyAGI : A generalist agent that can go online and accomplish complex tasks.☆23Updated 11 months ago
- Running different LLM and RAG tests while I learn along the way☆17Updated last month
- Medical Mixture of Experts LLM using Mergekit.☆20Updated 8 months ago
- 💙 Unstructured Data Connectors for Haystack 2.0☆16Updated last year
- ☆24Updated 8 months ago
- Repo hosting code and materials related to speeding up LLM inference using token merging.☆29Updated 6 months ago
- OpenMindedChatbot is a Proof Of Concept that leverages the power of Open source Large Language Models (LLM) with Function Calling capabil…☆28Updated 10 months ago
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…☆23Updated last week
- GGUF Quantization of any LLM.☆29Updated 8 months ago
- HuggingChat-like UI in Gradio☆64Updated last year
- AI_Powered_Dev_Search_Engine☆12Updated 8 months ago
- Metadata Enrichment using KeyBERT for advanced and improved RAG.☆10Updated 11 months ago
- A collection of pre-built wrappers over common RAG systems like ChromaDB, Weaviate, Pinecone, and others!☆20Updated this week
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆68Updated 3 weeks ago
- A QT GUI for large language models☆24Updated 10 months ago
- Your Python AI Coder!☆28Updated 3 weeks ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's PyTorch Lightning suite.☆33Updated 8 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆48Updated 4 months ago
- Examples for using the SiLLM framework for training and running Large Language Models (LLMs) on Apple Silicon☆15Updated this week
- GitHub repo for storing LlamaDatasets☆29Updated 10 months ago
- On-device real-time RAG App built using Jina Reader, Mediapipe, Gemma 2b IT LLM.☆11Updated 6 months ago
- Using multiple LLMs for ensemble Forecasting☆16Updated 9 months ago