AIAnytime / GGUF-Quantization-of-any-LLMLinks
GGUF Quantization of any LLM. 
☆41Updated last year
Alternatives and similar repositories for GGUF-Quantization-of-any-LLM
Users that are interested in GGUF-Quantization-of-any-LLM are comparing it to the libraries listed below
Sorting:
- run ollama & gguf easily with a single command☆52Updated last year
 - Zephyr 7B beta RAG Demo inside a Gradio app powered by BGE Embeddings, ChromaDB, and Zephyr 7B Beta LLM.☆36Updated 2 years ago
 - ☆14Updated last year
 - ☆55Updated 2 months ago
 - Advanced Coding AI Assistant that uses a Gradio interface to stream coding related responses. ChatRAG supports local and API inference an…☆22Updated 5 months ago
 - ☆47Updated last year
 - High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆67Updated last year
 - Function Calling Mistral 7B. Learn how to make functions call for open source LLMs.☆48Updated last year
 - GPT-4 Level Conversational QA Trained In a Few Hours☆65Updated last year
 - Gradio based tool to run opensource LLM models directly from Huggingface☆96Updated last year
 - HuggingChat like UI in Gradio☆70Updated 2 years ago
 - Streamlit app for recommending eval functions using prompt diffs☆29Updated last year
 - Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆50Updated last year
 - 💙 Unstructured Data Connectors for Haystack 2.0☆17Updated 2 years ago
 - Agent Watch is an AgentOps monitoring library designed for Crew AI applications.☆20Updated 11 months ago
 - Simple Chainlit UI for running llms locally using Ollama and LangChain☆46Updated last year
 - CrewAI AgentOps: Monitor your AI Agents☆19Updated last year
 - ☆13Updated last year
 - Medical Mixture of Experts LLM using Mergekit.☆20Updated last year
 - Modified Beam Search with periodical restart☆12Updated last year
 - Simple playground chat app that interacts with OpenAI's functions with memory and custom tools.☆18Updated 2 years ago
 - Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte…☆78Updated last year
 - Machine Learning Serving focused on GenAI with simplicity as the top priority.☆58Updated 3 weeks ago
 - ☆20Updated last year
 - Own your AI, search the web with it🌐😎☆92Updated 9 months ago
 - LLM-Training-API: Including Embeddings & ReRankers, mergekit, LaserRMT☆27Updated last year
 - ☆31Updated last year
 - Dataset Viber is your chill repo for data collection, annotation and vibe checks.☆46Updated last year
 - Lightweight continuous batching OpenAI compatibility using HuggingFace Transformers include T5 and Whisper.☆29Updated 7 months ago
 - Question Answer Generation App using Mistral 7B, Langchain, and FastAPI.☆65Updated 2 years ago