AIAnytime / GGUF-Quantization-of-any-LLMLinks

GGUF Quantization of any LLM.

☆40

Alternatives and similar repositories for GGUF-Quantization-of-any-LLM

Users that are interested in GGUF-Quantization-of-any-LLM are comparing it to the libraries listed below

Sorting:

githubpradeep / notebooks
☆54Updated 6 months ago
monk1337 / auto-ollama
run ollama & gguf easily with a single command
☆52Updated last year
JakeFurtaw / Chat-RAG
Advanced Coding AI Assistant that uses a Gradio interface to stream coding related responses. ChatRAG supports local and API inference an…
☆22Updated 3 months ago
louisbrulenaudet / ragoon
High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡
☆66Updated 9 months ago
AIAnytime / Zephyr-7B-beta-RAG-Demo
Zephyr 7B beta RAG Demo inside a Gradio app powered by BGE Embeddings, ChromaDB, and Zephyr 7B Beta LLM.
☆35Updated last year
deep-diver / gradio-chat
HuggingChat like UI in Gradio
☆71Updated 2 years ago
nodematiclabs / llama-3-finetune-unsloth
☆14Updated last year
kesamet / retrieval-augmented-generation
Retrieval augmented generation demos with open-source DeepSeek, Llama, Qwen, Mistral, Gemma
☆42Updated 3 weeks ago
deshwalmahesh / PHUDGE
Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…
☆49Updated last year
langchain-ai / prompt-eval-recommendation
Streamlit app for recommending eval functions using prompt diffs
☆29Updated last year
Cerebras / DocChat
GPT-4 Level Conversational QA Trained In a Few Hours
☆63Updated 11 months ago
AIAnytime / Function-Calling-Mistral-7B
Function Calling Mistral 7B. Learn how to make functions call for open source LLMs.
☆48Updated last year
l4b4r4b4b4 / AIDocks
LLM-Training-API: Including Embeddings & ReRankers, mergekit, LaserRMT
☆27Updated last year
S1M0N38 / dspy-arxiv
Explore the use of DSPy for extracting features from PDFs 🔎
☆45Updated last year
shivamsanju / ragswift
🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform
☆38Updated last year
Luxadevi / Ollama-Companion
Webinterface for administrating Ollama and model Quantization with public endpoints and automized OPENAI proxy
☆50Updated 4 months ago
TuanaCelik / unstructuredio-haystack
💙 Unstructured Data Connectors for Haystack 2.0
☆17Updated last year
AIAnytime / Medical-Mixture-of-Experts-LLM
Medical Mixture of Experts LLM using Mergekit.
☆20Updated last year
davidberenstein1957 / dataset-viber
Dataset Viber is your chill repo for data collection, annotation and vibe checks.
☆47Updated 11 months ago
Rivridis / LLM-Assistant
Locally running LLM with internet access
☆96Updated last month
AIAnytime / agent-watch
Agent Watch is an AgentOps monitoring library designed for Crew AI applications.
☆19Updated 8 months ago
Ashufet / Superior-RAG-for-Complex-PDFs-using-LlamaParse
I have explained how to create superior RAG pipeline for complex pdfs using LlamaParse. We can extract text and tables from pdf and QA on…
☆47Updated last year
flowaicom / flow-judge
Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte…
☆76Updated 9 months ago
AIAnytime / CrewAI-AgentOps
CrewAI AgentOps: Monitor your AI Agents
☆18Updated last year
QuixiAI / SystemChat
☆30Updated last year
run-llama / mixtral_ollama
☆46Updated last year
fw-ai / cookbook
Recipes and resources for building, deploying, and fine-tuning generative AI with Fireworks.
☆120Updated last week
redis-developer / agentic-rag
Complete example of how to build an Agentic RAG architecture with Redis, Amazon Bedrock, and LlamaIndex.
☆96Updated 8 months ago
gabrielchua / groq-st-demo
☆22Updated last year
Aesthisia / LLMinator
Gradio based tool to run opensource LLM models directly from Huggingface
☆94Updated last year