AIAnytime / GGUF-Quantization-of-any-LLM
GGUF Quantization of any LLM.
☆38 Updated last year
Alternatives and similar repositories for GGUF-Quantization-of-any-LLM
Users who are interested in GGUF-Quantization-of-any-LLM are comparing it to the repositories listed below
- Zephyr 7B beta RAG Demo inside a Gradio app powered by BGE Embeddings, ChromaDB, and Zephyr 7B Beta LLM. ☆34 Updated last year
- Function Calling Mistral 7B. Learn how to make function calls with open-source LLMs. ☆48 Updated last year
- ☆52 Updated 3 months ago
- On-device real-time RAG app built using Jina Reader, Mediapipe, and Gemma 2b IT LLM. ☆13 Updated last year
- Multimodal AI App using Llava 7B and Gradio. ☆38 Updated last year
- Simple Chainlit UI for running LLMs locally using Ollama and LangChain. ☆44 Updated last year
- RAG Tool using Haystack, Mistral, and Chainlit. An all-open-source stack running on CPU. ☆23 Updated last year
- Advanced Coding AI Assistant that uses a Gradio interface to stream coding-related responses. ChatRAG supports local and API inference an… ☆21 Updated last week
- YouTube Video Summarization App built using open-source LLMs and frameworks such as Llama 2, Haystack, Whisper, and Streamlit. This app smooth… ☆56 Updated last year
- Medical Mixture of Experts LLM using Mergekit. ☆20 Updated last year
- Training Small Language Model ☆24 Updated last year
- ☆14 Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute… ☆49 Updated 10 months ago
- Metadata Enrichment using KeyBERT for advanced and improved RAG. ☆10 Updated last year
- ☆16 Updated last year
- LLM-Training-API: Including Embeddings & ReRankers, mergekit, LaserRMT ☆27 Updated last year
- ☆22 Updated last year
- ☆21 Updated 6 months ago
- Run Ollama and GGUF easily with a single command ☆50 Updated last year
- A chatbot UI for RAG, multimodal, and text completion (supports Transformers, llama.cpp, MLX, vLLM). ☆19 Updated last year
- ☆32 Updated last year
- Chat with Your Data App using LangChain, ChromaDB, Sentence Transformers, and LaMiNi LM Model. This Chatbot is completely powered by Open… ☆17 Updated last year
- Chainlit app for advanced RAG. Uses llamaparse, LangChain, Qdrant, and models from Groq. ☆45 Updated 11 months ago
- HuggingChat-like UI in Gradio ☆72 Updated last year
- High-level library for batched embeddings generation, blazingly fast web-based RAG, and quantized index processing ⚡ ☆66 Updated 6 months ago
- Using GPT-3 and Carrot (GPT-3 for computer vision) to create detailed descriptions of images. ☆13 Updated 3 years ago
- Agent Watch is an AgentOps monitoring library designed for Crew AI applications. ☆17 Updated 5 months ago
- A library for evaluating Retrieval-Augmented Generation (RAG) systems (the traditional ways). ☆35 Updated 9 months ago
- SLIM Models by LLMWare. A Streamlit app showing the capabilities for AI Agents and Function Calls. ☆20 Updated last year
- Design to Website App using Generative AI. It is a Streamlit application that uses OCR and an LLM to generate code or a website from a design. ☆20 Updated last year