AIAnytime / GGUF-Quantization-of-any-LLM
GGUF Quantization of any LLM.
☆37Updated last year
Alternatives and similar repositories for GGUF-Quantization-of-any-LLM:
Users that are interested in GGUF-Quantization-of-any-LLM are comparing it to the libraries listed below
- RAG Tool using Haystack, Mistral, and Chainlit. All open source stack on CPU.☆23Updated last year
- Multimodal AI App using Llava 7B and Gradio.☆38Updated 10 months ago
- Function Calling Mistral 7B. Learn how to make functions call for open source LLMs.☆48Updated last year
- Medical Mixture of Experts LLM using Mergekit.☆20Updated last year
- Uses a Gradio interface to stream coding related responses from local and cloud based large language models. Pulls context from GitHub Re…☆20Updated last week
- run ollama & gguf easily with a single command☆49Updated 10 months ago
- Groq-Whisper Fast Transcription App built using Groq API and Streamlit.☆23Updated 6 months ago
- Using GPT-3 and Carrot (GPT-3 for computer vision) to create detailed descriptions of images.☆13Updated 3 years ago
- ☆52Updated last month
- On-device real-time RAG App built using Jina Reader, Mediapipe, Gemma 2b IT LLM.☆13Updated 11 months ago
- Zephyr 7B beta RAG Demo inside a Gradio app powered by BGE Embeddings, ChromaDB, and Zephyr 7B Beta LLM.☆34Updated last year
- Chrome Extension powered by LLM☆17Updated last year
- A chatbot UI for RAG, multimodal, text completion. (support Transformers, llama.cpp, MLX, vLLM)☆19Updated 11 months ago
- Small Multimodal Vision Model "Imp-v1-3b" trained using Phi-2 and Siglip.☆17Updated last year
- ☆14Updated 11 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆49Updated 8 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆67Updated 4 months ago
- I have explained how to create superior RAG pipeline for complex pdfs using LlamaParse. We can extract text and tables from pdf and QA on…☆43Updated last year
- 💙 Unstructured Data Connectors for Haystack 2.0☆16Updated last year
- PubMed Healthcare Chatbot. LLM Augmented Q&A over PubMed Search Engine.☆22Updated last year
- Agent Watch is an AgentOps monitoring library designed for Crew AI applications.☆17Updated 3 months ago
- ☆21Updated 4 months ago
- ☆11Updated last year
- Fine Tuning Multimodal LLM "Idefics 9B" on Pokemon Go Dataset available on Hugging Face.☆19Updated last year
- Medical Help App using GPT-4V☆25Updated last year
- Question Answer Generation App using Mistral 7B, Langchain, and FastAPI.☆64Updated last year
- ☆46Updated last year
- Chat with Your Data App using Langchain, ChromaDB, Sentence Transformers, and LaMiNi LM Model. This Chatbot is completely powered by Open…☆17Updated last year
- Interactive notes (Jupyter Notebooks) for building AI-powered applications☆31Updated 10 months ago
- This is a RAG implementation using Open Source stack. BioMistral 7B has been used to build this app along with PubMedBert as an embedding…☆68Updated this week