AIAnytime / GGUF-Quantization-of-any-LLM
GGUF Quantization of any LLM.
☆37Updated last year
Alternatives and similar repositories for GGUF-Quantization-of-any-LLM:
Users that are interested in GGUF-Quantization-of-any-LLM are comparing it to the libraries listed below
- Function Calling Mistral 7B. Learn how to make functions call for open source LLMs.☆48Updated last year
- Zephyr 7B beta RAG Demo inside a Gradio app powered by BGE Embeddings, ChromaDB, and Zephyr 7B Beta LLM.☆34Updated last year
- run ollama & gguf easily with a single command☆50Updated 11 months ago
- ☆52Updated 2 months ago
- Own your AI, search the web with it🌐😎☆84Updated 3 months ago
- Uses a Gradio interface to stream coding related responses from local and cloud based large language models. Pulls context from GitHub Re…☆21Updated last month
- ☆14Updated last year
- 💙 Unstructured Data Connectors for Haystack 2.0☆16Updated last year
- Groq-Whisper Fast Transcription App built using Groq API and Streamlit.☆24Updated 7 months ago
- Simple playground chat app that interacts with OpenAI's functions with memory and custom tools.☆18Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆49Updated 9 months ago
- LLM reads a paper and produce a working prototype☆52Updated last week
- On-device LLM Inference using Mediapipe LLM Inference API.☆21Updated last year
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆66Updated 5 months ago
- CrewAI AgentOps: Monitor your AI Agents☆17Updated 9 months ago
- ☆11Updated 10 months ago
- Medical Mixture of Experts LLM using Mergekit.☆20Updated last year
- ☆21Updated 5 months ago
- Modified Beam Search with periodical restart☆12Updated 7 months ago
- Tutorial for DSPy☆23Updated 11 months ago
- Transform unstructured documents into actionable, structured data with enterprise-grade precision and reliability, ready for large-scale …☆19Updated last week
- ☆46Updated last year
- Small Multimodal Vision Model "Imp-v1-3b" trained using Phi-2 and Siglip.☆17Updated last year
- Ready-to-go containerized RAG service. Implemented with text-embedding-inference + Qdrant/LanceDB.☆64Updated 4 months ago
- Simple Chainlit UI for running llms locally using Ollama and LangChain☆44Updated last year
- Build your own RAG and run it locally on your laptop: ColBERT + DSPy + Streamlit☆56Updated last year
- Chainlit app for advanced RAG. Uses llamaparse, langchain, qdrant and models from groq.☆45Updated 11 months ago
- Multimodal AI App using Llava 7B and Gradio.☆38Updated 11 months ago
- ☆16Updated last year
- I have explained how to create superior RAG pipeline for complex pdfs using LlamaParse. We can extract text and tables from pdf and QA on…☆44Updated last year