GGUF Quantization of any LLM.
☆42 · Mar 4, 2024 · Updated 2 years ago
Alternatives and similar repositories for GGUF-Quantization-of-any-LLM
Users that are interested in GGUF-Quantization-of-any-LLM are comparing it to the libraries listed below.
- Code Llama GGUF Demo ☆10 · Aug 28, 2023 · Updated 2 years ago
- Medical Mixture of Experts LLM using Mergekit. ☆20 · Mar 6, 2024 · Updated 2 years ago
- A WordPress plugin that integrates ChatGPT to your website ☆13 · Nov 20, 2023 · Updated 2 years ago
- ☆15 · May 17, 2024 · Updated last year
- ☆18 · Dec 18, 2023 · Updated 2 years ago
- SLIM Models by LLMWare. A streamlit app showing the capabilities for AI Agents and Function Calls. ☆21 · Feb 11, 2024 · Updated 2 years ago
- Deploy large models to Android devices ☆20 · Nov 16, 2023 · Updated 2 years ago
- SQLGPT is an advanced SQL query generator powered by natural language processing. Seamlessly transforming plain English queries into comp… ☆10 · Oct 24, 2023 · Updated 2 years ago
- Tools for CrewAI ☆30 · May 12, 2024 · Updated last year
- Slack: #team-frontends-champions ☆16 · Apr 24, 2025 · Updated last year
- GGUF parser for Go ☆14 · Mar 8, 2026 · Updated 2 months ago
- Run models distributed as GGUF files using LLM ☆87 · Nov 21, 2024 · Updated last year
- Streamlit apps on Cloud Run with Identity-Aware Proxy (IAP). ☆10 · Mar 5, 2022 · Updated 4 years ago
- On-device real-time RAG App built using Jina Reader, Mediapipe, Gemma 2b IT LLM. ☆15 · Apr 15, 2024 · Updated 2 years ago
- Streamlit AWS Cognito Integration Example ☆25 · Jun 4, 2024 · Updated last year
- A bot that posts random SoundCloud comments. ☆12 · Apr 27, 2026 · Updated 2 weeks ago
- ☆12 · Sep 13, 2024 · Updated last year
- Port of Facebook's LLaMA model in C/C++ ☆32 · Mar 7, 2024 · Updated 2 years ago
- Tutorial on how to train a custom voice recognition model using Hugging Face models. ☆11 · Jul 2, 2023 · Updated 2 years ago
- PathRAG System - A Path-based Retrieval-Augmented Generation implementation with knowledge graph visualization and Ollama integration for… ☆13 · Mar 3, 2025 · Updated last year
- This project is from the Airbnb Recruitment Challenge on Kaggle. The challenge is to solve a multi-class classification problem of predic… ☆11 · Feb 22, 2022 · Updated 4 years ago
- A web interface for SleekDB written in PHP ☆11 · Jan 22, 2022 · Updated 4 years ago
- General information about DEEP BERLIN's AI for Good Hackathon 2020 ☆11 · Apr 14, 2020 · Updated 6 years ago
- Ask Poddy: Run Open Source LLMs and Embeddings as OpenAI-Compatible Serverless Endpoints (Tutorial) ☆11 · Jul 19, 2024 · Updated last year
- Temporary mail - Keep your real mailbox clean and secure. Temp Mail provides temporary, secure, anonymous, free, disposable email address… ☆13 · Mar 17, 2023 · Updated 3 years ago
- A better #! runner than /usr/bin/env ☆22 · Jun 22, 2012 · Updated 13 years ago
- My Gen AI research ☆11 · Jun 3, 2024 · Updated last year
- ☆10 · Sep 14, 2022 · Updated 3 years ago
- Ollama with RAG and Chainlit is a chatbot project leveraging Ollama, RAG, and Chainlit. It uses Chromadb for vector storage, gpt4all for … ☆14 · Feb 15, 2024 · Updated 2 years ago
- A simple website to manage your Hyper-V VMs and IIS sites ☆12 · Jan 19, 2023 · Updated 3 years ago
- ☆14 · Sep 18, 2024 · Updated last year
- pinout ☆12 · Oct 20, 2022 · Updated 3 years ago
- automatically quant GGUF models ☆224 · Dec 23, 2025 · Updated 4 months ago
- ☆10 · May 29, 2020 · Updated 5 years ago
- Streaming AI assistant with ChatGPT, FastAPI, WebSockets and React ✨🤖🚀 ☆26 · Nov 12, 2023 · Updated 2 years ago
- A Ruby library to parse the content out of web pages, such as BBC News pages. Used by the News Sniffer project. ☆29 · Jan 4, 2026 · Updated 4 months ago
- ☆12 · May 12, 2025 · Updated 11 months ago
- GPTNERMED is a language model-generated, synthetic dataset and an open neural NER model for medical entities designed for German data. ☆15 · Oct 5, 2023 · Updated 2 years ago
- Finetuning a codegen model with python instruction set using QLORA technique for better efficacy ☆11 · Aug 31, 2023 · Updated 2 years ago