GGUF Quantization of any LLM.
☆42Mar 4, 2024Updated 2 years ago
Alternatives and similar repositories for GGUF-Quantization-of-any-LLM
Users that are interested in GGUF-Quantization-of-any-LLM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code Llama GGUF Demo☆10Aug 28, 2023Updated 2 years ago
- Medical Mixture of Experts LLM using Mergekit.☆20Mar 6, 2024Updated 2 years ago
- An easy-to-use ML pipeline package for Python inspired by scikit-learn pipeline and PyTorch layers.☆12Aug 27, 2023Updated 2 years ago
- Multimodal AI App using Llava 7B and Gradio.☆39Apr 30, 2024Updated last year
- ☆15May 17, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆18Dec 18, 2023Updated 2 years ago
- SLIM Models by LLMWare. A streamlit app showing the capabilities for AI Agents and Function Calls.☆20Feb 11, 2024Updated 2 years ago
- Recreation of the BBC News Map that allows for quick selection of counties and towns☆23Oct 19, 2011Updated 14 years ago
- 部署大模型到Android设备☆20Nov 16, 2023Updated 2 years ago
- ☆17Updated this week
- SQLGPT is an advanced SQL query generator powered by natural language processing. Seamlessly transforming plain English queries into comp…☆10Oct 24, 2023Updated 2 years ago
- Create your own Android RTMP/RTMPS/SRT live streaming application in less than 5 minutes!☆28Feb 4, 2026Updated 2 months ago
- Slack: #team-frontends-champions☆16Apr 24, 2025Updated 11 months ago
- Raw source code from Lions' Commentary on UNIX 6th Edition☆16Mar 27, 2013Updated 13 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- GGUF parser for Go☆14Mar 8, 2026Updated last month
- Series of python scripts and MapReduce programs to extract, parse and display Reddit data☆11Oct 10, 2022Updated 3 years ago
- On-device real-time RAG App built using Jina Reader, Mediapipe, Gemma 2b IT LLM.☆15Apr 15, 2024Updated 2 years ago
- Tacotron 2 training notebook supporting Japanese, French, and Mandarin☆11Nov 19, 2022Updated 3 years ago
- A bot that posts random SoundCloud comments.☆12Dec 4, 2025Updated 4 months ago
- Conveniently download files, models, tokenizers from HuggingFace Hub☆58Apr 10, 2026Updated last week
- ☆12Sep 13, 2024Updated last year
- PathRAG System - A Path-based Retrieval-Augmented Generation implementation with knowledge graph visualization and Ollama integration for…☆13Mar 3, 2025Updated last year
- A web interface for SleekDB written in PHP☆11Jan 22, 2022Updated 4 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- A local-first LLM development studio. Build, test, and customize inference workflows with your own models — no cloud, totally local.☆17May 21, 2025Updated 11 months ago
- Temporary mail - Keep your real mailbox clean and secure. Temp Mail provides temporary, secure, anonymous, free, disposable email address…☆13Mar 17, 2023Updated 3 years ago
- Collection of datasets for network research.☆14Jul 26, 2020Updated 5 years ago
- [ICON 2020] TensorFlow Code for "End-to-End Automatic Speech Recognition System for Gujarati"☆13Jul 26, 2021Updated 4 years ago
- My Gen AI research☆11Jun 3, 2024Updated last year
- Ollama with RAG and Chainlit is a chatbot project leveraging Ollama, RAG, and Chainlit. It uses Chromadb for vector storage, gpt4all for …☆14Feb 15, 2024Updated 2 years ago
- A Windows tool to query various LLM AIs. Supports branched conversations, history and summaries among others.☆34Apr 11, 2026Updated last week
- This repository contains the code for our paper "Augmenting Black-box LLMs with Medical Textbooks for Clinical Question Answering" [EMNLP…☆15Oct 8, 2024Updated last year
- Where I will be storing misc files with details / links used during the installation process, etc☆13Sep 9, 2025Updated 7 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- automatically quant GGUF models☆223Dec 23, 2025Updated 3 months ago
- [Computer Speech & Language] A transformer-based spelling error correction framework for Bangla and resource scarce Indic languages☆14Aug 9, 2024Updated last year
- Streaming AI assistant with ChatGPT, FastAPI, WebSockets and React ✨🤖🚀☆26Nov 12, 2023Updated 2 years ago
- A Ruby library to parse the content out of web pages, such as BBC News pages. Used by the News Sniffer project.☆29Jan 4, 2026Updated 3 months ago
- GPTNERMED is a language model-generated, synthetic dataset and an open neural NER model for medical entities designed for German data.☆16Oct 5, 2023Updated 2 years ago
- https://hapo31.github.io/charcoal☆11Dec 27, 2020Updated 5 years ago
- A vllm proxy server to add security and multi model management for vllm servers☆12May 30, 2024Updated last year