danielgross / ggml-k8sView external linksLinks
Run GGML models with Kubernetes.
☆175Dec 17, 2023Updated 2 years ago
Alternatives and similar repositories for ggml-k8s
Users that are interested in ggml-k8s are comparing it to the libraries listed below
Sorting:
- Just a bunch of benchmark logs for different LLMs☆119Jul 28, 2024Updated last year
- ☆119Dec 18, 2024Updated last year
- [ACL2025 Findings] Benchmarking Multihop Multimodal Internet Agents☆48Feb 27, 2025Updated 11 months ago
- A single notebook for fine-tuning GPT-3.5 turbo☆31Aug 16, 2024Updated last year
- ☆15Dec 22, 2023Updated 2 years ago
- ☆56Mar 7, 2023Updated 2 years ago
- Let's make sand talk☆588Oct 17, 2023Updated 2 years ago
- batched loras☆349Sep 6, 2023Updated 2 years ago
- ☆12Sep 26, 2023Updated 2 years ago
- ☆62Dec 8, 2023Updated 2 years ago
- [WIP] AI Try-On plugin for Chrome☆28Mar 16, 2024Updated last year
- ☆40Mar 25, 2024Updated last year
- Full finetuning of large language models without large memory requirements☆94Sep 22, 2025Updated 4 months ago
- Simplex Random Feature attention, in PyTorch☆76Oct 10, 2023Updated 2 years ago
- Seamless Voice Interactions with LLMs☆12Oct 28, 2023Updated 2 years ago
- ☆45Oct 13, 2023Updated 2 years ago
- Guess the Hacker News titles☆12Mar 24, 2022Updated 3 years ago
- extending laughbot project to encoder-based transformer model finetuned on same dataset for humor classification☆10Jan 4, 2023Updated 3 years ago
- ☆11Dec 11, 2024Updated last year
- ☆10Jul 17, 2023Updated 2 years ago
- A distributed execution framework built upon lunatic.☆16Jan 19, 2024Updated 2 years ago
- BH hackathon☆14Apr 4, 2024Updated last year
- Code for the paper "QMoE: Practical Sub-1-Bit Compression of Trillion-Parameter Models".☆279Nov 3, 2023Updated 2 years ago
- inference code for mixtral-8x7b-32kseqlen☆105Dec 12, 2023Updated 2 years ago
- ☆3,372Feb 25, 2024Updated last year
- ☆27Mar 14, 2024Updated last year
- LLM plugin for models hosted by Anyscale Endpoints☆35Apr 22, 2024Updated last year
- GGML implementation of BERT model with Python bindings and quantization.☆58Feb 19, 2024Updated last year
- ☆11Mar 18, 2024Updated last year
- ☆13Aug 10, 2023Updated 2 years ago
- UnrealBakedSDF is a sample Unreal project for importing and visualizing BakedSDF meshes.☆15Jun 14, 2023Updated 2 years ago
- Implementation of the paper: "Audio Mamba: Bidirectional State Space Model for Audio Representation Learning" in pytorch☆14Feb 9, 2026Updated last week
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io☆16Apr 18, 2024Updated last year
- Chatbot for The Carbon Almanac book or a climate change related topic☆16Mar 6, 2023Updated 2 years ago
- ☆13Oct 12, 2023Updated 2 years ago
- ComfyUI node for modular, human‑like Kani TTS. Generate natural, high‑quality speech from text☆38Oct 17, 2025Updated 3 months ago
- ☆10Oct 24, 2024Updated last year
- Apache Hive Metastore in Standalone Mode With Docker☆14Jul 22, 2024Updated last year
- ☆51Jan 31, 2024Updated 2 years ago