danielgross / ggml-k8s
Run GGML models with Kubernetes.
☆173 Updated last year
Alternatives and similar repositories for ggml-k8s:
Users who are interested in ggml-k8s are comparing it to the repositories listed below.
- ☆136 Updated last year
- An MLX project to train a base model on your WhatsApp chats using (Q)LoRA fine-tuning ☆162 Updated last year
- Simple embedding -> text model trained on a small subset of Wikipedia sentences. ☆153 Updated last year
- A collection of LLM services you can self-host via Docker or Modal Labs to support your application development ☆184 Updated 8 months ago
- A curated list of amazingly awesome Modal applications, demos, and shiny things. Inspired by awesome-php. ☆100 Updated last week
- Run PaliGemma in real time ☆129 Updated 7 months ago
- AI sends pull requests for features you request in natural language ☆113 Updated last year
- Run embeddings in MLX ☆81 Updated 3 months ago
- ☆85 Updated 3 months ago
- On-device intelligence. ☆216 Updated 4 months ago
- Command-line script for running inference on models such as MPT-7B-Chat ☆101 Updated last year
- Automatic sentence highlights based on their significance to the document ☆181 Updated last year
- Chat Markup Language conversation library ☆55 Updated last year
- Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI ☆222 Updated 8 months ago
- Demo of an AI chatbot that predicts the user's message to generate a response quickly. ☆100 Updated 10 months ago
- llm-consortium orchestrates multiple LLMs, iteratively refines & achieves consensus. ☆128 Updated this week
- Start a server from the MLX library. ☆166 Updated 5 months ago
- ☆107 Updated 3 weeks ago
- A feed of trending repos/models from GitHub, Replicate, HuggingFace, and Reddit. ☆116 Updated 4 months ago
- An implementation of Self-Extend, to expand the context window via grouped attention ☆118 Updated last year
- ☆48 Updated last year
- TypeScript generator for llama.cpp Grammar directly from TypeScript interfaces ☆131 Updated 6 months ago
- ☆38 Updated 10 months ago
- ☆112 Updated last year
- Just a bunch of benchmark logs for different LLMs ☆116 Updated 5 months ago
- Simple Transformer in JAX ☆128 Updated 6 months ago
- Some of the scripts I use for scribepod @ https://scribepod.substack.com/, an automated AI podcast ☆172 Updated last year
- Your automated SWE fleet to get your tickets from the Backlog to Prod! ☆95 Updated 8 months ago
- Helpers and such for working with Lambda Cloud ☆51 Updated last year
- Just large language models. Hackable, with as little abstraction as possible. Done for my own purposes, feel free to rip. ☆44 Updated last year