crashr / gppm
GPU Power and Performance Manager
☆46 Updated 3 weeks ago
Related projects
Alternatives and complementary repositories for gppm
- A daemon that automatically manages the performance states of NVIDIA GPUs. ☆34 Updated last week
- Large Model Proxy is designed to make it easy to run multiple resource-heavy Large Models (LM) on the same machine with limited amount of… ☆45 Updated last month
- ☆95 Updated last week
- SLOP Detector and analyzer based on dictionary for shareGPT JSON and text ☆44 Updated last week
- A fast batching API to serve LLM models ☆172 Updated 6 months ago
- ☆30 Updated 6 months ago
- A python application that routes incoming prompts to an LLM by category, and can support a single incoming connection from a front end to… ☆160 Updated 2 weeks ago
- This small API downloads and exposes access to NeuML's txtai-wikipedia and full wikipedia datasets, taking in a query and returning full … ☆45 Updated 3 months ago
- ☆25 Updated last month
- No-messing-around sh client for llama.cpp's server ☆27 Updated 3 months ago
- A frontend for creative writing with LLMs ☆106 Updated 3 months ago
- ☆63 Updated last month
- Easily view and modify JSON datasets for large language models ☆62 Updated last month
- After my server ui improvements were successfully merged, consider this repo a playground for experimenting, tinkering and hacking around… ☆56 Updated 2 months ago
- An open source, Gradio-based chatbot app that combines the best of retrieval augmented generation and prompt engineering into an intellig… ☆43 Updated 3 months ago
- Something similar to Apple Intelligence? ☆57 Updated 4 months ago
- Experimental LLM Inference UX to aid in creative writing ☆104 Updated 3 months ago
- An extension that lets the AI take the wheel, allowing it to use the mouse and keyboard, recognize UI elements, and prompt itself :3...no… ☆95 Updated 2 weeks ago
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications. ☆25 Updated last week
- A simple speech-to-text and text-to-speech AI chatbot that can be run fully offline. ☆42 Updated 9 months ago
- Local LLM inference & management server with built-in OpenAI API ☆31 Updated 6 months ago
- Open source LLM UI, compatible with all local LLM providers. ☆165 Updated last month
- HTTP proxy for on-demand model loading with llama.cpp (or other OpenAI compatible backends) ☆32 Updated last week
- CaSIL is an advanced natural language processing system that implements a sophisticated four-layer semantic analysis architecture. It pro… ☆55 Updated this week
- A library and CLI utilities for managing performance states of NVIDIA GPUs. ☆19 Updated last month
- vnc-lm is a Discord bot with Ollama, OpenRouter, Mistral, Cohere, and Github Models API integration ☆44 Updated this week
- A simple light terminal style chat app that lets you connect to your local llama.cpp server ☆27 Updated 4 months ago
- ☆110 Updated 2 weeks ago
- Mycomind Daemon: A mycelium-inspired, advanced Mixture-of-Memory-RAG-Agents (MoMRA) cognitive assistant that combines multiple AI models … ☆30 Updated 3 months ago
- idea: https://github.com/nyxkrage/ebook-groupchat/ ☆81 Updated 2 months ago