crashr / gppm
GPU Power and Performance Manager
☆57Updated 5 months ago
Alternatives and similar repositories for gppm:
Users that are interested in gppm are comparing it to the libraries listed below
- Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on differe…☆55Updated last month
- Easily view and modify JSON datasets for large language models☆71Updated 3 weeks ago
- ☆52Updated this week
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.☆28Updated 2 months ago
- klmbr - a prompt pre-processing technique to break through the barrier of entropy while generating text with LLMs☆71Updated 6 months ago
- Local LLM inference & management server with built-in OpenAI API☆31Updated 11 months ago
- ☆32Updated 10 months ago
- SLOP Detector and analyzer based on dictionary for shareGPT JSON and text☆65Updated 4 months ago
- Writing Extension for Text Generation WebUI☆54Updated 2 months ago
- ☆66Updated last month
- Open source LLM UI, compatible with all local LLM providers.☆173Updated 6 months ago
- Text generation in Python, as easy as possible☆56Updated 2 weeks ago
- Prompt Jinja2 templates for LLMs☆31Updated 3 months ago
- After my server ui improvements were successfully merged, consider this repo a playground for experimenting, tinkering and hacking around…☆56Updated 7 months ago
- CaSIL is an advanced natural language processing system that implements a sophisticated four-layer semantic analysis architecture. It pro…☆64Updated 4 months ago
- Realtime tts reading of large textfiles by your favourite voice. +Translation via LLM (Python script)☆52Updated 5 months ago
- automatically quant GGUF models☆164Updated last week
- A daemon that automatically manages the performance states of NVIDIA GPUs.☆64Updated 5 months ago
- A library and CLI utilities for managing performance states of NVIDIA GPUs.☆25Updated 5 months ago
- Lightweight Inference server for OpenVINO☆143Updated this week
- A fast batching API to serve LLM models☆183Updated 11 months ago
- LlamaCards is a web application that provides a dynamic interface for interacting with LLM models in real-time. This app allows users to …☆38Updated 7 months ago
- Adding a multi-text multi-speaker script (diffe) that is based on a script from asiff00 on issue 61 for Sesame: A Conversational Speech G…☆21Updated this week
- A frontend for creative writing with LLMs☆122Updated 8 months ago
- An extension that lets the AI take the wheel, allowing it to use the mouse and keyboard, recognize UI elements, and prompt itself :3...no…☆116Updated 5 months ago
- ☆46Updated last month
- Use smol agents to do research and then update csv coumns with its findings.☆37Updated 2 months ago
- Generate Your Own Private Morning Radio for Commute☆34Updated last month
- A simple speech-to-text and text-to-speech AI chatbot that can be run fully offline.☆44Updated last year
- An API for VoiceCraft.☆25Updated 9 months ago