crashr / gppmLinks
GPU Power and Performance Manager
☆60Updated 9 months ago
Alternatives and similar repositories for gppm
Users that are interested in gppm are comparing it to the libraries listed below
Sorting:
- Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on differe…☆67Updated 2 weeks ago
- ☆79Updated last week
- A frontend for creative writing with LLMs☆127Updated last year
- This small API downloads and exposes access to NeuML's txtai-wikipedia and full wikipedia datasets, taking in a query and returning full …☆98Updated this week
- A fast batching API to serve LLM models☆183Updated last year
- AI stack for interacting with LLMs, Stable Diffusion, Whisper, xTTS and many other AI models☆161Updated last year
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.☆28Updated 5 months ago
- Experimental LLM Inference UX to aid in creative writing☆114Updated 7 months ago
- ☆95Updated 6 months ago
- After my server ui improvements were successfully merged, consider this repo a playground for experimenting, tinkering and hacking around…☆54Updated 10 months ago
- A daemon that automatically manages the performance states of NVIDIA GPUs.☆89Updated last month
- A multimodal, function calling powered LLM webui.☆214Updated 9 months ago
- A simple speech-to-text and text-to-speech AI chatbot that can be run fully offline.☆45Updated last year
- Easily view and modify JSON datasets for large language models☆78Updated 2 months ago
- Serving LLMs in the HF-Transformers format via a PyFlask API☆71Updated 10 months ago
- This extension enhances the capabilities of textgen-webui by integrating advanced vision models, allowing users to have contextualized co…☆54Updated 8 months ago
- Guide on text completion large language model fine-tuning, including example scripts and training data acquiring.☆77Updated 4 months ago
- InferX is a Inference Function as a Service Platform☆116Updated 2 weeks ago
- ☆49Updated 4 months ago
- Neo AI integrates into the Linux terminal, capable of executing system commands and providing helpful information.☆112Updated 2 months ago
- SLOP Detector and analyzer based on dictionary for shareGPT JSON and text☆71Updated 8 months ago
- Comparison of the output quality of quantization methods, using Llama 3, transformers, GGUF, EXL2.☆156Updated last year
- Open source LLM UI, compatible with all local LLM providers.☆175Updated 9 months ago
- Lightweight Inference server for OpenVINO☆188Updated this week
- ☆204Updated last month
- Dataset Crafting w/ RAG/Wikipedia ground truth and Efficient Fine-Tuning Using MLX and Unsloth. Includes configurable dataset annotation …☆185Updated 11 months ago
- ☆19Updated 9 months ago
- No-messing-around sh client for llama.cpp's server☆30Updated 11 months ago
- An extension that lets the AI take the wheel, allowing it to use the mouse and keyboard, recognize UI elements, and prompt itself :3...no…☆125Updated 8 months ago
- klmbr - a prompt pre-processing technique to break through the barrier of entropy while generating text with LLMs☆78Updated 9 months ago