crashr / gppm
GPU Power and Performance Manager
☆61 Updated 11 months ago
Alternatives and similar repositories for gppm
Users who are interested in gppm are comparing it to the repositories listed below.
- Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on differe… ☆82 Updated this week
- CaSIL is an advanced natural language processing system that implements a sophisticated four-layer semantic analysis architecture. It pro… ☆66 Updated 10 months ago
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications. ☆29 Updated 7 months ago
- A frontend for creative writing with LLMs ☆134 Updated last year
- AI stack for interacting with LLMs, Stable Diffusion, Whisper, xTTS and many other AI models ☆164 Updated last year
- A daemon that automatically manages the performance states of NVIDIA GPUs. ☆96 Updated 2 weeks ago
- Comparison of the output quality of quantization methods, using Llama 3, transformers, GGUF, EXL2. ☆164 Updated last year
- Easily view and modify JSON datasets for large language models ☆82 Updated 4 months ago
- SLOP Detector and analyzer based on dictionary for shareGPT JSON and text ☆73 Updated 10 months ago
- ☆50 Updated 6 months ago
- Serving LLMs in the HF-Transformers format via a PyFlask API ☆71 Updated last year
- ☆83 Updated last week
- ☆83 Updated 6 months ago
- A simple experiment on letting two local LLMs have a conversation about anything! ☆112 Updated last year
- Automatically quantize GGUF models ☆200 Updated this week
- This extension enhances the capabilities of textgen-webui by integrating advanced vision models, allowing users to have contextualized co… ☆57 Updated 10 months ago
- Real-time TTS reading of large text files in your favourite voice, plus translation via LLM (Python script) ☆52 Updated 10 months ago
- A local front-end for open-weight LLMs with memory, RAG, TTS/STT, Elo ratings, and dynamic research tools. Built with React and FastAPI. ☆37 Updated last month
- Open source LLM UI, compatible with all local LLM providers. ☆174 Updated 11 months ago
- A fast batching API to serve LLM models ☆187 Updated last year
- This small API downloads and exposes access to NeuML's txtai-wikipedia and full wikipedia datasets, taking in a query and returning full … ☆100 Updated 3 weeks ago
- Writing Extension for Text Generation WebUI ☆63 Updated last month
- ☆209 Updated last week
- ☆99 Updated 3 weeks ago
- Dataset Crafting w/ RAG/Wikipedia ground truth and Efficient Fine-Tuning Using MLX and Unsloth. Includes configurable dataset annotation … ☆185 Updated last year
- Lightweight Inference server for OpenVINO ☆210 Updated last week
- After my server ui improvements were successfully merged, consider this repo a playground for experimenting, tinkering and hacking around… ☆54 Updated last year
- An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal. ☆260 Updated 6 months ago
- klmbr - a prompt pre-processing technique to break through the barrier of entropy while generating text with LLMs ☆80 Updated 11 months ago
- A library and CLI utilities for managing performance states of NVIDIA GPUs. ☆28 Updated 11 months ago