cduk / vllm-pascal
A fork of vLLM enabling Pascal architecture GPUs
☆24Updated 2 months ago
Alternatives and similar repositories for vllm-pascal:
Users that are interested in vllm-pascal are comparing it to the libraries listed below
- idea: https://github.com/nyxkrage/ebook-groupchat/☆85Updated 6 months ago
- GPU Power and Performance Manager☆55Updated 4 months ago
- This extension enhances the capabilities of textgen-webui by integrating advanced vision models, allowing users to have contextualized co…☆52Updated 4 months ago
- Deploy Apollo HF space locally☆40Updated 2 months ago
- A multimodal, function calling powered LLM webui.☆214Updated 4 months ago
- automatically quant GGUF models☆154Updated this week
- A fast batching API to serve LLM models☆180Updated 9 months ago
- ☆45Updated this week
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.☆26Updated last month
- A daemon that automatically manages the performance states of NVIDIA GPUs.☆57Updated 3 months ago
- Comparison of the output quality of quantization methods, using Llama 3, transformers, GGUF, EXL2.☆146Updated 9 months ago
- An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal.☆225Updated 2 months ago
- After my server ui improvements were successfully merged, consider this repo a playground for experimenting, tinkering and hacking around…☆56Updated 6 months ago
- Simple UI for Llama-3.2-11B-Vision & Molmo-7B-D☆134Updated 4 months ago
- AI stack for interacting with LLMs, Stable Diffusion, Whisper, xTTS and many other AI models☆150Updated 9 months ago
- This is the Mixture-of-Agents (MoA) concept, adapted from the original work by TogetherAI. My version is tailored for local model usage a…☆110Updated 7 months ago
- Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on differe…☆53Updated this week
- Dataset Crafting w/ RAG/Wikipedia ground truth and Efficient Fine-Tuning Using MLX and Unsloth. Includes configurable dataset annotation …☆173Updated 7 months ago
- Easily view and modify JSON datasets for large language models☆71Updated last week
- A frontend for creative writing with LLMs☆117Updated 7 months ago
- Docker compose to run vLLM on Windows☆62Updated last year
- A stock market bot that automatically, once a day, rebalances your Robinhood portfolio by gathering information about each ticker in the …☆51Updated this week
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆31Updated 7 months ago
- This small API downloads and exposes access to NeuML's txtai-wikipedia and full wikipedia datasets, taking in a query and returning full …☆76Updated 3 weeks ago
- ☆124Updated 2 weeks ago
- SoftWhisper simplifies audio and video transcription using the powerful Whisper model. Easily select custom models, languages, and tasks,…☆46Updated 4 months ago
- Experimental LLM Inference UX to aid in creative writing☆112Updated 2 months ago
- A local AI companion that uses a collection of free, open source AI models in order to create two virtual companions that will follow you…☆136Updated last week