cduk / vllm-pascalLinks
A fork of vLLM enabling Pascal architecture GPUs
☆28Updated 4 months ago
Alternatives and similar repositories for vllm-pascal
Users that are interested in vllm-pascal are comparing it to the libraries listed below
Sorting:
- A fast batching API to serve LLM models☆183Updated last year
- Dataset Crafting w/ RAG/Wikipedia ground truth and Efficient Fine-Tuning Using MLX and Unsloth. Includes configurable dataset annotation …☆186Updated 11 months ago
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.☆28Updated 5 months ago
- A multimodal, function calling powered LLM webui.☆214Updated 9 months ago
- automatically quant GGUF models☆184Updated last week
- Analyze Reddit posts☆25Updated 3 months ago
- ☆130Updated 2 months ago
- This extension enhances the capabilities of textgen-webui by integrating advanced vision models, allowing users to have contextualized co…☆54Updated 8 months ago
- ☆95Updated 6 months ago
- GPU Power and Performance Manager☆59Updated 8 months ago
- An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal.☆255Updated 3 months ago
- Service for testing out the new Qwen2.5 omni model☆52Updated last month
- A library and CLI utilities for managing performance states of NVIDIA GPUs.☆27Updated 8 months ago
- The main repository for building Pascal-compatible versions of ML applications and libraries.☆95Updated last month
- Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on differe…☆67Updated last week
- A daemon that automatically manages the performance states of NVIDIA GPUs.☆89Updated 2 weeks ago
- Yet another self-hosted AI voice assistant. GlaDOS' blazing fast pipeline with Kokoro TTS voice and vision.☆57Updated 4 months ago
- Experimental LLM Inference UX to aid in creative writing☆114Updated 6 months ago
- ☆49Updated 4 months ago
- ☆204Updated last month
- Dagger functions to import Hugging Face GGUF models into a local ollama instance and optionally push them to ollama.com.☆115Updated last year
- Your Trusty Memory-enabled AI Companion - Simple RAG chatbot optimized for local LLMs | 12 Languages Supported | OpenAI API Compatible☆317Updated 3 months ago
- SLOP Detector and analyzer based on dictionary for shareGPT JSON and text☆70Updated 7 months ago
- This is the Mixture-of-Agents (MoA) concept, adapted from the original work by TogetherAI. My version is tailored for local model usage a…☆116Updated 11 months ago
- A local AI companion that uses a collection of free, open source AI models in order to create two virtual companions that will follow you…☆219Updated 2 weeks ago
- A pipeline parallel training script for LLMs.☆149Updated last month
- Deploy Apollo HF space locally☆40Updated 6 months ago
- Easily view and modify JSON datasets for large language models☆76Updated last month
- Transplants vocabulary between language models, enabling the creation of draft models for speculative decoding WITHOUT retraining.☆31Updated 2 months ago
- InferX is a Inference Function as a Service Platform☆111Updated last week