cduk / vllm-pascalLinks
A fork of vLLM enabling Pascal architecture GPUs
☆28Updated 3 months ago
Alternatives and similar repositories for vllm-pascal
Users that are interested in vllm-pascal are comparing it to the libraries listed below
Sorting:
- automatically quant GGUF models☆181Updated this week
- A fast batching API to serve LLM models☆181Updated last year
- ☆90Updated 5 months ago
- ☆48Updated 3 months ago
- ☆129Updated last month
- This is the Mixture-of-Agents (MoA) concept, adapted from the original work by TogetherAI. My version is tailored for local model usage a…☆116Updated 11 months ago
- The main repository for building Pascal-compatible versions of ML applications and libraries.☆90Updated 2 weeks ago
- A multimodal, function calling powered LLM webui.☆214Updated 8 months ago
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.☆28Updated 4 months ago
- Service for testing out the new Qwen2.5 omni model☆51Updated last month
- An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal.☆255Updated 3 months ago
- This extension enhances the capabilities of textgen-webui by integrating advanced vision models, allowing users to have contextualized co…☆54Updated 7 months ago
- Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on differe…☆65Updated this week
- A daemon that automatically manages the performance states of NVIDIA GPUs.☆86Updated last month
- Realtime tts reading of large textfiles by your favourite voice. +Translation via LLM (Python script)☆52Updated 7 months ago
- ☆203Updated 2 weeks ago
- A library and CLI utilities for managing performance states of NVIDIA GPUs.☆26Updated 8 months ago
- Experimental LLM Inference UX to aid in creative writing☆113Updated 5 months ago
- Open source LLM UI, compatible with all local LLM providers.☆174Updated 8 months ago
- CaSIL is an advanced natural language processing system that implements a sophisticated four-layer semantic analysis architecture. It pro…☆66Updated 7 months ago
- ☆75Updated this week
- Easily view and modify JSON datasets for large language models☆75Updated 3 weeks ago
- Dataset Crafting w/ RAG/Wikipedia ground truth and Efficient Fine-Tuning Using MLX and Unsloth. Includes configurable dataset annotation …☆185Updated 10 months ago
- 🗣️ Real‑time, low‑latency voice, vision, and conversational‑memory AI assistant built on LiveKit and local LLMs ✨☆35Updated last week
- ☆120Updated 2 weeks ago
- ☆198Updated 3 weeks ago
- Comparison of the output quality of quantization methods, using Llama 3, transformers, GGUF, EXL2.☆153Updated last year
- A comprehensive platform for managing, testing, and leveraging Ollama AI models with advanced features for customization, workflow automa…☆48Updated 2 months ago
- Lightweight Inference server for OpenVINO☆180Updated this week
- GPU Power and Performance Manager☆59Updated 7 months ago