cduk / vllm-pascal
A fork of vLLM enabling Pascal architecture GPUs
☆25 Updated last month
Alternatives and similar repositories for vllm-pascal:
Users who are interested in vllm-pascal are comparing it to the libraries listed below.
- A fast batching API to serve LLM models ☆183 Updated 11 months ago
- automatically quant GGUF models ☆164 Updated this week
- An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal. ☆245 Updated 3 weeks ago
- ☆46 Updated last month
- Orpheus Chat WebUI ☆32 Updated last week
- ☆83 Updated 3 months ago
- ☆125 Updated last week
- Dataset Crafting w/ RAG/Wikipedia ground truth and Efficient Fine-Tuning Using MLX and Unsloth. Includes configurable dataset annotation … ☆178 Updated 8 months ago
- idea: https://github.com/nyxkrage/ebook-groupchat/ ☆86 Updated 7 months ago
- GPU Power and Performance Manager ☆57 Updated 5 months ago
- Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on differe… ☆55 Updated last month
- This extension enhances the capabilities of textgen-webui by integrating advanced vision models, allowing users to have contextualized co… ☆53 Updated 5 months ago
- A Conversational Speech Generation Model with Gradio UI and OpenAI compatible API. UI and API support CUDA, MLX and CPU devices. ☆143 Updated last week
- The main repository for building Pascal-compatible versions of ML applications and libraries. ☆64 Updated 2 weeks ago
- Service for testing out the new Qwen2.5 omni model ☆17 Updated last week
- A pipeline parallel training script for LLMs. ☆136 Updated this week
- Deploy Apollo HF space locally ☆40 Updated 3 months ago
- A multimodal, function calling powered LLM webui. ☆214 Updated 6 months ago
- SLOP Detector and analyzer based on dictionary for shareGPT JSON and text ☆65 Updated 5 months ago
- ☆197 Updated 2 weeks ago
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications. ☆28 Updated 2 months ago
- A little(lil) Language Model (LM) ☆47 Updated last month
- A real-time speech-to-speech chatbot powered by Whisper Small, Llama 3.2, and Kokoro-82M. ☆209 Updated 2 months ago
- Realtime tts reading of large textfiles by your favourite voice. +Translation via LLM (Python script) ☆52 Updated 5 months ago
- Easily view and modify JSON datasets for large language models ☆72 Updated last month
- Game Companion AI is an advanced application designed to enhance the gaming experience by providing real-time analysis and interpretation… ☆49 Updated 6 months ago
- CaSIL is an advanced natural language processing system that implements a sophisticated four-layer semantic analysis architecture. It pro… ☆64 Updated 4 months ago
- ☆16 Updated 9 months ago
- Comparison of the output quality of quantization methods, using Llama 3, transformers, GGUF, EXL2. ☆148 Updated 10 months ago
- ☆159 Updated last week