cduk / vllm-pascal

A fork of vLLM enabling Pascal architecture GPUs

☆28

Alternatives and similar repositories for vllm-pascal

Users who are interested in vllm-pascal are comparing it to the repositories listed below.

yoziru / nextjs-vllm-ui
Fully-featured, beautiful web interface for vLLM - built with NextJS.
☆146 Updated 2 months ago
itsme2417 / PolyMind
A multimodal, function calling powered LLM webui.
☆214 Updated 9 months ago
matatonic / openedai-vision
An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal.
☆257 Updated 4 months ago
epolewski / EricLLM
A fast batching API to serve LLM models
☆183 Updated last year
matt-c1 / llama-3-quant-comparison
Comparison of the output quality of quantization methods, using Llama 3, transformers, GGUF, EXL2.
☆156 Updated last year
severian42 / Vodalus-Expert-LLM-Forge
Dataset Crafting w/ RAG/Wikipedia ground truth and Efficient Fine-Tuning Using MLX and Unsloth. Includes configurable dataset annotation …
☆185 Updated 11 months ago
RandomInternetPreson / Lucid_Vision
This extension enhances the capabilities of textgen-webui by integrating advanced vision models, allowing users to have contextualized co…
☆56 Updated 8 months ago
chigkim / Ollama-MMLU-Pro
☆95 Updated 6 months ago
antibitcoin / ReflectionAnyLLM
This project demonstrates a basic chain-of-thought interaction with any LLM (Large Language Model)
☆321 Updated 9 months ago
leafspark / AutoGGUF
automatically quant GGUF models
☆187 Updated this week
Lex-au / Vocalis
Speech-to-speech AI assistant with natural conversation flow, mid-speech interruption, vision capabilities and AI-initiated follow-ups. F…
☆188 Updated 3 months ago
phildougherty / qwen2.5-VL-inference-openai
Inference service for Qwen2.5-VL-7b model
☆188 Updated 3 months ago
rombodawg / Easy_training
☆49 Updated 4 months ago
severian42 / MoA-Ollama-Chat
This is the Mixture-of-Agents (MoA) concept, adapted from the original work by TogetherAI. My version is tailored for local model usage a…
☆117 Updated last year
v2rockets / Loyal-Elephie
Your Trusty Memory-enabled AI Companion - Simple RAG chatbot optimized for local LLMs | 12 Languages Supported | OpenAI API Compatible
☆319 Updated 4 months ago
7ozzam / cohere-toolkit-with-openai
Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.
☆28 Updated 5 months ago
aneeshjoy / vllm-windows
Docker compose to run vLLM on Windows
☆92 Updated last year
atineiatte / deep-research-at-home
☆206 Updated 2 months ago
phildougherty / qwen2.5_omni_chat
Service for testing out the new Qwen2.5 omni model
☆54 Updated 2 months ago
akashjss / sesame-csm
A Conversational Speech Generation Model with Gradio UI and OpenAI compatible API. UI and API support CUDA, MLX and CPU devices.
☆192 Updated 2 months ago
matteoserva / GraphLLM
☆204 Updated last month
remichu-ai / gallama
☆131 Updated 2 months ago
SingularityMan / vector_companion
A local AI companion that uses a collection of free, open source AI models in order to create two virtual companions that will follow you…
☆222 Updated last month
avarayr / suaveui
Open source LLM UI, compatible with all local LLM providers.
☆175 Updated 9 months ago
TesslateAI / Agent-Builder
☆107 Updated 2 months ago
masterFoad / NanoSage
Local LLM Powered Recursive Search & Smart Knowledge Explorer
☆244 Updated 5 months ago
crashr / gppm
GPU Power and Performance Manager
☆60 Updated 9 months ago
EtiennePerot / safe-code-execution
Code execution utilities for Open WebUI & Ollama
☆290 Updated 8 months ago
sasha0552 / nvidia-pstate
A library and CLI utilities for managing performance states of NVIDIA GPUs.
☆27 Updated 9 months ago
turboderp-org / exllamav3
An optimized quantization and inference library for running LLMs locally on modern consumer-class GPUs
☆436 Updated this week