cduk / vllm-pascalLinks
A fork of vLLM enabling Pascal architecture GPUs
☆31Updated 10 months ago
Alternatives and similar repositories for vllm-pascal
Users that are interested in vllm-pascal are comparing it to the libraries listed below
Sorting:
- A fast batching API to serve LLM models☆189Updated last year
- Dataset Crafting w/ RAG/Wikipedia ground truth and Efficient Fine-Tuning Using MLX and Unsloth. Includes configurable dataset annotation …☆192Updated last year
- ☆134Updated 3 weeks ago
- Fully-featured, beautiful web interface for vLLM - built with NextJS.☆166Updated 3 weeks ago
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.☆29Updated 11 months ago
- ☆51Updated 10 months ago
- GPU Power and Performance Manager☆64Updated last year
- An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal.☆267Updated 10 months ago
- Local LLM Powered Recursive Search & Smart Knowledge Explorer☆259Updated 2 months ago
- ☆108Updated 4 months ago
- A multimodal, function calling powered LLM webui.☆217Updated last year
- Your Trusty Memory-enabled AI Companion - Simple RAG chatbot optimized for local LLMs | 12 Languages Supported | OpenAI API Compatible☆346Updated 10 months ago
- ☆210Updated 4 months ago
- A local AI companion that uses a collection of free, open source AI models in order to create two virtual companions that will follow you…☆239Updated 2 months ago
- Docker compose to run vLLM on Windows☆112Updated 2 years ago
- ☆229Updated 8 months ago
- This is the Mixture-of-Agents (MoA) concept, adapted from the original work by TogetherAI. My version is tailored for local model usage a…☆117Updated last year
- Comparison of the output quality of quantization methods, using Llama 3, transformers, GGUF, EXL2.☆165Updated last year
- ☆178Updated 4 months ago
- automatically quant GGUF models☆219Updated 2 weeks ago
- Inference service for Qwen2.5-VL-7b model☆208Updated 9 months ago
- 🗣️ Real‑time, low‑latency voice, vision, and conversational‑memory AI assistant built on LiveKit and local LLMs ✨☆100Updated 6 months ago
- This extension enhances the capabilities of textgen-webui by integrating advanced vision models, allowing users to have contextualized co…☆57Updated last year
- Kosmos-2.5 is a cutting-edge Multimodal-LLM (MLLM) specializing in image OCR. However, its stringent software requirements & Python-scrip…☆67Updated last year
- Open source LLM UI, compatible with all local LLM providers.☆177Updated last year
- Realtime tts reading of large textfiles by your favourite voice. +Translation via LLM (Python script)☆52Updated last year
- ☆83Updated 10 months ago
- A stock market bot that automatically, once a day, rebalances your Robinhood portfolio by gathering information about each ticker in the …☆58Updated 10 months ago
- A command-line personal assistant that integrates with Google Calendar, Gmail, and Tasks to help manage your digital life.☆129Updated 3 months ago
- Simple UI for Llama-3.2-11B-Vision & Molmo-7B-D☆135Updated last year