yoziru / nextjs-vllm-uiLinks
Fully-featured, beautiful web interface for vLLM - built with NextJS.
☆172Updated last month
Alternatives and similar repositories for nextjs-vllm-ui
Users that are interested in nextjs-vllm-ui are comparing it to the libraries listed below
Sorting:
- automatically quant GGUF models☆219Updated last month
- ☆209Updated 3 weeks ago
- Docker compose to run vLLM on Windows☆114Updated 2 years ago
- ☆109Updated 5 months ago
- An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal.☆266Updated 10 months ago
- An extension that lets the AI take the wheel, allowing it to use the mouse and keyboard, recognize UI elements, and prompt itself :3...no…☆127Updated last year
- ☆135Updated last month
- This is the Mixture-of-Agents (MoA) concept, adapted from the original work by TogetherAI. My version is tailored for local model usage a…☆118Updated last year
- Distributed Inference for mlx LLm☆100Updated last year
- A fast batching API to serve LLM models☆188Updated last year
- Service for testing out the new Qwen2.5 omni model☆62Updated 9 months ago
- The RunPod worker template for serving our large language model endpoints. Powered by vLLM.☆393Updated last week
- Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on differe…☆88Updated this week
- ☆30Updated last year
- ☆178Updated 5 months ago
- Dataset Crafting w/ RAG/Wikipedia ground truth and Efficient Fine-Tuning Using MLX and Unsloth. Includes configurable dataset annotation …☆193Updated last year
- This extension enhances the capabilities of textgen-webui by integrating advanced vision models, allowing users to have contextualized co…☆57Updated last year
- A simple, intuitive toolkit for quickly implementing LLM powered applications.☆272Updated last year
- Inference service for Qwen2.5-VL-7b model☆209Updated 10 months ago
- Dagger functions to import Hugging Face GGUF models into a local ollama instance and optionally push them to ollama.com.☆118Updated last year
- Simple UI for Llama-3.2-11B-Vision & Molmo-7B-D☆134Updated last year
- A python package for developing AI applications with local LLMs.☆150Updated last year
- VSCode AI coding assistant powered by self-hosted llama.cpp endpoint.☆183Updated last year
- ☆128Updated last year
- CaSIL is an advanced natural language processing system that implements a sophisticated four-layer semantic analysis architecture. It pro…☆67Updated last year
- an AI interaction tool with RAG hybrid search, conversation context, web content processing and structured data analysis with LLM / GPT☆211Updated 7 months ago
- Transplants vocabulary between language models, enabling the creation of draft models for speculative decoding WITHOUT retraining.☆48Updated 3 months ago
- A real-time speech-to-speech chatbot powered by Whisper Small, Llama 3.2, and Kokoro-82M.☆245Updated last year
- Unsloth Studio☆125Updated 9 months ago
- An extension for oobabooga/text-generation-webui that enables the LLM to search the web☆275Updated 3 weeks ago