yoziru / nextjs-vllm-uiLinks
Fully-featured, beautiful web interface for vLLM - built with NextJS.
☆166Updated 3 weeks ago
Alternatives and similar repositories for nextjs-vllm-ui
Users that are interested in nextjs-vllm-ui are comparing it to the libraries listed below
Sorting:
- automatically quant GGUF models☆219Updated 2 weeks ago
- An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal.☆267Updated 10 months ago
- ☆108Updated 4 months ago
- This is the Mixture-of-Agents (MoA) concept, adapted from the original work by TogetherAI. My version is tailored for local model usage a…☆117Updated last year
- Docker compose to run vLLM on Windows☆112Updated 2 years ago
- A fast batching API to serve LLM models☆189Updated last year
- Comparison of the output quality of quantization methods, using Llama 3, transformers, GGUF, EXL2.☆165Updated last year
- Dataset Crafting w/ RAG/Wikipedia ground truth and Efficient Fine-Tuning Using MLX and Unsloth. Includes configurable dataset annotation …☆192Updated last year
- ☆210Updated 4 months ago
- ☆51Updated 10 months ago
- Dagger functions to import Hugging Face GGUF models into a local ollama instance and optionally push them to ollama.com.☆118Updated last year
- ☆134Updated 3 weeks ago
- GPU Power and Performance Manager☆64Updated last year
- ☆127Updated last year
- An extension that lets the AI take the wheel, allowing it to use the mouse and keyboard, recognize UI elements, and prompt itself :3...no…☆127Updated last year
- Service for testing out the new Qwen2.5 omni model☆61Updated 8 months ago
- A pipeline parallel training script for LLMs.☆165Updated 8 months ago
- This extension enhances the capabilities of textgen-webui by integrating advanced vision models, allowing users to have contextualized co…☆57Updated last year
- Serving LLMs in the HF-Transformers format via a PyFlask API☆72Updated last year
- A python package for developing AI applications with local LLMs.☆151Updated last year
- Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on differe…☆86Updated last week
- After my server ui improvements were successfully merged, consider this repo a playground for experimenting, tinkering and hacking around…☆53Updated last year
- ☆178Updated 4 months ago
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.☆29Updated 11 months ago
- Inference service for Qwen2.5-VL-7b model☆208Updated 9 months ago
- 1.58-bit LLaMa model☆83Updated last year
- CaSIL is an advanced natural language processing system that implements a sophisticated four-layer semantic analysis architecture. It pro…☆67Updated last year
- ☆30Updated last year
- Simple UI for Llama-3.2-11B-Vision & Molmo-7B-D☆135Updated last year
- Aggregates compute from spare GPU capacity☆183Updated 2 weeks ago