yoziru / nextjs-vllm-ui
Fully-featured, beautiful web interface for vLLM - built with NextJS.
☆159Updated 6 months ago
Alternatives and similar repositories for nextjs-vllm-ui
Users who are interested in nextjs-vllm-ui are comparing it to the libraries listed below.
- An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal.☆265Updated 8 months ago
- The RunPod worker template for serving our large language model endpoints. Powered by vLLM.☆380Updated this week
- Dataset Crafting w/ RAG/Wikipedia ground truth and Efficient Fine-Tuning Using MLX and Unsloth. Includes configurable dataset annotation …☆190Updated last year
- This is the Mixture-of-Agents (MoA) concept, adapted from the original work by TogetherAI. My version is tailored for local model usage a…☆118Updated last year
- ☆106Updated 2 months ago
- Automatically quantize GGUF models☆214Updated 3 weeks ago
- Docker compose to run vLLM on Windows☆106Updated last year
- A fast batching API to serve LLM models☆188Updated last year
- ☆133Updated 6 months ago
- Dagger functions to import Hugging Face GGUF models into a local ollama instance and optionally push them to ollama.com.☆119Updated last year
- ☆124Updated last year
- ☆208Updated 2 months ago
- Comparison of the output quality of quantization methods, using Llama 3, transformers, GGUF, EXL2.☆165Updated last year
- One click templates for inferencing Language Models☆218Updated 3 months ago
- ☆51Updated 9 months ago
- The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). Allowing users to chat with LLM …☆606Updated 9 months ago
- A pipeline parallel training script for LLMs.☆162Updated 6 months ago
- A multimodal, function calling powered LLM webui.☆216Updated last year
- 🚀 Retrieval Augmented Generation (RAG) with txtai. Combine search and LLMs to find insights with your own data.☆414Updated last week
- ☆173Updated 3 months ago
- Inference service for Qwen2.5-VL-7b model☆204Updated 7 months ago
- Easily view and modify JSON datasets for large language models☆84Updated 6 months ago
- An extension for oobabooga/text-generation-webui that enables the LLM to search the web☆268Updated this week
- Distributed inference for MLX LLMs☆99Updated last year
- Open Source Text Embedding Models with OpenAI Compatible API☆160Updated last year
- an AI interaction tool with RAG hybrid search, conversation context, web content processing and structured data analysis with LLM / GPT☆205Updated 5 months ago
- Benchmarking the serving capabilities of vLLM☆56Updated last year
- LM inference server implementation based on *.cpp.☆290Updated 3 months ago
- Link your Ollama models to LM-Studio☆145Updated last year
- Ollama chat client in Vue, everything you need to do your private text rpg in browser☆135Updated last year