yoziru / nextjs-vllm-ui
Fully-featured, beautiful web interface for vLLM - built with NextJS.
☆135Updated last month
Alternatives and similar repositories for nextjs-vllm-ui
Users that are interested in nextjs-vllm-ui are comparing it to the libraries listed below
Sorting:
- ☆89Updated 4 months ago
- A fast batching API to serve LLM models☆182Updated last year
- ☆130Updated 2 weeks ago
- This is the Mixture-of-Agents (MoA) concept, adapted from the original work by TogetherAI. My version is tailored for local model usage a…☆116Updated 10 months ago
- automatically quant GGUF models☆174Updated last week
- The RunPod worker template for serving our large language model endpoints. Powered by vLLM.☆309Updated last week
- This extension enhances the capabilities of textgen-webui by integrating advanced vision models, allowing users to have contextualized co…☆53Updated 6 months ago
- Docker compose to run vLLM on Windows☆78Updated last year
- A multimodal, function calling powered LLM webui.☆214Updated 7 months ago
- Dagger functions to import Hugging Face GGUF models into a local ollama instance and optionally push them to ollama.com.☆115Updated 11 months ago
- Mycomind Daemon: A mycelium-inspired, advanced Mixture-of-Memory-RAG-Agents (MoMRA) cognitive assistant that combines multiple AI models …☆32Updated 10 months ago
- idea: https://github.com/nyxkrage/ebook-groupchat/☆86Updated 8 months ago
- ☆110Updated 6 months ago
- An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal.☆253Updated 2 months ago
- Who needs o1 anyways. Add CoT to any OpenAI compatible endpoint.☆41Updated 7 months ago
- Comparison of the output quality of quantization methods, using Llama 3, transformers, GGUF, EXL2.☆152Updated 11 months ago
- Dataset Crafting w/ RAG/Wikipedia ground truth and Efficient Fine-Tuning Using MLX and Unsloth. Includes configurable dataset annotation …☆183Updated 9 months ago
- A pipeline parallel training script for LLMs.☆143Updated last week
- LLM inference in C/C++☆76Updated this week
- Open-source Perplexity app.☆122Updated last month
- Moxin is a family of fully open-source and reproducible LLMs☆92Updated 2 weeks ago
- A comprehensive platform for managing, testing, and leveraging Ollama AI models with advanced features for customization, workflow automa…☆47Updated 2 months ago
- Using Langroid's Multi-Agent Framework to Build LLM Apps☆137Updated this week
- LLM inference in C/C++☆21Updated last month
- 1.58-bit LLaMa model☆81Updated last year
- ☆156Updated 9 months ago
- GPT-4 Level Conversational QA Trained In a Few Hours☆61Updated 8 months ago
- ☆202Updated 3 weeks ago
- Convert Files / Folders / GitHub Repos Into AI / LLM-ready Files☆160Updated 3 months ago
- Gradio based tool to run opensource LLM models directly from Huggingface☆91Updated 10 months ago