yoziru / nextjs-vllm-ui
Fully-featured, beautiful web interface for vLLM - built with NextJS.
☆110Updated 6 months ago
Alternatives and similar repositories for nextjs-vllm-ui:
Users that are interested in nextjs-vllm-ui are comparing it to the libraries listed below
- automatically quant GGUF models☆155Updated this week
- ☆124Updated 2 weeks ago
- ☆77Updated 2 months ago
- A fast batching API to serve LLM models☆180Updated 9 months ago
- This is the Mixture-of-Agents (MoA) concept, adapted from the original work by TogetherAI. My version is tailored for local model usage a…☆110Updated 7 months ago
- idea: https://github.com/nyxkrage/ebook-groupchat/☆85Updated 6 months ago
- The RunPod worker template for serving our large language model endpoints. Powered by vLLM.☆285Updated this week
- Low-Rank adapter extraction for fine-tuned transformers models☆169Updated 9 months ago
- Comparison of the output quality of quantization methods, using Llama 3, transformers, GGUF, EXL2.☆146Updated 9 months ago
- Dataset Crafting w/ RAG/Wikipedia ground truth and Efficient Fine-Tuning Using MLX and Unsloth. Includes configurable dataset annotation …☆173Updated 7 months ago
- A pipeline parallel training script for LLMs.☆124Updated 3 weeks ago
- 1.58-bit LLaMa model☆82Updated 10 months ago
- A library for easily merging multiple LLM experts, and efficiently train the merged LLM.☆444Updated 5 months ago
- Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients.☆192Updated 7 months ago
- All the world is a play, we are but actors in it.☆47Updated this week
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆90Updated last month
- This extension enhances the capabilities of textgen-webui by integrating advanced vision models, allowing users to have contextualized co…☆53Updated 4 months ago
- ☆192Updated 3 weeks ago
- The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). Allowing users to chat with LLM …☆536Updated last week
- An unsupervised model merging algorithm for Transformers-based language models.☆105Updated 9 months ago
- A multimodal, function calling powered LLM webui.☆214Updated 5 months ago
- ☆152Updated 7 months ago
- After my server ui improvements were successfully merged, consider this repo a playground for experimenting, tinkering and hacking around…☆56Updated 6 months ago
- EfficientQAT: Efficient Quantization-Aware Training for Large Language Models☆246Updated 4 months ago
- A comprehensive platform for managing, testing, and leveraging Ollama AI models with advanced features for customization, workflow automa…☆47Updated last week
- 🍲Agent Chef🥘 is my robust tool for dataset refinement, structuring, and generation. By leveraging procedural and synthetic dataset gene…☆19Updated 2 weeks ago
- multi1: create o1-like reasoning chains with multiple AI providers (and locally). Supports LiteLLM as backend too for 100+ providers at o…☆345Updated 3 weeks ago
- Who needs o1 anyways. Add CoT to any OpenAI compatible endpoint.☆41Updated 5 months ago
- Mycomind Daemon: A mycelium-inspired, advanced Mixture-of-Memory-RAG-Agents (MoMRA) cognitive assistant that combines multiple AI models …☆31Updated 7 months ago
- ☆252Updated 2 months ago