ParisNeo / vllm_proxy_server
A vLLM proxy server that adds security and multi-model management to vLLM servers
☆12 Updated last year
Alternatives and similar repositories for vllm_proxy_server
Users who are interested in vllm_proxy_server compare it to the libraries listed below.
- ☆68 Updated last year
- Complex RAG backend ☆29 Updated last year
- AI-augmented, conversational information retrieval and data exploration ☆37 Updated last year
- automatically quant GGUF models ☆219 Updated last month
- Simple Chainlit UI for running llms from Groq and LangChain ☆17 Updated last year
- LLM-Training-API: Including Embeddings & ReRankers, mergekit, LaserRMT ☆27 Updated last year
- Serving LLMs in the HF-Transformers format via a PyFlask API ☆72 Updated last year
- Your Python AI Coder! ☆35 Updated 7 months ago
- Advanced Coding AI Assistant that uses a Gradio interface to stream coding related responses. ChatRAG supports local and API inference an… ☆23 Updated 7 months ago
- This is the Mixture-of-Agents (MoA) concept, adapted from the original work by TogetherAI. My version is tailored for local model usage a… ☆118 Updated last year
- Forces DeepSeek R1 models to engage in extended reasoning by intercepting early termination tokens. ☆19 Updated 10 months ago
- ☆117 Updated last year
- Mixture-of-Ollamas ☆30 Updated last year
- Locally running LLM with internet access ☆97 Updated 5 months ago
- Prompt Jinja2 templates for LLMs ☆35 Updated 5 months ago
- Ollama models of NousResearch/Hermes-2-Pro-Mistral-7B-GGUF ☆33 Updated last year
- A Windows tool to query various LLM AIs. Supports branched conversations, history and summaries among others. ☆33 Updated 2 months ago
- Gradio based tool to run opensource LLM models directly from Huggingface ☆96 Updated last year
- Streamlit Web UI for AGiXT ☆28 Updated 2 weeks ago
- Proteus is an experimental platform that combines the power of Large Language Models with the Genesis physics engine ☆25 Updated last year
- ☆134 Updated last week
- A Python library to orchestrate LLMs in a neural network-inspired structure ☆52 Updated last year
- Simple UI for Llama-3.2-11B-Vision & Molmo-7B-D ☆136 Updated last year
- This repository contains a web application designed to execute relatively compact, locally-operated Large Language Models (LLMs). ☆43 Updated 9 months ago
- ☆14 Updated 2 weeks ago
- Auto Data is a library designed for quick and effortless creation of datasets tailored for fine-tuning Large Language Models (LLMs). ☆103 Updated last year
- run ollama & gguf easily with a single command ☆52 Updated last year
- ToolAgents is a lightweight and flexible framework for creating function-calling agents with various language models and APIs. ☆27 Updated last week
- ⚡️🧪 Fast LLM Tool Calling Experimentation, big and smol ☆152 Updated last year
- Small Multimodal Vision Model "Imp-v1-3b" trained using Phi-2 and Siglip. ☆17 Updated last year