tensorchord / modelz-llm
OpenAI-compatible API for LLMs and embeddings (LLaMA, Vicuna, ChatGLM and many others)
☆272 Updated last year
Alternatives and similar repositories for modelz-llm:
Users who are interested in modelz-llm are comparing it to the libraries listed below.
- 👾📦 CodeBoxAPI is the simplest sandboxing infrastructure for your LLM Apps and Services. ☆334 Updated 3 weeks ago
- Local LLM ReAct Agent with Guidance ☆156 Updated last year
- ☆580 Updated last year
- Integrating AI plugins to LLMs ☆229 Updated last year
- Autoscale LLM (vLLM, SGLang, LMDeploy) inferences on Kubernetes (and others) ☆251 Updated last year
- An open-source, cloud-native serving framework for large multi-modal models (LMMs). ☆161 Updated last year
- ☆276 Updated last year
- Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform ☆82 Updated last week
- An OpenAI-like LLaMA inference API ☆113 Updated last year
- ☆152 Updated 7 months ago
- AI for all: Build the large graph of the language models ☆254 Updated 8 months ago
- Open Source Text Embedding Models with OpenAI Compatible API ☆145 Updated 7 months ago
- A command-line interface to generate textual and conversational datasets with LLMs. ☆294 Updated last year
- ☆352 Updated last year
- Official repository for LongChat and LongEval ☆519 Updated 8 months ago
- Convenient wrapper for fine-tuning and inference of Large Language Models (LLMs) with several quantization techniques (GPTQ, bitsandbytes… ☆147 Updated last year
- Lightweight chat AI platform featuring custom knowledge, open-source LLMs, prompt-engineering, retrieval analysis. Highly customizable. F… ☆201 Updated last year
- This repository contains code and tooling for the Abacus.AI LLM Context Expansion project. Also included are evaluation scripts and bench… ☆584 Updated last year
- ChatData brings RAG to real applications with FREE✨ knowledge bases. Now enjoy your chat with 6 million Wikipedia pages and 2 milli… ☆163 Updated 3 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆130 Updated 7 months ago
- An endpoint server for efficiently serving quantized open-source LLMs for code. ☆54 Updated last year
- The RunPod worker template for serving our large language model endpoints. Powered by vLLM. ☆285 Updated this week
- Playground for developing ChatGPT plugins ☆158 Updated last year
- ☆39 Updated last year
- Super performant RAG pipelines for AI apps. Summarization, Retrieve/Rerank and Code Interpreters in one simple API. ☆356 Updated 9 months ago
- ☆199 Updated last year
- A tool for generating function arguments and choosing what function to call with local LLMs ☆409 Updated 11 months ago
- ☆268 Updated last year
- ☆494 Updated 6 months ago
- Visual Studio Code extension for WizardCoder ☆145 Updated last year