kemingy / vllm-env
Set up the environment for vLLM users
☆16Updated last year
Alternatives and similar repositories for vllm-env:
Users who are interested in vllm-env are comparing it to the repositories listed below
- Sentence Embedding as a Service☆15Updated last year
- Deploy ChatGLM on Modelz☆15Updated 2 years ago
- A collection of models built with ColossalAI☆32Updated 2 years ago
- Fast LLM training codebase with dynamic strategy selection [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler]☆36Updated last year
- Benchmark for machine learning model online serving (LLM, embedding, Stable-Diffusion, Whisper)☆28Updated last year
- LLMs as Collaboratively Edited Knowledge Bases☆45Updated last year
- Implementation of nougat that focuses on processing PDFs locally.☆81Updated 3 months ago
- Rust bindings for CTranslate2☆14Updated last year
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer☆42Updated last year
- Evaluation of bm42 sparse indexing algorithm☆65Updated 9 months ago
- Efficient and Scalable Estimation of Tool Representations in Vector Space☆23Updated 7 months ago
- Nexusflow function call, tool use, and agent benchmarks.☆19Updated 4 months ago
- An open-source NLP library: fast text cleaning and preprocessing☆23Updated 3 years ago
- A Python implementation of Toolformer using Huggingface Transformers☆15Updated 2 years ago
- The "GPT-API-Accelerate" project provides a set of Python classes for accelerating the process of generating responses to prompts using t…☆23Updated 6 months ago
- Reasoning by Communicating with Agents☆26Updated 6 months ago
- Evaluation for AI apps and agents☆39Updated last year
- Trace LLM calls (and others) and visualize them in WandB, as interactive SVG or using a streaming local webapp☆14Updated last month
- A lightweight script for converting HTML pages to Markdown, with support for code blocks☆79Updated last year
- Score LLM pretraining data with classifiers☆55Updated last year
- Lighter, cheaper and faster RAG toolkit (Graph RAG) supported by TargetPilot☆45Updated 6 months ago
- OpenAI compatible API for open source LLMs☆15Updated last year
- The multilingual variant of GLM, a general language model trained with an autoregressive blank infilling objective☆62Updated 2 years ago
- kimi-chat test data☆7Updated last year