kemingy / vllm-envLinks
setup the env for vllm users
☆16Updated last year
Alternatives and similar repositories for vllm-env
Users that are interested in vllm-env are comparing it to the libraries listed below
Sorting:
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];☆38Updated last year
- ☆17Updated 2 years ago
- A collection of reproducible inference engine benchmarks☆31Updated 2 months ago
- Reasoning by Communicating with Agents☆28Updated last month
- LLMs as Collaboratively Edited Knowledge Bases☆45Updated last year
- Efficient and Scalable Estimation of Tool Representations in Vector Space☆23Updated 9 months ago
- Benchmark for machine learning model online serving (LLM, embedding, Stable-Diffusion, Whisper)☆28Updated last year
- Evaluation for AI apps and agent☆42Updated last year
- Sentence Embedding as a Service☆15Updated last year
- Implemented a script that automatically adjusts Qwen3's inference and non-inference capabilities, based on an OpenAI-like API. The infere…☆20Updated last month
- fastertransformer for codegeex model☆63Updated 2 years ago
- A Python implementation of Toolformer using Huggingface Transformers☆14Updated 2 years ago
- ☆23Updated 4 months ago
- A lightweight script for processing HTML page to markdown format with support for code blocks☆79Updated last year
- An open-source NLP library: fast text cleaning and preprocessing☆23Updated 3 years ago
- ☆84Updated last year
- Deploy ChatGLM on Modelz☆15Updated 2 years ago
- Showcase how mxbai-embed-large-v1 can be used to produce binary embedding. Binary embeddings enabled 32x storage savings and 40x faster r…☆18Updated last year
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer☆43Updated last year
- kimi-chat 测试数据☆7Updated last year
- ☆53Updated last year
- Easy to deploy.A cloud service for python code interpreter sandbox for Code-Interpreter.☆53Updated last year
- Deployment a light and full OpenAI API for production with vLLM to support /v1/embeddings with all embeddings models.☆42Updated 11 months ago
- Data preparation code for CrystalCoder 7B LLM☆45Updated last year
- Benchmark suite for LLMs from Fireworks.ai☆76Updated 2 weeks ago
- The minimal, ad-hoc way of plug and play NebulaGraph with pip install, even inside Colab Notebook!☆17Updated last year
- ☆16Updated last year
- Score LLM pretraining data with classifiers☆55Updated last year
- Evaluation of bm42 sparse indexing algorithm☆68Updated 11 months ago
- ☆34Updated last month