kemingy / vllm-env
Set up the environment for vLLM users
☆16 Updated last year
Alternatives and similar repositories for vllm-env
Users that are interested in vllm-env are comparing it to the libraries listed below
- Sentence Embedding as a Service ☆15 Updated 2 weeks ago
- A collection of models built with ColossalAI ☆32 Updated 2 years ago
- LLMs as Collaboratively Edited Knowledge Bases ☆45 Updated last year
- A collection of reproducible inference engine benchmarks ☆32 Updated 2 months ago
- A memory efficient DLRM training solution using ColossalAI ☆105 Updated 2 years ago
- ☆84 Updated last year
- Deploy ChatGLM on Modelz ☆15 Updated 2 years ago
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler] ☆40 Updated last year
- Open Implementations of LLM Analyses ☆105 Updated 9 months ago
- Benchmark for machine learning model online serving (LLM, embedding, Stable-Diffusion, Whisper) ☆28 Updated 2 years ago
- distill chatGPT coding ability into small model (1b) ☆30 Updated last year
- Reasoning by Communicating with Agents ☆29 Updated 2 months ago
- Advanced Reasoning Benchmark Dataset for LLMs ☆47 Updated last year
- OVALChat is a customizable Web app aimed at conducting user studies with chatbots ☆28 Updated last year
- ASTChunk is a Python toolkit for code chunking using Abstract Syntax Trees (ASTs), designed to create structurally sound and meaningful c… ☆30 Updated 2 weeks ago
- Evaluation of bm42 sparse indexing algorithm ☆68 Updated last year
- The minimal, ad-hoc way of plug and play NebulaGraph with pip install, even inside Colab Notebook! ☆17 Updated last year
- Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform ☆87 Updated 2 weeks ago
- A simple service that integrates vLLM with Ray Serve for fast and scalable LLM serving. ☆68 Updated last year
- ☆16 Updated last year
- ☆64 Updated 2 months ago
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer ☆43 Updated last year
- ☆37 Updated 2 years ago
- Some microbenchmarks and design docs before commencement ☆12 Updated 4 years ago
- Data preparation code for CrystalCoder 7B LLM ☆45 Updated last year
- Efficient and Scalable Estimation of Tool Representations in Vector Space ☆24 Updated 10 months ago
- An open-source NLP library: fast text cleaning and preprocessing ☆23 Updated 3 years ago
- Evaluating tool-augmented LLMs in conversation settings ☆85 Updated last year
- OpenAI compatible API for open source LLMs ☆15 Updated last year
- Showcase how mxbai-embed-large-v1 can be used to produce binary embedding. Binary embeddings enabled 32x storage savings and 40x faster r… ☆18 Updated last year