kemingy / vllm-env
setup the env for vllm users
☆16Updated last year
Alternatives and similar repositories for vllm-env:
Users that are interested in vllm-env are comparing it to the libraries listed below
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];☆37Updated last year
- Reasoning by Communicating with Agents☆28Updated last week
- Sentence Embedding as a Service☆15Updated last year
- Deploy ChatGLM on Modelz☆15Updated 2 years ago
- A collection of reproducible inference engine benchmarks☆29Updated 2 weeks ago
- Yet another coding assistant powered by LLM.☆16Updated 7 months ago
- A library for simplifying fine tuning with multi gpu setups in the Huggingface ecosystem.☆16Updated 6 months ago
- LLMs as Collaboratively Edited Knowledge Bases☆45Updated last year
- An open-source NLP library: fast text cleaning and preprocessing☆23Updated 3 years ago
- ☆17Updated last year
- Evaluation for AI apps and agent☆41Updated last year
- Efficient and Scalable Estimation of Tool Representations in Vector Space☆23Updated 8 months ago
- A collection of models built with ColossalAI☆32Updated 2 years ago
- The minimal, ad-hoc way of plug and play NebulaGraph with pip install, even inside Colab Notebook!☆17Updated 11 months ago
- A fast, local, and secure approach for training LLMs for coding tasks using GRPO with WebAssembly and interpreter feedback.☆22Updated last month
- ☆17Updated 2 years ago
- ☆41Updated 4 months ago
- Data preparation code for CrystalCoder 7B LLM☆44Updated 11 months ago
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…☆26Updated 5 months ago
- Benchmark for machine learning model online serving (LLM, embedding, Stable-Diffusion, Whisper)☆28Updated last year
- ☆24Updated 3 months ago
- distill chatGPT coding ability into small model (1b)☆29Updated last year
- Showcase how mxbai-embed-large-v1 can be used to produce binary embedding. Binary embeddings enabled 32x storage savings and 40x faster r…☆17Updated last year
- ☆27Updated last week
- OVALChat is a customizable Web app aimed at conducting user studies with chatbots☆28Updated last year
- Evaluation of bm42 sparse indexing algorithm☆65Updated 9 months ago
- A Python implementation of Toolformer using Huggingface Transformers☆14Updated 2 years ago
- A lightweight script for processing HTML page to markdown format with support for code blocks☆79Updated last year
- Nexusflow function call, tool use, and agent benchmarks.☆19Updated 4 months ago
- This repository contains the ToolSelect dataset which was used to fine-tune Llama-2 70B for tool selection.☆20Updated last year