ray-project / llmperf-leaderboardLinks

☆471

Alternatives and similar repositories for llmperf-leaderboard

Users that are interested in llmperf-leaderboard are comparing it to the libraries listed below

Sorting:

apoorvumang / prompt-lookup-decoding
☆572Updated last year
lapp0 / lm-inference-engines
Comparison of Language Model Inference Engines
☆231Updated 10 months ago
scaleapi / llm-engine
Scale LLM Engine public repository
☆813Updated last week
ray-project / llmperf
LLMPerf is a library for validating and benchmarking LLMs
☆1,032Updated 10 months ago
punica-ai / punica
Serving multiple LoRA finetuned LLM as one
☆1,101Updated last year
ray-project / ray-llm
RayLLM - LLMs on Ray (Archived). Read README for more info.
☆1,263Updated 7 months ago
anyscale / llm-router
Tutorial for building LLM router
☆231Updated last year
mlabonne / llm-autoeval
Automatically evaluate your LLMs in Google Colab
☆660Updated last year
modal-labs / llm-finetuning
Guide for fine-tuning Llama/Mistral/CodeLlama models and more
☆626Updated last week
arthur-ai / bench
A tool for evaluating LLMs
☆424Updated last year
huggingface / optimum-benchmark
🏋️ A unified multi-backend utility for benchmarking Transformers, Timm, PEFT, Diffusers and Sentence-Transformers with full support of O…
☆318Updated 3 weeks ago
Snowflake-Labs / snowflake-arctic
☆550Updated last year
philschmid / easyllm
☆464Updated last year
nexusflowai / NexusRaven
NexusRaven-13B, a new SOTA Open-Source LLM for function calling. This repo contains everything for reproducing our evaluation on NexusRav…
☆316Updated 2 years ago
run-ai / llmperf
☆58Updated last year
sabetAI / BLoRA
batched loras
☆346Updated 2 years ago
tomaarsen / attention_sinks
Extend existing LLMs way beyond the original training length with constant memory usage, without retraining
☆722Updated last year
mistralai-sf24 / hackathon
☆446Updated last year
dzhulgakov / llama-mistral
Inference code for Mistral and Mixtral hacked up into original Llama implementation
☆368Updated last year
intel / neural-speed
An innovative library for efficient LLM inference via low-bit quantization
☆349Updated last year
runpod-workers / worker-vllm
The RunPod worker template for serving our large language model endpoints. Powered by vLLM.
☆371Updated last month
abacusai / Long-Context
This repository contains code and tooling for the Abacus.AI LLM Context Expansion project. Also included are evaluation scripts and bench…
☆595Updated last year
Preemo-Inc / text-generation-inference
☆197Updated last year
triton-inference-server / vllm_backend
☆302Updated this week
FastEval / FastEval
Fast & more realistic evaluation of chat language models. Includes leaderboard.
☆189Updated last year
rizerphe / local-llm-function-calling
A tool for generating function arguments and choosing what function to call with local LLMs
☆430Updated last year
LudwigStumpp / llm-leaderboard
A joint community effort to create one central leaderboard for LLMs.
☆304Updated last year
QuixiAI / OpenChatML
☆162Updated 2 months ago
neuralmagic / nm-vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
☆266Updated last year
vllm-project / guidellm
Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs
☆655Updated this week