IlyasMoutawwakil / llm-perf-backendLinks

The backend behind the LLM-Perf Leaderboard

☆10

Alternatives and similar repositories for llm-perf-backend

Users that are interested in llm-perf-backend are comparing it to the libraries listed below

Sorting:

IlyasMoutawwakil / py-txi
A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.
☆33Updated 2 months ago
aniketmaurya / fastserve-ai
Machine Learning Serving focused on GenAI with simplicity as the top priority.
☆59Updated 3 weeks ago
Alignment-Lab-AI / datagen
a pipeline for using api calls to agnostically convert unstructured data into structured training data
☆30Updated 10 months ago
hamelsmu / llama-inference
experiments with inference on llama
☆104Updated last year
chainyo / tensorshare
🤝 Trade any tensors over the network
☆30Updated last year
pacman100 / peft-codegen-25
☆23Updated 2 years ago
Upaya07 / NeurIPS-llm-efficiency-challenge
Code for NeurIPS LLM Efficiency Challenge
☆59Updated last year
stas00 / ml-ways
ML/DL Math and Method notes
☆62Updated last year
ChrisHayduk / qlora-multi-gpu
QLoRA with Enhanced Multi GPU Support
☆37Updated last year
titanml / takeoff-community
TitanML Takeoff Server is an optimization, compression and deployment platform that makes state of the art machine learning models access…
☆114Updated last year
AnswerDotAI / ModernBERT-Instruct-mini-cookbook
☆49Updated 5 months ago
kevinwu23 / StanfordFineTuneBench
☆31Updated 8 months ago
fw-ai / benchmark
Benchmark suite for LLMs from Fireworks.ai
☆76Updated this week
deshwalmahesh / PHUDGE
Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…
☆49Updated last year
teknium1 / LLM-Benchmark-Logs
Just a bunch of benchmark logs for different LLMs
☆119Updated last year
awslabs / extending-the-context-length-of-open-source-llms
☆56Updated last month
AblateIt / finetune-study
Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.
☆82Updated last year
s-smits / grpo-optuna
Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna
☆55Updated 6 months ago
premAI-io / benchmarks
🕹️ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models.
☆137Updated last year
substratusai / vllm-docker
☆63Updated 4 months ago
Preemo-Inc / text-generation-inference
☆199Updated last year
automix-llm / automix
Mixing Language Models with Self-Verification and Meta-Verification
☆105Updated 7 months ago
rwightman / genalog
Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and te…
☆42Updated last year
davanstrien / data-for-fine-tuning-llms
☆77Updated last year
Snowflake-Labs / vllm
☆15Updated 4 months ago
serp-ai / Parameter-Efficient-MoE
Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks
☆31Updated last year
tanyuqian / cappy
NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer
☆43Updated last year
skypilot-org / skypilot-tutorial
Tutorial to get started with SkyPilot!
☆58Updated last year
louisbrulenaudet / ragoon
High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡
☆66Updated 9 months ago
cray-lm / cray-lm
Cray-LM unified training and inference stack.
☆22Updated 6 months ago