ray-project / llmperf-leaderboard
☆430Updated 10 months ago
Related projects ⓘ
Alternatives and complementary repositories for llmperf-leaderboard
- LLMPerf is a library for validating and benchmarking LLMs☆645Updated 3 months ago
- ☆470Updated 2 months ago
- Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM☆685Updated this week
- Serving multiple LoRA finetuned LLM as one☆984Updated 6 months ago
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends☆811Updated this week
- Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs☆165Updated 2 weeks ago
- Extend existing LLMs way beyond the original training length with constant memory usage, without retraining☆675Updated 7 months ago
- [NeurIPS'24 Spotlight] To speed up Long-context LLMs' inference, approximate and dynamic sparse calculate the attention, which reduces in…☆791Updated this week
- This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?☆723Updated 3 weeks ago
- A throughput-oriented high-performance serving framework for LLMs☆636Updated 2 months ago
- Automatically evaluate your LLMs in Google Colab☆559Updated 6 months ago
- Code for the paper "Rethinking Benchmark and Contamination for Language Models with Rephrased Samples"☆293Updated 11 months ago
- Fast parallel LLM inference for MLX☆149Updated 4 months ago
- batched loras☆336Updated last year
- A tool for evaluating LLMs☆392Updated 6 months ago
- 🏋️ A unified multi-backend utility for benchmarking Transformers, Timm, PEFT, Diffusers and Sentence-Transformers with full support of O…☆257Updated this week
- Official implementation of Half-Quadratic Quantization (HQQ)☆701Updated last week
- Minimalistic large language model 3D-parallelism training☆1,260Updated this week
- Scalable toolkit for efficient model alignment☆620Updated this week
- Inference code for Mistral and Mixtral hacked up into original Llama implementation☆373Updated 11 months ago
- ☆191Updated this week
- ☆451Updated 3 weeks ago
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…☆1,634Updated this week
- An Open Source Toolkit For LLM Distillation☆356Updated 2 months ago
- Comparison of Language Model Inference Engines☆190Updated 2 months ago
- NexusRaven-13B, a new SOTA Open-Source LLM for function calling. This repo contains everything for reproducing our evaluation on NexusRav…☆308Updated last year
- RayLLM - LLMs on Ray☆1,237Updated 5 months ago
- An innovative library for efficient LLM inference via low-bit quantization☆348Updated 2 months ago