anarchy-ai / llm-speed-benchmarkLinks
Benchmarking tool for assessing LLM models' performance across different hardwares
☆17Updated 2 years ago
Alternatives and similar repositories for llm-speed-benchmark
Users that are interested in llm-speed-benchmark are comparing it to the libraries listed below
Sorting:
- Transformer GPU VRAM estimator☆67Updated last year
- vLLM with support for span semantics☆21Updated last month
- Realtime News and Information Eval☆16Updated 2 months ago
- StarListify is a Python package that classifies GitHub stars history into organized category lists based on user-defined criteria.☆33Updated last year
- Public reports detailing responses to sets of prompts by Large Language Models.☆32Updated last year
- ☆24Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMs☆53Updated 2 years ago
- ☆115Updated 11 months ago
- Repository for opt-out requests.☆10Updated last year
- Comparison of Language Model Inference Engines☆239Updated last year
- ☆19Updated last year
- Using Large Language Models for Repo-wide Type Prediction☆114Updated 2 years ago
- Simple, Fast, Parallel Huggingface GGML model downloader written in python☆24Updated 2 years ago
- Benchmarking suite for popular AI APIs☆88Updated 11 months ago
- Implementation of nougat that focuses on processing pdf locally.☆84Updated last year
- Visualize expert firing frequencies across sentences in the Mixtral MoE model☆18Updated 2 years ago
- Experimental wasm32-unknown-wasi runtime for Python code execution☆40Updated last year
- Benchmark structured generation libraries☆30Updated last year
- ☆477Updated 2 years ago
- Drop in replacement for OpenAI, but with Open models.☆154Updated 2 years ago
- AirLLM 70B inference with single 4GB GPU☆17Updated 7 months ago
- This is the documentation repository for SGLang. It is auto-generated from https://github.com/sgl-project/sglang☆100Updated this week
- A better way of testing, inspecting, and analyzing AI Agent traces.☆46Updated 2 weeks ago
- Generates grammer files from typescript for LLM generation☆38Updated last year
- LLM <-> Python agentic runtime prototype☆114Updated 5 months ago
- Python client library for improving your LLM app accuracy☆97Updated 11 months ago
- A collection of all available inference solutions for the LLMs☆94Updated 10 months ago
- ☆198Updated last year
- ☆74Updated 2 years ago
- An endpoint server for efficiently serving quantized open-source LLMs for code.☆58Updated 2 years ago