anarchy-ai / llm-speed-benchmarkLinks
Benchmarking tool for assessing LLM models' performance across different hardwares
☆17Updated 2 years ago
Alternatives and similar repositories for llm-speed-benchmark
Users that are interested in llm-speed-benchmark are comparing it to the libraries listed below
Sorting:
- Transformer GPU VRAM estimator☆67Updated last year
- vLLM with support for span semantics☆21Updated 2 weeks ago
- Public reports detailing responses to sets of prompts by Large Language Models.☆32Updated 11 months ago
- Implementation of nougat that focuses on processing pdf locally.☆83Updated 11 months ago
- ☆198Updated last year
- Unofficial python bindings for the rust llm library. 🐍❤️🦀☆76Updated 2 years ago
- Benchmark structured generation libraries☆30Updated last year
- Aider's refactoring benchmark exercises based on popular python repos☆78Updated last year
- ☆115Updated 10 months ago
- ☆142Updated 2 years ago
- codellama on CPU without Docker☆25Updated last year
- ☆19Updated last year
- Drop in replacement for OpenAI, but with Open models.☆153Updated 2 years ago
- ☆23Updated last year
- Converts JSON-Schema to GBNF grammar to use with llama.cpp☆55Updated 2 years ago
- Verbosity control for AI agents☆64Updated last year
- Command line tool for Deep Infra cloud ML inference service☆33Updated last year
- StarListify is a Python package that classifies GitHub stars history into organized category lists based on user-defined criteria.☆31Updated last year
- ☆164Updated 4 months ago
- Realtime News and Information Eval☆15Updated last month
- Leverage your LangChain trace data for fine tuning☆46Updated last year
- Fast parallel LLM inference for MLX☆238Updated last year
- ☆74Updated 2 years ago
- Chat Markup Language conversation library☆55Updated last year
- Visualize expert firing frequencies across sentences in the Mixtral MoE model☆18Updated 2 years ago
- Python client library for improving your LLM app accuracy☆97Updated 10 months ago
- Vector Database with support for late interaction and token level embeddings.☆54Updated 6 months ago
- Benchmarking suite for popular AI APIs☆88Updated 10 months ago
- Fast inference of Instruct tuned LLaMa on your personal devices.☆23Updated 2 years ago
- EXO Gym is an open-source Python toolkit that facilitates distributed AI research.☆88Updated 3 weeks ago