anarchy-ai / llm-speed-benchmark
Benchmarking tool for assessing LLM performance across different hardware
☆17 · Updated last year
Alternatives and similar repositories for llm-speed-benchmark
Users who are interested in llm-speed-benchmark are comparing it to the libraries listed below
- Transformer GPU VRAM estimator ☆66 · Updated last year
- LLM code editor for backend services ☆14 · Updated 8 months ago
- Runner in charge of collecting metrics from LLM inference endpoints for the Unify Hub ☆17 · Updated last year
- Public reports detailing responses to sets of prompts by Large Language Models. ☆30 · Updated 6 months ago
- Machine learning library, Distributed training, Deep learning, Reinforcement learning, Models, TensorFlow, PyTorch ☆67 · Updated this week
- llama.cpp gguf file parser for javascript ☆43 · Updated 7 months ago
- A proxy that allows you to host ollama images in your local environment ☆34 · Updated last year
- DockaShell is an MCP server that gives AI agents isolated Docker containers to work in. MCP tools for shell access, file operations, and … ☆24 · Updated last month
- Deliver LLMs of GGUF format via Dockerfile. ☆13 · Updated 8 months ago
- Guards and protection agnostic to your model or provider ☆38 · Updated 7 months ago
- Blueprint by Mozilla.ai for answering questions about structured documents ☆35 · Updated 4 months ago
- CaptureFlow - LLM-powered code maintenance that delivers reliable results. ☆43 · Updated 11 months ago
- Resources regarding evML (edge verified machine learning) ☆18 · Updated 6 months ago
- Horizon chart for CPU/GPU/Neural Engine utilization monitoring. Supports Apple M1-M4, Nvidia GPUs, AMD GPUs ☆25 · Updated 3 weeks ago
- Tools for LLM agents. ☆63 · Updated 6 months ago
- vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs ☆87 · Updated this week
- LLM <-> Python agentic runtime prototype ☆54 · Updated last week
- LLM-Powered Analyses of your GitHub Community using EvaDB ☆24 · Updated last year
- 📡 Deploy AI models and apps to Kubernetes without developing a hernia ☆32 · Updated last year
- Rust crates for XetHub ☆43 · Updated 9 months ago
- ☆46 · Updated last month
- StarListify is a Python package that classifies GitHub stars history into organized category lists based on user-defined criteria. ☆25 · Updated 8 months ago
- Web browser version of StarCoder.cpp ☆45 · Updated last year
- ☆20 · Updated last year
- ☆31 · Updated last year
- 360M model running in the browser on WebGPU ☆22 · Updated 10 months ago
- ☆16 · Updated last year
- ☆116 · Updated 5 months ago
- An open source MCP proxy. ☆13 · Updated 6 months ago
- ☆22 · Updated last year