☆56Nov 18, 2024Updated last year
Alternatives and similar repositories for llm-bench
Users that are interested in llm-bench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- kernel development code for my work (ioatdma, ntb_hw_intel, idxd, PCI, and CXL related bits)☆12Jan 19, 2026Updated 3 months ago
- Self-host LLMs with vLLM and BentoML☆170Mar 3, 2026Updated 2 months ago
- ☆18Aug 19, 2024Updated last year
- ☆12Mar 16, 2022Updated 4 years ago
- ☆25Apr 23, 2026Updated 2 weeks ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- High-performance GEMM implementation optimized for NVIDIA H100 GPUs, leveraging Hopper architecture's TMA, WGMMA, and Thread Block Cluste…☆10Dec 4, 2024Updated last year
- Tutorial Exercises and Code for GPU Communications Tutorial at HOT Interconnects 2025☆31Oct 22, 2025Updated 6 months ago
- Test Orchestrator for Performance and Scalability of AI pLatforms☆17Apr 22, 2026Updated 2 weeks ago
- PyTorch Code for the Paper: "Exploiting Uncertainty of Loss Landscape for Stochastic Optimization [Bhaskara et al. (2019)]☆16Apr 30, 2026Updated last week
- ☆25Dec 30, 2025Updated 4 months ago
- ☆16May 14, 2025Updated 11 months ago
- 一个移动终端的轻量级前端类库☆17May 24, 2013Updated 12 years ago
- ☆10Feb 17, 2026Updated 2 months ago
- Zeta implementation of a reusable and plug in and play feedforward from the paper "Exponentially Faster Language Modeling"☆16Nov 11, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Code for "Accelerating Training with Neuron Interaction and Nowcasting Networks" [ICLR 2025]☆29Feb 20, 2026Updated 2 months ago
- Benchmark suite for LLMs from Fireworks.ai☆101Updated this week
- Pragmatic approach to parsing import profiles for CI's☆12Jul 1, 2024Updated last year
- A cross-platform and editor-agnostic live previewer for Markdown files☆11Jul 15, 2024Updated last year
- git tracking for python notebooks☆12Jun 15, 2017Updated 8 years ago
- Multi-agent system for booking appointments and generating PDF invoices☆13Jul 16, 2025Updated 9 months ago
- LLM Serving Performance Evaluation Harness☆85Feb 25, 2025Updated last year
- Qwen3-0.6B megakernel: 527 tok/s decode on RTX 3090 (3.8x faster than PyTorch)☆98Feb 10, 2026Updated 2 months ago
- ☆55Aug 1, 2025Updated 9 months ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- A sample pattern for running CI tests on Modal☆19Apr 12, 2025Updated last year
- LLMPerf is a library for validating and benchmarking LLMs☆1,111Dec 9, 2024Updated last year
- An recognition oriented deep learning framework for biometric sample quality assessment☆12Aug 24, 2023Updated 2 years ago
- ☆22Jan 23, 2024Updated 2 years ago
- ☆12Mar 28, 2023Updated 3 years ago
- Unofficial Pytorch implementation of MiniLM and MiniLMv2☆23Jan 30, 2022Updated 4 years ago
- Visualize expert firing frequencies across sentences in the Mixtral MoE model☆18Dec 22, 2023Updated 2 years ago
- some mixture of experts architecture implementations☆27Mar 22, 2024Updated 2 years ago
- Ultimate DPDK System Enabling Expert☆11May 3, 2019Updated 7 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆12Jun 30, 2024Updated last year
- ☆22Oct 1, 2024Updated last year
- Pocket Survival Guide for Sys Admin - http://psg.skinforum.org/ -☆15May 2, 2026Updated last week
- ☆11Nov 5, 2021Updated 4 years ago
- Jax implementation of the AdaHessian optimizer☆20Mar 11, 2021Updated 5 years ago
- ☆15Apr 26, 2022Updated 4 years ago
- The code runs on the netronome smart card to filtering PPPoE and PPP control plane packet send to vbras and Decap\Encap data plane packet…☆11Jun 21, 2017Updated 8 years ago