Benchmarking suite for popular AI APIs
☆88Feb 6, 2025Updated last year
Alternatives and similar repositories for ai-benchmarks
Users who are interested in ai-benchmarks are comparing it to the libraries listed below.
- Python client SDK for Ultravox.☆16Dec 10, 2025Updated 5 months ago
- Use LLMs to clean your Gmail inbox☆22Dec 23, 2023Updated 2 years ago
- ☆25Apr 1, 2026Updated 2 months ago
- LLM benchmarks☆13Feb 22, 2024Updated 2 years ago
- Self-host LLMs with vLLM and BentoML☆169Mar 3, 2026Updated 3 months ago
- ☆16Aug 10, 2022Updated 3 years ago
- A distributed in-memory store for temporal knowledge graphs☆10Mar 20, 2024Updated 2 years ago
- Extract a single expert from a Mixture Of Experts model using slerp interpolation.☆19May 26, 2024Updated 2 years ago
- Local FAISS vector store as an MCP server – Agent Memory, drop-in local semantic search for Claude / Copilot / Agents.☆30Apr 24, 2026Updated last month
- Python package for extractive NLP using the OpenAI API☆17Aug 28, 2024Updated last year
- An MCP tool for indexing your Rust code to work with agents like Kiro CLI and Claude Code.☆24Feb 5, 2026Updated 4 months ago
- ☆121Apr 23, 2026Updated last month
- Pure Java Llama2 inference with optional multi-GPU CUDA implementation☆13Sep 2, 2023Updated 2 years ago
- ☆17Mar 16, 2026Updated 2 months ago
- JAX Scalify: end-to-end scaled arithmetics☆18Oct 30, 2024Updated last year
- Fork of Tensorpack to make breaking performance improvements to the Mask RCNN example. Training is approximately 2x faster than the origi…☆39May 13, 2026Updated 3 weeks ago
- Python library for the ServoSix motor controller from Monkmakes☆11Jul 1, 2019Updated 6 years ago
- Port of Detectron2 to train/deploy model on Amazon Sagemaker☆16Mar 5, 2021Updated 5 years ago
- Calculating weather symbols to represent a specific weather situation☆13Nov 10, 2022Updated 3 years ago
- The repository contains code for Adaptive Data Optimization☆36Dec 9, 2024Updated last year
- ☆477Jan 10, 2024Updated 2 years ago
- Resources for paper "DialSummEval: Revisiting summarization evaluation for dialogues"☆14Jul 22, 2025Updated 10 months ago
- This repository contains papers for a comprehensive survey on accelerated generation techniques in Large Language Models (LLMs).☆11May 24, 2024Updated 2 years ago
- Public code release for the paper "Reawakening knowledge: Anticipatory recovery from catastrophic interference via structured training"☆11Oct 27, 2025Updated 7 months ago
- Applications for audio inference at the edge on a raspberry-pi☆13Nov 16, 2021Updated 4 years ago
- Implementation of the paper: "Anomaly Detection in Continuous-Time Temporal Provenance Graphs". The code follows the CTDG framework https…☆23Nov 22, 2023Updated 2 years ago
- Example of applying CUDA graphs to LLaMA-v2☆11Aug 25, 2023Updated 2 years ago
- Hypercorn is an ASGI and WSGI Server based on Hyper libraries and inspired by Gunicorn.☆19Jan 12, 2026Updated 4 months ago
- A student driven lego detection system☆10Apr 7, 2021Updated 5 years ago
- LLM Serving Performance Evaluation Harness☆84Feb 25, 2025Updated last year
- Ouroboros: Speculative Decoding with Large Model Enhanced Drafting (EMNLP 2024 main)☆117Mar 20, 2025Updated last year
- Published version of composing programs textbook☆15Mar 8, 2014Updated 12 years ago
- License plate recognition (LPR) system based on FPGA-Pynq☆13Mar 22, 2019Updated 7 years ago
- GART: Graph Analysis on Relational Transactional Datasets☆21Oct 30, 2024Updated last year
- A clone of OpenAI's Tokenizer page for HuggingFace Models☆46Nov 13, 2023Updated 2 years ago
- Compare NVIDIA Video Codec SDK's, PyAV's, and OpenCV's performance on video decoding.☆12Dec 18, 2022Updated 3 years ago
- ☆11Aug 22, 2023Updated 2 years ago
- ☆16Aug 19, 2024Updated last year
- Development repository for the Triton language and compiler☆25Sep 17, 2025Updated 8 months ago