fixie-ai / ai-benchmarks

Benchmarking suite for popular AI APIs

☆87

Alternatives and similar repositories for ai-benchmarks

Users who are interested in ai-benchmarks are comparing it to the repositories listed below.

fixie-ai / thefastest.ai
Website with current metrics on the fastest AI models.
☆43 Updated 8 months ago
anyscale / llm-router
Tutorial for building LLM router
☆221 Updated last year
fw-ai / benchmark
Benchmark suite for LLMs from Fireworks.ai
☆76 Updated last week
ServiceNow / Fast-LLM
Accelerating your LLM training to full speed! Made with ❤️ by ServiceNow Research
☆218 Updated this week
h2oai / enterprise-h2ogpte
Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform
☆87 Updated last month
ray-project / llmperf-leaderboard
☆463 Updated last year
bentoml / BentoVLLM
Self-host LLMs with vLLM and BentoML
☆139 Updated last week
mani-kantap / llm-inference-solutions
A collection of all available inference solutions for the LLMs
☆91 Updated 5 months ago
Cerebras / DocChat
GPT-4 Level Conversational QA Trained In a Few Hours
☆63 Updated 11 months ago
nyunAI / PruneGPT
☆51 Updated last year
interstellarninja / function-calling-eval
A framework for evaluating function calls made by LLMs
☆37 Updated last year
substratusai / sandboxai
Run AI generated code in isolated sandboxes
☆90 Updated 6 months ago
teknium1 / LLM-Benchmark-Logs
Just a bunch of benchmark logs for different LLMs
☆119 Updated last year
Preemo-Inc / text-generation-inference
☆199 Updated last year
QuixiAI / OpenChatML
☆157 Updated last year
AlexBodner / How_Much_VRAM
☆102 Updated 11 months ago
qdrant / bm42_eval
Evaluation of bm42 sparse indexing algorithm
☆68 Updated last year
premAI-io / benchmarks
🕹️ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models.
☆137 Updated last year
sdan / selfextend
an implementation of Self-Extend, to expand the context window via grouped attention
☆119 Updated last year
log10-io / log10
Python client library for improving your LLM app accuracy
☆98 Updated 5 months ago
Not-Diamond / RoRF
Routing on Random Forest (RoRF)
☆187 Updated 10 months ago
SalesforceAIResearch / SFR-RAG
☆77 Updated 6 months ago
argilla-io / argilla-cookbook
Simple examples using Argilla tools to build AI
☆53 Updated 8 months ago
substratusai / vllm-docker
☆63 Updated 4 months ago
LLM360 / amber-data-prep
Data preparation code for Amber 7B LLM
☆91 Updated last year
dust-tt / llama-ssp
Experiments on speculative sampling with Llama models
☆128 Updated 2 years ago
aymeric-roucher / agent_reasoning_benchmark
🔧 Compare how Agent systems perform on several benchmarks. 📊🚀
☆99 Updated this week
nexusflowai / NexusBench
Nexusflow function call, tool use, and agent benchmarks.
☆27 Updated 7 months ago
read-agent / read-agent.github.io
☆64 Updated last year
fw-ai / cookbook
Recipes and resources for building, deploying, and fine-tuning generative AI with Fireworks.
☆120 Updated last week