fixie-ai / ai-benchmarks
Benchmarking suite for popular AI APIs
☆81Updated last month
Alternatives and similar repositories for ai-benchmarks:
Users that are interested in ai-benchmarks are comparing it to the libraries listed below
- Self-host LLMs with vLLM and BentoML☆92Updated this week
- Website with current metrics on the fastest AI models.☆40Updated 4 months ago
- Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs☆207Updated this week
- Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform☆83Updated last month
- Tutorial for building LLM router☆186Updated 7 months ago
- DSPY on action with OpenSource LLMs.☆68Updated 11 months ago
- ☆76Updated 9 months ago
- ☆99Updated 6 months ago
- Benchmark suite for LLMs from Fireworks.ai☆69Updated last month
- 🔧 Compare how Agent systems perform on several benchmarks. 📊🚀☆87Updated 4 months ago
- ☆199Updated last year
- ☆152Updated 7 months ago
- Just a bunch of benchmark logs for different LLMs☆119Updated 7 months ago
- ☆117Updated 10 months ago
- ☆447Updated last year
- ☆65Updated 9 months ago
- Manage scalable open LLM inference endpoints in Slurm clusters☆253Updated 8 months ago
- Routing on Random Forest (RoRF)☆130Updated 5 months ago
- Evaluation of bm42 sparse indexing algorithm☆64Updated 8 months ago
- ☆36Updated last year
- ReLM is a Regular Expression engine for Language Models☆103Updated last year
- ☆111Updated last month
- A simple service that integrates vLLM with Ray Serve for fast and scalable LLM serving.☆63Updated 11 months ago
- ☆169Updated this week
- An experimental and alternative approach to Finetuning and RAG.☆35Updated last year
- Code for Husky, an open-source language agent that solves complex, multi-step reasoning tasks. Husky v1 addresses numerical, tabular and …☆336Updated 8 months ago
- vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs☆88Updated this week
- GPU accelerated client-side embeddings for vector search, RAG etc.☆66Updated last year
- AI Evaluation Platform☆46Updated this week
- ☆73Updated last month