fixie-ai / ai-benchmarks
Benchmarking suite for popular AI APIs
☆85Updated 3 months ago
Alternatives and similar repositories for ai-benchmarks
Users that are interested in ai-benchmarks are comparing it to the libraries listed below
Sorting:
- Benchmark suite for LLMs from Fireworks.ai☆72Updated this week
- Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform☆87Updated 3 weeks ago
- Website with current metrics on the fastest AI models.☆41Updated 6 months ago
- Self-host LLMs with vLLM and BentoML☆109Updated last week
- ☆77Updated 11 months ago
- Just a bunch of benchmark logs for different LLMs☆119Updated 9 months ago
- ☆460Updated last year
- An experimental and alternative approach to Finetuning and RAG.☆35Updated last year
- Deployment a light and full OpenAI API for production with vLLM to support /v1/embeddings with all embeddings models.☆42Updated 10 months ago
- GPT-4 Level Conversational QA Trained In a Few Hours☆61Updated 8 months ago
- Python client library for improving your LLM app accuracy☆98Updated 3 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆53Updated 3 months ago
- A list of LLM benchmark frameworks.☆66Updated last year
- ☆199Updated last year
- Check for data drift between two OpenAI multi-turn chat jsonl files.☆37Updated last year
- A framework for evaluating function calls made by LLMs☆37Updated 9 months ago
- Solving data for LLMs - Create quality synthetic datasets!☆147Updated 3 months ago
- ☆37Updated 2 years ago
- ☆53Updated 11 months ago
- This open-source repository offers reference code for integrating workplace datastores with Cohere's LLMs, enabling developers and busine…☆150Updated 7 months ago
- ☆156Updated 10 months ago
- an implementation of Self-Extend, to expand the context window via grouped attention☆119Updated last year
- ☆151Updated 5 months ago
- The code for the paper ROUTERBENCH: A Benchmark for Multi-LLM Routing System☆118Updated 11 months ago
- A specification for OpenInference, a semantic mapping of ML inferences☆46Updated last year
- Collection of recipes aiding Gen AI model development☆106Updated last week
- ☆66Updated 11 months ago
- Inference server benchmarking tool☆59Updated 3 weeks ago
- ☆50Updated 5 months ago
- Repo hosting codes and materials related to speeding LLMs' inference using token merging.☆36Updated last year