fixie-ai / ai-benchmarks
Benchmarking suite for popular AI APIs
☆82Updated last month
Alternatives and similar repositories for ai-benchmarks:
Users that are interested in ai-benchmarks are comparing it to the libraries listed below
- Website with current metrics on the fastest AI models.☆40Updated 4 months ago
- Benchmark suite for LLMs from Fireworks.ai☆70Updated last month
- An OpenAI Completions API compatible server for NLP transformers models☆64Updated last year
- Tutorial for building LLM router☆189Updated 8 months ago
- Self-host LLMs with vLLM and BentoML☆97Updated this week
- Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform☆84Updated 2 weeks ago
- Check for data drift between two OpenAI multi-turn chat jsonl files.☆37Updated 11 months ago
- ☆152Updated 8 months ago
- Data preparation code for Amber 7B LLM☆86Updated 10 months ago
- The code for the paper ROUTERBENCH: A Benchmark for Multi-LLM Routing System☆111Updated 9 months ago
- vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs☆87Updated this week
- ☆52Updated 11 months ago
- GPU accelerated client-side embeddings for vector search, RAG etc.☆66Updated last year
- Collection of recipes aiding Gen AI model development☆100Updated 2 weeks ago
- A list of LLM benchmark frameworks.☆65Updated last year
- Visualize expert firing frequencies across sentences in the Mixtral MoE model☆17Updated last year
- ☆84Updated last year
- Natural Language Interfaces Powered by LLMs☆90Updated 8 months ago
- A framework for evaluating function calls made by LLMs☆37Updated 8 months ago
- Chat Markup Language conversation library☆55Updated last year
- DSPY on action with OpenSource LLMs.☆68Updated 11 months ago
- Mixing Language Models with Self-Verification and Meta-Verification☆102Updated 3 months ago
- Score LLM pretraining data with classifiers☆54Updated last year
- This project showcases an LLMOps pipeline that fine-tunes a small-size LLM model to prepare for the outage of the service LLM.☆302Updated last month
- AI Assistant running within your browser.☆62Updated 3 months ago
- an implementation of Self-Extend, to expand the context window via grouped attention☆118Updated last year
- Just a bunch of benchmark logs for different LLMs☆119Updated 8 months ago
- Cray-LM unified training and inference stack.☆21Updated 2 months ago
- Simple examples using Argilla tools to build AI☆53Updated 4 months ago
- Repo hosting codes and materials related to speeding LLMs' inference using token merging.☆35Updated 11 months ago