fixie-ai / ai-benchmarks
Benchmarking suite for popular AI APIs
☆74Updated last week
Related projects ⓘ
Alternatives and complementary repositories for ai-benchmarks
- Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform☆81Updated 2 months ago
- Self-host LLMs with vLLM and BentoML☆72Updated last week
- an implementation of Self-Extend, to expand the context window via grouped attention☆118Updated 10 months ago
- Just a bunch of benchmark logs for different LLMs☆114Updated 3 months ago
- Tutorial for building LLM router☆159Updated 3 months ago
- Website with current metrics on the fastest AI models.☆34Updated 2 weeks ago
- ☆148Updated 3 months ago
- Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs☆160Updated last week
- Routing on Random Forest (RoRF)☆83Updated last month
- ☆64Updated 5 months ago
- A collection of all available inference solutions for the LLMs☆72Updated last month
- A pipeline for LLM knowledge distillation☆77Updated 3 months ago
- Data preparation code for Amber 7B LLM☆82Updated 6 months ago
- Collection of recipes aiding Gen AI model development☆83Updated this week
- Benchmark suite for LLMs from Fireworks.ai☆58Updated this week
- ☆200Updated 9 months ago
- GPU accelerated client-side embeddings for vector search, RAG etc.☆63Updated 11 months ago
- An experimental and alternative approach to Finetuning and RAG.☆35Updated 11 months ago
- ☆114Updated 6 months ago
- Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI☆221Updated 6 months ago
- vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs☆89Updated this week
- ☆35Updated last year
- Python client library for improving your LLM app accuracy☆96Updated this week
- Self-hosted LLM chatbot arena, with yourself as the only judge☆36Updated 9 months ago
- Solving data for LLMs - Create quality synthetic datasets!☆136Updated 3 weeks ago
- Fast parallel LLM inference for MLX☆145Updated 4 months ago
- Vector Database with support for late interaction and token level embeddings.☆53Updated last month
- Commit0: Library Generation from Scratch☆97Updated last week
- A simple Python sandbox for helpful LLM data agents☆162Updated 4 months ago
- 🕹️ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models.☆133Updated 3 months ago