philschmid / llmperf
LLMPerf is a library for validating and benchmarking LLMs
☆10Updated 3 months ago
Related projects ⓘ
Alternatives and complementary repositories for llmperf
- Large Language Model Hosting Container☆78Updated 3 weeks ago
- ☆12Updated 6 months ago
- A do-framework project to simplify deployment of Kubeflow on Amazon EKS☆18Updated 7 months ago
- Codebase accompanying the Summary of a Haystack paper.☆72Updated 2 months ago
- Multi-Turn Chatbot with GPT-Neo and SageMaker: A conversational AI system for engaging and informative interactions with users.☆7Updated last year
- Easy, fast and very cheap training and inference on AWS Trainium and Inferentia chips.☆209Updated this week
- ☆48Updated 2 weeks ago
- The code for the paper ROUTERBENCH: A Benchmark for Multi-LLM Routing System☆93Updated 5 months ago
- ☆37Updated 2 weeks ago
- ☆73Updated 10 months ago
- ☆64Updated 4 months ago
- Self-host LLMs with vLLM and BentoML☆74Updated last week
- Scalable Meta-Evaluation of LLMs as Evaluators☆41Updated 9 months ago
- Build Agentic workflows with function calling☆20Updated this week
- Tutorial to get started with SkyPilot!☆56Updated 6 months ago
- experiments with inference on llama☆105Updated 5 months ago
- ☆100Updated 2 months ago
- Open Implementations of LLM Analyses☆94Updated last month
- Streamlit app for recommending eval functions using prompt diffs☆25Updated 10 months ago
- ☆41Updated 9 months ago
- ☆47Updated 2 months ago
- Ready-to-go containerized RAG service. Implemented with text-embedding-inference + Qdrant/LanceDB.☆50Updated 8 months ago
- A generative AI-powered framework for testing virtual agents.☆108Updated last month
- ☆20Updated 10 months ago
- Experimental Code for StructuredRAG: Structured Outputs in Retrieval-Augmented Generation☆94Updated this week
- ☆75Updated 5 months ago
- ☆15Updated 5 months ago
- A simple Streamlit application that helps visualize document chunks and queries in embedding space 🗺️🔍☆12Updated 2 months ago
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆64Updated last month
- 🔎 A deep-dive into HyDE for Advanced LLM RAG + 💡 Introducing AutoHyDE, a semi-supervised framework to improve the effectiveness, covera…☆29Updated 7 months ago