philschmid / llmperf
LLMPerf is a library for validating and benchmarking LLMs
☆10Updated 5 months ago
Alternatives and similar repositories for llmperf:
Users who are interested in llmperf are comparing it to the libraries listed below:
- Large Language Model Hosting Container☆80Updated last week
- Easy, fast and very cheap training and inference on AWS Trainium and Inferentia chips.☆217Updated this week
- Explore the use of DSPy for extracting features from PDFs 🔎☆38Updated 10 months ago
- ☆10Updated 7 months ago
- A generative AI-powered framework for testing virtual agents.☆155Updated 3 weeks ago
- ☆15Updated 7 months ago
- ☆50Updated last month
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆34Updated last month
- ☆12Updated last week
- ☆51Updated 4 months ago
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer☆40Updated 9 months ago
- Ready-to-go containerized RAG service. Implemented with text-embedding-inference + Qdrant/LanceDB.☆55Updated 3 weeks ago
- ☆40Updated 2 months ago
- ☆30Updated 6 months ago
- Deploy your HPC Cluster on AWS in 20 min. with just 1-Click.☆53Updated 4 months ago
- ☆60Updated last month
- 🐜🔧 A minimalistic tool to fine-tune your LLMs☆17Updated last year
- ☆18Updated 3 months ago
- Examples showing use of NGC containers and models within Amazon SageMaker☆17Updated 2 years ago
- PyTorch implementation for MRL☆18Updated 10 months ago
- experiments with inference on llama☆104Updated 7 months ago
- A do-framework project to simplify deployment of Kubeflow on Amazon EKS☆20Updated last week
- 🤗 HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but for Vertex AI and unofficial)☆17Updated 10 months ago
- Train, tune, and infer Bamba model☆76Updated this week
- Leverage your LangChain trace data for fine tuning☆40Updated 5 months ago
- ☆24Updated 10 months ago
- Example code for AWS Neuron SDK developers building inference and training applications☆132Updated this week
- Applying Evaluation Driven Development (EDD) to aid in the design decision of RAG pipelines☆31Updated last year
- ☆41Updated 10 months ago
- Supervised instruction finetuning for LLM with HF trainer and Deepspeed☆34Updated last year