philschmid / llmperf

LLMPerf is a library for validating and benchmarking LLMs.

☆10

Alternatives and similar repositories for llmperf:

Users who are interested in llmperf are comparing it to the libraries listed below.

awslabs / llm-hosting-container
Large Language Model Hosting Container
☆80 Updated last week
huggingface / optimum-neuron
Easy, fast and very cheap training and inference on AWS Trainium and Inferentia chips.
☆217 Updated this week
S1M0N38 / dspy-arxiv
Explore the use of DSPy for extracting features from PDFs 🔎
☆38 Updated 10 months ago
philschmid / huggingface-inferentia2-samples
☆10 Updated 7 months ago
awslabs / agent-evaluation
A generative AI-powered framework for testing virtual agents.
☆155 Updated 3 weeks ago
ai-hero / llm-research-fine-tuning
☆15 Updated 7 months ago
awslabs / extending-the-context-length-of-open-source-llms
☆50 Updated last month
IlyasMoutawwakil / py-txi
A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.
☆34 Updated last month
aws-samples / evaluating-large-language-models-using-llm-as-a-judge
☆12 Updated last week
datacommonsorg / llm-tools
☆51 Updated 4 months ago
tanyuqian / cappy
NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer
☆40 Updated 9 months ago
plaggy / rag-containers
Ready-to-go containerized RAG service. Implemented with text-embedding-inference + Qdrant/LanceDB.
☆55 Updated 3 weeks ago
aws-samples / llm-evaluation-methodology
☆40 Updated 2 months ago
patronus-ai / Lynx-hallucination-detection
☆30 Updated 6 months ago
Stability-AI / stability-hpc
Deploy your HPC cluster on AWS in 20 minutes with just one click.
☆53 Updated 4 months ago
cohere-ai / cohere-aws
☆60 Updated last month
mlabonne / tinytuner
🐜🔧 A minimalistic tool to fine-tune your LLMs
☆17 Updated last year
lancedb / ragged
☆18 Updated 3 months ago
aws-samples / amazon-sagemaker-nvidia-ngc-examples
Examples showing use of NGC containers and models within Amazon SageMaker
☆17 Updated 2 years ago
krypticmouse / matryoshka-representation-learning
PyTorch implementation for MRL
☆18 Updated 10 months ago
hamelsmu / llama-inference
experiments with inference on llama
☆104 Updated 7 months ago
aws-samples / aws-do-kubeflow
A do-framework project to simplify deployment of Kubeflow on Amazon EKS
☆20 Updated last week
alvarobartt / vertex-ai-huggingface-inference-toolkit
🤗 HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but for Vertex AI and unofficial)
☆17 Updated 10 months ago
foundation-model-stack / bamba
Train, tune, and infer Bamba model
☆76 Updated this week
parlance-labs / langfree
Leverage your LangChain trace data for fine tuning
☆40 Updated 5 months ago
aws-samples / training-llm-on-sagemaker-for-multiple-nodes-with-deepspeed
☆24 Updated 10 months ago
aws-neuron / aws-neuron-samples
Example code for AWS Neuron SDK developers building inference and training applications
☆132 Updated this week
wenqiglantz / edd-recursive-doc-agent-vs-metadata-replacement
Applying Evaluation Driven Development (EDD) to aid in the design decision of RAG pipelines
☆31 Updated last year
aws-samples / generative-ai-workshop-build-a-multifunctional-chatbot-on-sagemaker
☆41 Updated 10 months ago
explodinggradients / Funtuner
Supervised instruction finetuning for LLM with HF trainer and Deepspeed
☆34 Updated last year