premAI-io / benchmarks
πΉοΈ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models.
β136Updated 6 months ago
Alternatives and similar repositories for benchmarks:
Users that are interested in benchmarks are comparing it to the libraries listed below
- Evaluate and Enhance Your LLM Deployments for Real-World Inference Needsβ191Updated this week
- experiments with inference on llamaβ104Updated 8 months ago
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching oβ¦β121Updated 2 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMsβ259Updated 4 months ago
- β113Updated 4 months ago
- Manage scalable open LLM inference endpoints in Slurm clustersβ252Updated 7 months ago
- Toolkit for attaching, training, saving and loading of new heads for transformer modelsβ262Updated 2 weeks ago
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language Mβ¦β203Updated 3 months ago
- β199Updated last year
- End-to-End LLM Guideβ101Updated 7 months ago
- Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients.β191Updated 7 months ago
- This is our own implementation of 'Layer Selective Rank Reduction'β233Updated 8 months ago
- β207Updated 7 months ago
- β224Updated this week
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for freeβ230Updated 3 months ago
- Lightweight demos for finetuning LLMs. Powered by π€ transformers and open-source datasets.β67Updated 4 months ago
- Low-Rank adapter extraction for fine-tuned transformers modelsβ169Updated 9 months ago
- Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Daβ93Updated 2 months ago
- Set of scripts to finetune LLMsβ36Updated 10 months ago
- ποΈ A unified multi-backend utility for benchmarking Transformers, Timm, PEFT, Diffusers and Sentence-Transformers with full support of Oβ¦β286Updated 2 weeks ago
- A Lightweight Library for AI Observabilityβ233Updated this week
- Data preparation code for Amber 7B LLMβ85Updated 9 months ago
- Accelerating your LLM training to full speed! Made with β€οΈ by ServiceNow Researchβ134Updated this week
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.β82Updated last year
- Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on taskβ¦β147Updated 4 months ago
- Comparison of Language Model Inference Enginesβ204Updated 2 months ago
- A set of scripts and notebooks on LLM finetunning and dataset creationβ103Updated 4 months ago
- Let's build better datasets, together!β252Updated 2 months ago
- Self-host LLMs with vLLM and BentoMLβ87Updated this week
- Lite weight wrapper for the independent implementation of SPLADE++ models for search & retrieval pipelines. Models and Library created byβ¦β29Updated 5 months ago