neuralmagic / guidellm
Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs
☆124Updated this week
Related projects: ⓘ
- ☆75Updated 3 weeks ago
- Tutorial for building LLM router☆144Updated 2 months ago
- awesome synthetic (text) datasets☆213Updated this week
- 🕹️ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models.☆129Updated last month
- experiments with inference on llama☆106Updated 3 months ago
- An Open Source Toolkit For LLM Distillation☆284Updated last month
- Synthetic Data for LLM Fine-Tuning☆78Updated 9 months ago
- Manage scalable open LLM inference endpoints in Slurm clusters☆217Updated 2 months ago
- Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper.☆101Updated last week
- Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on task…☆117Updated 3 weeks ago
- Let's build better datasets, together!☆195Updated last month
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…☆192Updated 4 months ago
- ☆201Updated 7 months ago
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.☆81Updated last year
- AWM: Agent Workflow Memory☆121Updated this week
- Formatron empowers everyone to control the format of language models' output with minimal overhead.☆116Updated this week
- RAGElo is a set of tools that helps you selecting the best RAG-based LLM agents by using an Elo ranker☆101Updated last week
- Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform☆80Updated 3 weeks ago
- Python SDK for experimenting, testing, evaluating & monitoring LLM-powered applications - Parea AI (YC S23)☆72Updated last week
- Baguetter is a flexible, efficient, and hackable search engine library implemented in Python. It's designed for quickly benchmarking, imp…☆134Updated 2 weeks ago
- Self-host LLMs with vLLM and BentoML☆61Updated this week
- A stable, fast and easy-to-use inference library with a focus on a sync-to-async API☆43Updated last month
- Doing simple retrieval from LLM models at various context lengths to measure accuracy☆93Updated 5 months ago
- Toolkit for attaching, training, saving and loading of new heads for transformer models☆236Updated last week
- Just a bunch of benchmark logs for different LLMs☆112Updated last month
- Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI☆223Updated 4 months ago
- ☆126Updated 2 months ago
- A simple Python sandbox for helpful LLM data agents☆143Updated 3 months ago
- ☆64Updated 3 months ago
- an implementation of Self-Extend, to expand the context window via grouped attention☆117Updated 8 months ago