Benchmark and optimize LLM inference across frameworks with ease
☆186Sep 12, 2025Updated 8 months ago
Alternatives and similar repositories for llm-optimizer
Users who are interested in llm-optimizer are comparing it to the libraries listed below.
- ☆36Nov 14, 2025Updated 6 months ago
- APEX+ is an LLM Serving Simulator☆46Jun 16, 2025Updated 11 months ago
- Dataset2024☆12Jun 12, 2025Updated 11 months ago
- A Practitioner's Guide to M(eow)ti Turn Agentic ReinfOrcement learning☆82Jan 16, 2026Updated 4 months ago
- Workshop that will take you from Graph Neural Networks (GNNs) to Transformers, architectures which have led to numerous breakthrough achi…☆12Sep 11, 2023Updated 2 years ago
- Evals that meet you where you are. For AI that's grounded.☆68Mar 21, 2026Updated 2 months ago
- [ACL 2026] Feedback-Driven Tool-Use Improvements in Large Language Models via Automated Build Environments☆52Apr 6, 2026Updated last month
- I built this project because there was no user friendly way to upload a file to a dockerized flask web form and have whisper do its thing…☆12Jul 28, 2025Updated 9 months ago
- Fine-tune FLUX 1.dev for personal AI photos☆22Sep 4, 2024Updated last year
- Baker is an AI powered app that helps you find recipes and avoid food waste☆14Jan 4, 2025Updated last year
- AI for a cure, a combination of Latent-GAN and VAE-JTNN to create 100% valid drug like molecules☆10Mar 16, 2020Updated 6 years ago
- 🚀 This repo is a showcase of how you can use models deployed on AWS SageMaker in your Haystack Retrieval Augmented Generative AI pipelin…☆13Jul 27, 2023Updated 2 years ago
- ☆13Oct 9, 2023Updated 2 years ago
- a high-quality, GPU-accelerated image resizer☆14Mar 6, 2025Updated last year
- ☆16Apr 28, 2024Updated 2 years ago
- A unified library for building, evaluating, and storing speculative decoding algorithms for LLM inference in vLLM☆453Updated this week
- Python library for interacting with the Wikibase REST API☆13Sep 4, 2024Updated last year
- ☆11Updated this week
- XPO represents anatomical, cellular, and gene function phenotypes occurring throughout the development of the African frogs Xenopus laevi…☆11Jul 25, 2025Updated 9 months ago
- Evals meant to evaluate language models' ability to reason over long contexts.☆10Sep 12, 2024Updated last year
- A Korean embedding model specialized for the financial domain☆23Aug 8, 2024Updated last year
- ☆61Dec 2, 2024Updated last year
- psychedelia syndrome: the pixels and code of a new kind of videogame☆15May 15, 2026Updated last week
- This repo contains detailed implementation information about Anthropic's paired prompts approach for evaluating political neutrality.☆132Nov 13, 2025Updated 6 months ago
- A simple tutorial for converting CSV to RDF☆10Mar 30, 2016Updated 10 years ago
- ☆27Aug 16, 2025Updated 9 months ago
- ☆31Apr 22, 2026Updated last month
- Category Theory for Quantum Natural Language Processing☆11Feb 22, 2023Updated 3 years ago
- Enemies for your LLM☆36Jan 20, 2026Updated 4 months ago
- Repository for the family history/pedigree project☆13Feb 24, 2026Updated 3 months ago
- Make reasoning models scalable☆49May 31, 2025Updated 11 months ago
- Efficient LLM Inference Acceleration using Prompting☆50Oct 22, 2024Updated last year
- This is the code corresponding to my blog post "Generative Adversarial Networks (GANs) for Beginners: Generating Images of Distracted Dri…☆11Feb 5, 2019Updated 7 years ago
- Jina VDR is a multilingual, multi-domain benchmark for visual document retrieval☆37Aug 4, 2025Updated 9 months ago
- Vector databases for generative AI☆22Apr 23, 2024Updated 2 years ago
- Simplified model deployment on llm-d☆29Jul 2, 2025Updated 10 months ago
- ☆13Jul 18, 2019Updated 6 years ago
- Text Clustering with Python and Dash☆10Mar 16, 2021Updated 5 years ago
- The Intelligent Inference Scheduler for Large-scale Inference Services.☆68Feb 12, 2026Updated 3 months ago