Benchmark and optimize LLM inference across frameworks with ease
☆183Sep 12, 2025Updated 7 months ago
Alternatives and similar repositories for llm-optimizer
Users that are interested in llm-optimizer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Streamlit MongoDB Connector: An efficient connector for interfacing MongoDB with Streamlit apps, developed for the Streamlit Connections …☆11Dec 19, 2023Updated 2 years ago
- Genai-bench is a powerful benchmark tool designed for comprehensive token-level performance evaluation of large language model (LLM) serv…☆294Apr 23, 2026Updated last week
- rudradb-opin-examples is for example implementations of the pip install rudradb-opin☆29Mar 3, 2026Updated 2 months ago
- Open-source asset-liability model.☆26Jul 31, 2025Updated 9 months ago
- I built this project because there was no user friendly way to upload a file to a dockerized flask web form and have whisper do its thing…☆12Jul 28, 2025Updated 9 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- This data analytics degree program focuses on both theory and application, “learn by doing” as you complete data science and analytics pr…☆10Nov 15, 2024Updated last year
- Fine-tune FLUX 1.dev for personal AI photos☆22Sep 4, 2024Updated last year
- ScrollNet for Continual Learning☆11Sep 11, 2023Updated 2 years ago
- AI for a cure, a combination of Latent-GAN and VAE-JTNN to create 100% valid drug like molecules☆10Mar 16, 2020Updated 6 years ago
- 🚀 This repo is a showcase of how you can use models deployed on AWS SageMaker in your Haystack Retrieval Augmented Generative AI pipelin…☆13Jul 27, 2023Updated 2 years ago
- ☆13Oct 9, 2023Updated 2 years ago
- OpenResty ENV cache☆12Nov 16, 2017Updated 8 years ago
- Train text generation model with JavaScript.☆15Jul 14, 2024Updated last year
- ☆28Nov 10, 2025Updated 5 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆16Apr 28, 2024Updated 2 years ago
- Train transformer language models with reinforcement learning.☆19Feb 25, 2025Updated last year
- 금융 도메인에 특화된 한국어 임베딩 모델☆23Aug 8, 2024Updated last year
- Utility which provides a UI to do prompt engineering within SageMaker Studio.☆14Jul 5, 2023Updated 2 years ago
- ☆61Dec 2, 2024Updated last year
- Compositional Dietary Nutrition Ontology☆14Feb 15, 2026Updated 2 months ago
- ☆40Mar 30, 2026Updated last month
- Demo for ci/cd docker in aws ECS☆11Sep 20, 2018Updated 7 years ago
- CMP314 Optimizing NLP models with Amazon EC2 Inf1 instances in Amazon Sagemaker☆14Dec 20, 2023Updated 2 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- This repo contains detailed implementation information about Anthropic's paired prompts approach for evaluating political neutrality.☆128Nov 13, 2025Updated 5 months ago
- High-performance, ergonomic Model Context Protocol (MCP) implementation in Rust☆13Jul 15, 2025Updated 9 months ago
- ☆13Apr 25, 2026Updated last week
- ☆27Aug 16, 2025Updated 8 months ago
- ☆30Apr 22, 2026Updated last week
- Linear Relational Embeddings (LREs) and Linear Relational Concepts (LRCs) for LLMs in PyTorch☆10Aug 7, 2024Updated last year
- Node.js module for the CLIP model.☆15Sep 16, 2024Updated last year
- Enemies for your LLM☆35Jan 20, 2026Updated 3 months ago
- DeepSeek-R1 JavaScript starter☆14Feb 16, 2026Updated 2 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Make reasoning models scalable☆49May 31, 2025Updated 11 months ago
- This is the code corresponding to my blog post "Generative Adversarial Networks (GANs) for Beginners: Generating Images of Distracted Dri…☆11Feb 5, 2019Updated 7 years ago
- ☆112Apr 19, 2026Updated 2 weeks ago
- Jina VDR is a multilingual, multi-domain benchmark for visual document retrieval☆38Aug 4, 2025Updated 9 months ago
- practice how to build knowledge graph from given text corpus☆33Sep 6, 2025Updated 7 months ago
- Reimplementation of ToMNet with some extensions for RL as well☆14Apr 28, 2018Updated 8 years ago
- A full-featured template / example project for Mapbox Studio using Mapbox Streets vector tiles☆16May 7, 2024Updated last year