Self-host LLMs with LMDeploy and BentoML
☆22 Dec 26, 2025 Updated 5 months ago
Alternatives and similar repositories for BentoLMDeploy
Users that are interested in BentoLMDeploy are comparing it to the libraries listed below.
- ☆21 Jun 9, 2025 Updated last year
- MultiPaxos and Disk Paxos in TLA+ and PlusCal ☆13 Jan 23, 2023 Updated 3 years ago
- An LLM leaderboard for stateful agents ☆21 Oct 16, 2025 Updated 7 months ago
- ☆15 Apr 26, 2025 Updated last year
- Paper-reading notes for Berkeley OS prelim exam. ☆14 Aug 28, 2024 Updated last year
- Prompt templates for language models ☆10 Apr 7, 2026 Updated 2 months ago
- Langchain + Docker + Neo4j ☆10 Oct 29, 2024 Updated last year
- [EMNLP 2024 Tutorial] Language Agents: Foundations, Prospects, and Risks ☆10 Nov 27, 2024 Updated last year
- Source code analysis of Impala, PostgreSQL, Citus and Postgres-XL ☆13 Jan 16, 2017 Updated 9 years ago
- [EMNLP'25 Industry] Repo for "Z1: Efficient Test-time Scaling with Code" ☆69 Apr 11, 2025 Updated last year
- Simple image compression/decompression algorithm using DWT (discrete wavelet transform) and RLE+Huffman encoding. ☆11 Nov 3, 2013 Updated 12 years ago
- A TLA+ formalization of the algorithm described in "Paxos Made Simple" ☆21 Jan 28, 2025 Updated last year
- how to build a sentence embedding application using BentoML ☆15 Mar 31, 2025 Updated last year
- zero shot NER fine tuning ☆14 Mar 17, 2025 Updated last year
- Distributed IO-aware Attention algorithm ☆24 Sep 24, 2025 Updated 8 months ago
- Final training script from HuggingFace Whisper Fine tuning event - to get best results on finetuned model. ☆12 Dec 24, 2022 Updated 3 years ago
- Sentence Embedding as a Service ☆15 Jun 30, 2025 Updated 11 months ago
- ☆26 Jun 5, 2026 Updated last week
- Demonstrate Function Calling code portability across 4 AI Models: OpenAI, AzureOpenAI, VertexAI Gemini and Mistral AI. ☆13 Jun 7, 2024 Updated 2 years ago
- Interface Design for Self-Supervised Speech Models, Accepted to Interspeech2024 ☆16 Nov 19, 2024 Updated last year
- ☆13 Oct 27, 2021 Updated 4 years ago
- Integrating neurosymbolic representations into LLMs for interpretability, steering, and running symbolic algorithms ☆14 Feb 2, 2026 Updated 4 months ago
- Voice agent using LiveKit (orchestration), Cartesia (TTS), OpenAI (LLM), and Deepgram (STT) ☆21 Jun 3, 2026 Updated last week
- The driver for LMCache core to run in vLLM ☆66 Feb 4, 2025 Updated last year
- A throughput-oriented high-performance serving framework for LLMs ☆964 Mar 29, 2026 Updated 2 months ago
- An Educational Framework Based on PyTorch for Deep Learning Education and Exploration ☆11 Dec 24, 2023 Updated 2 years ago
- Simple Telegram bot to annotate and verify automatic speech recognition datasets ☆12 Mar 30, 2021 Updated 5 years ago
- A RAG that can scale 🧑🏻💻 ☆11 May 28, 2024 Updated 2 years ago
- Schoenfeld’s Anatomy of Mathematical Reasoning by Language Models ☆26 Dec 21, 2025 Updated 5 months ago