Self-host LLMs with LMDeploy and BentoML
☆22Dec 26, 2025Updated 4 months ago
Alternatives and similar repositories for BentoLMDeploy
Users that are interested in BentoLMDeploy are comparing it to the libraries listed below.
- ☆21Jun 9, 2025Updated 11 months ago
- This sample application demonstrates the practical implementation and usage patterns of the LLM Agents library.☆14Sep 7, 2024Updated last year
- Stateful LLM Serving☆102Mar 11, 2025Updated last year
- An LLM leaderboard for stateful agents☆21Oct 16, 2025Updated 7 months ago
- AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.☆16Dec 22, 2025Updated 5 months ago
- This repository provides a flexible and customizable implementation of an advanced conversational AI agent, allowing you to leverage the …☆12Aug 31, 2024Updated last year
- Easy-to-use Retrieval-Enhanced Transformer implementation☆10Sep 30, 2022Updated 3 years ago
- Prompt templates for language models☆10Apr 7, 2026Updated last month
- Langchain + Docker + Neo4j☆10Oct 29, 2024Updated last year
- [EMNLP 2024 Tutorial] Language Agents: Foundations, Prospects, and Risks☆10Nov 27, 2024Updated last year
- [EMNLP'25 Industry] Repo for "Z1: Efficient Test-time Scaling with Code"☆69Apr 11, 2025Updated last year
- An AI assistant for PCs powered by Meta's LLaMA3 using Hugging Face, runs on voice recognition, text-to-speech. Send messages, voice/vide…☆19Jun 6, 2024Updated last year
- Text Classification Dataset for Turkish Language☆10Nov 16, 2021Updated 4 years ago
- Through this project we have comprehensively evaluated 10 workload predictors and determined which predictor works the best for Alibaba C…☆12Dec 5, 2018Updated 7 years ago
- A curated list for Efficient Large Language Models☆11Mar 25, 2024Updated 2 years ago
- Elevate your language models with insightful diversity metrics.☆11Feb 4, 2024Updated 2 years ago
- Source code for ASRU 2019 paper "Adapting Pretrained Transformer to Lattices for Spoken Language Understanding"☆10Jul 8, 2020Updated 5 years ago
- zero shot NER fine tuning☆14Mar 17, 2025Updated last year
- ☆20Apr 24, 2023Updated 3 years ago
- Distributed IO-aware Attention algorithm☆24Sep 24, 2025Updated 7 months ago
- Final training script from HuggingFace Whisper Fine tuning event - to get best results on finetuned model.☆12Dec 24, 2022Updated 3 years ago
- Demonstrate Function Calling code portability across 4 AI Models: OpenAI, AzureOpenAI, VertexAI Gemini and Mistral AI.☆13Jun 7, 2024Updated last year
- ☆13Oct 27, 2021Updated 4 years ago
- Clean, uncluttered fluent APIs in .NET☆24Jun 10, 2020Updated 5 years ago
- Voice agent using LiveKit (orchestration), Cartesia (TTS), OpenAI (LLM), and Deepgram (STT)☆21Oct 28, 2025Updated 6 months ago
- A throughput-oriented high-performance serving framework for LLMs☆959Mar 29, 2026Updated last month
- An Educational Framework Based on PyTorch for Deep Learning Education and Exploration☆11Dec 24, 2023Updated 2 years ago
- Simple Telegram bot to annotate and verify automatic speech recognition datasets☆12Mar 30, 2021Updated 5 years ago
- A RAG that can scale 🧑🏻‍💻☆11May 28, 2024Updated last year
- Schoenfeld’s Anatomy of Mathematical Reasoning by Language Models☆22Dec 21, 2025Updated 5 months ago
- ☆11May 18, 2025Updated last year
- SV-Sim: Scalable PGAS-based State Vector Simulation of Quantum Circuits☆22Feb 2, 2024Updated 2 years ago
- Supplemental materials for The ASPLOS 2025 / EuroSys 2025 Contest on Intra-Operator Parallelism for Distributed Deep Learning☆25May 12, 2025Updated last year
- Interpreting Learned Search and Planning: Reverse-engineering recurrent convolutional networks (DRC) that play Sokoban☆19Jun 29, 2025Updated 10 months ago
- Details of the datasets for Few-shot class-incremental audio classification☆10Dec 6, 2023Updated 2 years ago
- Resources related to EMNLP 2021 paper "FAME: Feature-Based Adversarial Meta-Embeddings for Robust Input Representations"☆13Dec 14, 2021Updated 4 years ago
- 3D visualization of depth maps created by any depth model, such as Monodepth, Packnet, etc.☆12Jun 9, 2020Updated 5 years ago
- Simplified recipes for preparing commonly used speech datasets, and a PyTorch-compatible Python data loader that can perform standard fea…☆16Jun 12, 2023Updated 2 years ago
- Source code for ACL 2020 paper "Learning Spoken Language Representations with Neural Lattice Language Modeling"☆17Feb 11, 2023Updated 3 years ago