Self-host LLMs with LMDeploy and BentoML
☆22Dec 26, 2025Updated 2 months ago
Alternatives and similar repositories for BentoLMDeploy
Users that are interested in BentoLMDeploy are comparing it to the libraries listed below
Sorting:
- ☆20Jun 9, 2025Updated 8 months ago
- An Educational Framework Based on PyTorch for Deep Learning Education and Exploration☆10Dec 24, 2023Updated 2 years ago
- Code for running experiments and benchmarking on GNNExplainer: Generating Explanations for Graph Neural Networks☆15May 8, 2021Updated 4 years ago
- ☆12Sep 19, 2022Updated 3 years ago
- ☆11May 16, 2025Updated 9 months ago
- Recent papers on Graph Neural Networks-based Recommender System.☆12Aug 21, 2023Updated 2 years ago
- Integrating neurosymbolic representations into LLMs for interpretability, steering, and running symbolic algorithms☆14Feb 2, 2026Updated 3 weeks ago
- Research sources on graph-based anomaly detection☆13Nov 29, 2022Updated 3 years ago
- ☆14Jun 10, 2025Updated 8 months ago
- ☆16Feb 22, 2025Updated last year
- [ACL 2023] Counterspeeches up my sleeve! Intent Distribution Learning and Persistent Fusion for Intent-Conditioned Counterspeech Generati…☆10Sep 23, 2023Updated 2 years ago
- ComfyUI-VRAM-Manager is an independent memory management custom node for ComfyUI. Provides Distorch memory management functionality for e…☆21Jan 23, 2026Updated last month
- A RAG that can scale 🧑🏻💻☆11May 28, 2024Updated last year
- Experiments Notebook of "Understanding the Skill Gap in Recurrent Language Models: The Role of the Gather-and-Aggregate Mechanism"☆14Apr 30, 2025Updated 10 months ago
- Prompt templates for language models☆10Feb 22, 2026Updated last week
- 从零开始,系统掌握 Anthropic Claude 的核心能力与最佳实践☆22Updated this week
- In this project work, the main motive is to build a deep learning model to detect air pollution from real-time images. In order to achiev…☆12Oct 24, 2021Updated 4 years ago
- API serving for your diffusers models☆11Jan 19, 2024Updated 2 years ago
- Frequency domain (Fast Fourier Transform) and time-frequency (wavelet transform) feature extraction from Electrocardiogram (ECG) data.☆11Apr 30, 2022Updated 3 years ago
- ☆12Apr 24, 2024Updated last year
- ☆10Feb 21, 2023Updated 3 years ago
- ☆15Apr 26, 2025Updated 10 months ago
- Text Classification Dataset for Turkish Language☆10Nov 16, 2021Updated 4 years ago
- Implementation of Reinforce for educational purposes.☆12Jun 12, 2023Updated 2 years ago
- Discover the simplicity and efficiency of Void Linux on your Android device with VoidMagic! 🚀☆10Dec 11, 2023Updated 2 years ago
- zero shot NER fine tuning☆14Mar 17, 2025Updated 11 months ago
- A library for simplifying training with multi gpu setups in the HuggingFace / PyTorch ecosystem.☆16Jan 9, 2026Updated last month
- A curated list for Efficient Large Language Models☆11Mar 25, 2024Updated last year
- CRUD Word documents with Python☆13Feb 5, 2026Updated 3 weeks ago
- Clustering using Deep Learning (T-SNE visualization of autoencoder embeddings )☆10Mar 3, 2019Updated 6 years ago
- how to build a sentence embedding application using BentoML☆14Mar 31, 2025Updated 11 months ago
- ☆11Nov 16, 2023Updated 2 years ago
- Mining User-aware Multi-relations for Fake News Detection in Large Scale Online Social Networks (WSDM 2023)☆13Jan 4, 2023Updated 3 years ago
- Ruby on Rails template engine that allows for multiple formats being laid out in a single specification.☆13Jan 28, 2013Updated 13 years ago
- 本项目旨在构建一套多场景下可复用的辅助决策型智能 Agent 系统。通过提取用户输入的关键信息,结合历史数据进行智能匹配,系统可在教育路径、法律咨询、金融投资、心理健康、企业经营、供应链优化、危机应对、智能客服等多个领域提供个性化决策建议。系统采用统一的决策流程设计,具备高…☆20Jul 22, 2025Updated 7 months ago
- A sd-webui extension for utilizing DanTagGen to "upsample prompts".☆13Jun 13, 2024Updated last year
- API server for F5-TTS☆20Jan 24, 2026Updated last month
- Deep Learning with Multiple Objectives: 2021 edition☆10May 27, 2021Updated 4 years ago
- Deepseek-CoT☆10Oct 6, 2024Updated last year