☆16 · Nov 24, 2025 · Updated 6 months ago
Alternatives and similar repositories for vllm
Users that are interested in vllm are comparing it to the libraries listed below.
- Documentation for vLLM Dev Channel releases ☆10 · Dec 5, 2024 · Updated last year
- A minimal re-implementation of orthogonal fine-tuning (OFT), a diffusion method, for LLMs. Based on nanoGPT and minLoRA. ☆14 · Nov 17, 2023 · Updated 2 years ago
- Video scrubbing with WebCodecs ☆15 · Nov 4, 2025 · Updated 7 months ago
- Modification of SOMPY repo with robust K-means clustering (bootstrapped SSE elbow method) ☆13 · Apr 6, 2019 · Updated 7 years ago
- replacement of AdamW and Lion optimizer for LLMs ☆13 · May 28, 2023 · Updated 3 years ago
- fast opus bindings for node and browsers ☆15 · Feb 11, 2024 · Updated 2 years ago
- code for training and using chess embeddings models ☆14 · Jun 9, 2024 · Updated 2 years ago
- Collection of ASR models for English TFLite models for faster inference. ☆14 · Feb 21, 2022 · Updated 4 years ago
- meson android build PoC ☆11 · Oct 29, 2019 · Updated 6 years ago
- Repository for Sparse Finetuning of LLMs via modified version of the MosaicML llmfoundry ☆43 · Jan 15, 2024 · Updated 2 years ago
- A MetaMask-fork to support pluggable identity contracts. ☆11 · Dec 30, 2022 · Updated 3 years ago
- ☆11 · Jun 26, 2017 · Updated 8 years ago
- [ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models ☆11 · Dec 13, 2023 · Updated 2 years ago
- Gradient-based Hyperparameter Optimization Over Long Horizons ☆14 · Sep 29, 2021 · Updated 4 years ago
- Minimal, highly available (HA) Kubernetes cluster on Hetzner Cloud, up and running in under 10 minutes. ☆11 · Apr 23, 2026 · Updated last month
- ArcticTraining is a framework designed to simplify and accelerate the post-training process for large language models (LLMs) ☆285 · Updated this week
- OpenNNFX a community based open source repository of all things No Nonsense Forex ☆19 · Apr 22, 2023 · Updated 3 years ago
- Fast and memory-efficient exact attention ☆21 · Apr 10, 2026 · Updated 2 months ago
- Music generation using Elementary Cellular Automata. ☆13 · Nov 23, 2015 · Updated 10 years ago
- Proxy for OpenAI ☆16 · Sep 2, 2025 · Updated 9 months ago
- ☆17 · Nov 26, 2024 · Updated last year
- cursor logs with gpt-4o using litellm proxy ☆14 · Sep 9, 2025 · Updated 9 months ago
- Multivariate Bayesian Structural Time Series in Stan ☆13 · Apr 13, 2020 · Updated 6 years ago
- ☆15 · Mar 6, 2021 · Updated 5 years ago
- Official implementation of the paper "Distilling a Pretrained Language Model to a Multilingual ASR Model" (Interspeech 2022) ☆12 · Mar 12, 2024 · Updated 2 years ago
- Contains a refined version of a vector trace of the Laughing Man logo from the anime "Ghost In The Shell - Stand Alone Complex" I did in … ☆20 · Jan 8, 2018 · Updated 8 years ago
- Kubernetes deployment strategies from "DB Schemas & Kubernetes Rollouts" blogpost ☆20 · May 8, 2019 · Updated 7 years ago
- Multi-Modal Language Modeling with Image, Audio and Text Integration, included multi-images and multi-audio in a single multiturn. ☆18 · Feb 20, 2024 · Updated 2 years ago
- 🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨ ☆18 · May 20, 2025 · Updated last year
- This Repository contains the BitCoin Stock Price Prediction using LSTM Project. ☆11 · Jul 13, 2020 · Updated 5 years ago
- ☆22 · Nov 9, 2024 · Updated last year
- A fork of the PEFT library, supporting Robust Adaptation (RoSA) ☆15 · Aug 16, 2024 · Updated last year
- The official repository for AdaMuon ☆39 · Aug 27, 2025 · Updated 9 months ago
- Tensor parallelism is all you need. Run LLMs on an AI cluster at home using any device. Distribute the workload, divide RAM usage, and in… ☆18 · Nov 11, 2024 · Updated last year
- ☆13 · Jul 9, 2021 · Updated 4 years ago
- MMLU eval for RU/EN ☆16 · Jul 31, 2023 · Updated 2 years ago
- butter is a btrfs snapshot manager. ☆21 · Jun 4, 2018 · Updated 8 years ago
- Triton backend for managing the model state tensors automatically in sequence batcher ☆17 · Feb 12, 2024 · Updated 2 years ago
- Parses 3d mesh data from Starfox rom ☆19 · Feb 15, 2025 · Updated last year