☆16Nov 24, 2025Updated 5 months ago
Alternatives and similar repositories for vllm
Users that are interested in vllm are comparing it to the libraries listed below.
- This is a fork of SGLang for hip-attention integration. Please refer to hip-attention for detail.☆18Mar 31, 2026Updated last month
- A minimal re-implementation of orthogonal fine-tuning (OFT), a diffusion method, for LLMs. Based on nanoGPT and minLoRA.☆14Nov 17, 2023Updated 2 years ago
- Video scrubbing with WebCodecs☆15Nov 4, 2025Updated 5 months ago
- replacement of AdamW and Lion optimizer for LLMs☆13May 28, 2023Updated 2 years ago
- code for training and using chess embeddings models☆13Jun 9, 2024Updated last year
- Collection of ASR models for English TFLite models for faster inference.☆14Feb 21, 2022Updated 4 years ago
- ☆11Dec 11, 2024Updated last year
- Repository for Sparse Finetuning of LLMs via modified version of the MosaicML llmfoundry☆43Jan 15, 2024Updated 2 years ago
- [ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models☆11Dec 13, 2023Updated 2 years ago
- SignalR client for mini programs☆12Mar 21, 2019Updated 7 years ago
- ☆14Nov 15, 2024Updated last year
- ArcticTraining is a framework designed to simplify and accelerate the post-training process for large language models (LLMs)☆282Apr 24, 2026Updated last week
- ☆15Mar 6, 2021Updated 5 years ago
- An Official Repo of CVPR '20 "MSeg: A Composite Dataset for Multi-Domain Segmentation"☆16Aug 23, 2020Updated 5 years ago
- ☆13Oct 8, 2023Updated 2 years ago
- Fast and memory-efficient exact attention☆21Apr 10, 2026Updated 3 weeks ago
- ☆17Nov 26, 2024Updated last year
- cursor logs with gpt-4o using litellm proxy☆14Sep 9, 2025Updated 7 months ago
- An in-memory compressed cache for gigabytes of data written in Go.☆19Feb 6, 2023Updated 3 years ago
- This is for ASP.NET SignalR, not ASP.NET CORE SignalR☆12Jun 6, 2017Updated 8 years ago
- Contains a refined version of a vector trace of the Laughing Man logo from the anime "Ghost In The Shell - Stand Alone Complex" I did in …☆20Jan 8, 2018Updated 8 years ago
- Multi-Modal Language Modeling with Image, Audio and Text Integration, included multi-images and multi-audio in a single multiturn.☆18Feb 20, 2024Updated 2 years ago
- This Repository contains the BitCoin Stock Price Prediction using LSTM Project.☆11Jul 13, 2020Updated 5 years ago
- Go wrapper for FUSE C low-level API.☆20Mar 19, 2026Updated last month
- ☆22Nov 9, 2024Updated last year
- ☆13Jul 9, 2021Updated 4 years ago
- MMLU eval for RU/EN☆16Jul 31, 2023Updated 2 years ago
- butter is a btrfs snapshot manager.☆21Jun 4, 2018Updated 7 years ago
- Parses 3d mesh data from Starfox rom☆18Feb 15, 2025Updated last year
- Fine-tuning llama2 using the chain-of-thought approach☆10Nov 18, 2023Updated 2 years ago
- Repository for centralised, re-usable Terraform modules☆13Updated this week
- Script Shortcut Addon for Blender☆21Dec 16, 2025Updated 4 months ago
- mcrouter as a docker image☆17Mar 3, 2015Updated 11 years ago
- Quickstart to Cilium☆17Oct 1, 2025Updated 7 months ago
- [EXPERIMENTAL] three.js loader for shaders created with Shade app for iOS☆29Oct 4, 2021Updated 4 years ago
- Sample CloudFormation template to create spot fleet request☆11Mar 23, 2016Updated 10 years ago
- ☆23Aug 26, 2024Updated last year
- Korean Abstract Meaning Representation (AMR) Corpus☆10Feb 27, 2022Updated 4 years ago
- ☆51Sep 3, 2025Updated 7 months ago