A high-throughput and memory-efficient inference and serving engine for LLMs
☆34Mar 21, 2024Updated last year
Alternatives and similar repositories for vllm
Users that are interested in vllm are comparing it to the libraries listed below
Sorting:
- Official repository of Evolutionary Optimization of Model Merging Recipes☆1,403Nov 29, 2024Updated last year
- Code for Discovering Preference Optimization Algorithms with and for Large Language Models☆192Jun 13, 2024Updated last year
- CycleQD is a framework for parameter space model merging.☆48Feb 1, 2025Updated last year
- A quick Crew AI tutorial☆23May 9, 2024Updated last year
- Apps related to Odoo it's planning features☆14Jun 13, 2024Updated last year
- ☆17Jun 7, 2023Updated 2 years ago
- ☆14Dec 5, 2025Updated 2 months ago
- ☆11Jun 13, 2023Updated 2 years ago
- ☆14Aug 7, 2025Updated 6 months ago
- connect gpt to gsheet☆12Mar 2, 2024Updated 2 years ago
- ☆13Dec 16, 2022Updated 3 years ago
- ☆16Mar 19, 2024Updated last year
- [TIP'20] Official Implementation of MEF-SSIMd☆32Nov 7, 2019Updated 6 years ago
- [RSS 2023] RoboNinja: Learning an Adaptive Cutting Policy for Multi-Material Objects☆40May 1, 2023Updated 2 years ago
- NodeJS client for communicating with the DhanHQ API.☆16Sep 20, 2024Updated last year
- to analyze martial arts motion with CMU OpenPose. License depends on OpenPose.☆11Mar 28, 2019Updated 6 years ago
- Code for the paper "RIR-in-a-Box : Estimating Room Acoustics from 3D Mesh Data through Shoebox Approximation" presented at Interspeech 20…☆15Sep 1, 2024Updated last year
- This is a FullStack Airbnb-Clone made with nextjs , prisma and mongodb. My personal Portfolio Website https://phantomgod-dev.netlify.app☆10Jun 4, 2024Updated last year
- Iot BeeHive Monitoring using Balena.io☆12Mar 7, 2022Updated 3 years ago
- ☆11Dec 16, 2023Updated 2 years ago
- Online Chat App using React JS☆10Feb 19, 2026Updated last week
- dynamic mesoscopic traffic simulation☆12Apr 12, 2022Updated 3 years ago
- Cpp and GCode for Programmable Digital Weaves☆11Nov 8, 2023Updated 2 years ago
- API server for interacting with decentralised identity functionality on the cheqd Network☆11Updated this week
- Rhino Compute extension for Nvidia Omniverse☆10Mar 10, 2023Updated 2 years ago
- Official code for AL-PINNS: Augmented Lagrangian relaxation method for Physics-Informed Neural Networks☆12Jul 29, 2023Updated 2 years ago
- ☆16Updated this week
- Command line pastebin for sharing terminal output.☆11Jul 29, 2021Updated 4 years ago
- TransientViT: A novel CNN - Vision Transformer hybrid real/bogus transient classifier for the Kilodegree Automatic Transient Survey☆10Nov 7, 2024Updated last year
- ChatGPT Advanced Voice Mode Gets an Avatar!☆11Sep 29, 2024Updated last year
- [ICLR 2024] Thin-shell Object Manipulations with Differentiable Physics Simulations☆53Jun 5, 2024Updated last year
- Runs your rivet graphs in a beautiful chat UI☆46Mar 4, 2024Updated 2 years ago
- ☆13Mar 27, 2025Updated 11 months ago
- This repository contains an attempt at using Graph Attention based Reinforcement Learning for graphical state space. The code also provid…☆10Jun 27, 2021Updated 4 years ago
- 道墨云印--树莓派硬件控制部分☆14Dec 8, 2022Updated 3 years ago
- PACE: Pacing Operator Learning to Accurate Optical Field Simulation for Complicated Photonic Devices, NeurIPs 2024☆16Dec 13, 2024Updated last year
- Helm charts repository☆16Oct 21, 2025Updated 4 months ago
- Block explorer for cosmos-sdk based application☆15Feb 23, 2023Updated 3 years ago
- Linked Data vocabulary and API for parliamentary and committee information systems☆18Mar 11, 2018Updated 7 years ago