⚡️ A fast and flexible PyTorch inference server that runs locally, on any cloud or AI HW.
☆147Jun 8, 2024Updated last year
Alternatives and similar repositories for nos
Users that are interested in nos are comparing it to the libraries listed below
Sorting:
- A Dockerfile builder for Machine Learning developers☆20May 3, 2024Updated last year
- Substrate TypeScript SDK☆10Sep 20, 2024Updated last year
- Rosbag2parquet transforms ROS .bag files into query friendlier .parquet files in C++ (ie. without going through python)☆17Sep 1, 2017Updated 8 years ago
- Simple orchestration for EC2 spot containers☆19Sep 27, 2024Updated last year
- ☆12Apr 1, 2024Updated last year
- ojjson is a library designed to facilitate JSON interactions with Ollama, a large language api (LLM). It leverages the power of Zod for s…☆12Nov 7, 2024Updated last year
- The (open-source part of) code to reproduce "BPPSA: Scaling Back-propagation by Parallel Scan Algorithm".☆13Jun 7, 2021Updated 4 years ago
- Hassle-free ML Pipelines on Kubernetes☆39May 28, 2023Updated 2 years ago
- Simple LLM inference server☆20Jun 13, 2024Updated last year
- ☆31Updated this week
- Deep Learning Inference in 35 Lines of Python☆22Mar 27, 2015Updated 10 years ago
- Berkeley OS Prelim Reading Notes☆15Sep 20, 2023Updated 2 years ago
- Example of a Streamlit data app powered by Vaex☆11Jul 7, 2022Updated 3 years ago
- Research tools for autonomous systems in Python☆62Dec 7, 2022Updated 3 years ago
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.☆96Feb 9, 2023Updated 3 years ago
- [NeurIPS 2024] The official implementation of "Image Copy Detection for Diffusion Models"☆18Oct 1, 2024Updated last year
- A guide to testing different runpod (and other linux VMs) configurations. Specifically the speed of LLM outputs☆17Jan 12, 2024Updated 2 years ago
- A sleek, customizable interface for managing LLMs with responsive design and easy agent personalization.☆17Aug 30, 2024Updated last year
- Fast and easy Jupyter notebooks☆13Mar 7, 2023Updated 3 years ago
- Adapting Self-Supervised Representations as a Latent Space for Efficient Generation☆40Oct 17, 2025Updated 5 months ago
- ☆12Feb 22, 2024Updated 2 years ago
- A stable, fast and easy-to-use inference library with a focus on a sync-to-async API☆48Sep 26, 2024Updated last year
- ☆17Dec 18, 2023Updated 2 years ago
- ☆17May 22, 2025Updated 9 months ago
- SGLang is fast serving framework for large language models and vision language models.☆33Nov 24, 2025Updated 3 months ago
- A web-app to explore topics using LLM (less typing and more clicks)☆68Updated this week
- ☆67May 23, 2025Updated 9 months ago
- A frontend for creative writing with LLMs☆156Jul 15, 2024Updated last year
- Universal connector to LLMs for Node.js & Bun☆30Updated this week
- A really tiny autograd engine☆100May 26, 2025Updated 9 months ago
- ☆24Dec 27, 2024Updated last year
- Automated LLM novelist☆46Apr 11, 2024Updated last year
- ☆14Dec 21, 2025Updated 2 months ago
- ☆16Sep 9, 2023Updated 2 years ago
- ☆33Oct 20, 2025Updated 4 months ago
- Official implementation of Half-Quadratic Quantization (HQQ)☆919Feb 26, 2026Updated 3 weeks ago
- ☆10Oct 1, 2019Updated 6 years ago
- Apache Arrow-compatible space-efficient "tape" class in pure Rust to be used with StringZilla for GPU, NUMA, and disk transfers of variab…☆29Nov 21, 2025Updated 3 months ago
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs☆3,736May 21, 2025Updated 9 months ago