Modded vLLM to run pipeline parallelism over public networks
☆40May 20, 2025Updated last year
Alternatives and similar repositories for prime-vllm
Users that are interested in prime-vllm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Solidity contracts for the decentralized Prime Network protocol☆26Jul 6, 2025Updated 10 months ago
- ☆138Mar 20, 2025Updated last year
- Collaborative inference of latent diffusion via hivemind☆12May 29, 2023Updated 2 years ago
- Manage ML configuration with pydantic☆16Mar 18, 2026Updated 2 months ago
- A framework for PyTorch to enable fault management for collective communication libraries (CCL) such as NCCL☆20Feb 9, 2026Updated 3 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Official CLI and Python SDK for Prime Intellect - access GPU compute, remote sandboxes, RL environments, and distributed training infrast…☆203Updated this week
- peer-to-peer compute and intelligence network that enables decentralized AI development at scale☆136Nov 10, 2025Updated 6 months ago
- ☆21Apr 27, 2026Updated 3 weeks ago
- OpenDiLoCo: An Open-Source Framework for Globally Distributed Low-Communication Training☆573Jan 13, 2025Updated last year
- ☆33Apr 19, 2025Updated last year
- All-in-one Full-Featured Python/Flet/Flutter Application to make the most of all the latest Open-Source AI Art Generators in an intuitive…☆16May 30, 2025Updated 11 months ago
- ☆32Nov 14, 2024Updated last year
- TOPLOC: is a novel method for verifiable inference that enables users to verify that LLM providers are using the correct model configurat…☆55Apr 14, 2025Updated last year
- ☆10Dec 21, 2024Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- A graph database library for iOS and MacOS.☆14Mar 4, 2018Updated 8 years ago
- Easily benchmark Machine Learning models on selected tasks and datasets☆16May 22, 2023Updated 3 years ago
- Agentkube - Run Kubernetes Like Never Before☆38Mar 1, 2026Updated 2 months ago
- APAR: LLMs Can Do Auto-Parallel Auto-Regressive Decoding☆14Jul 22, 2024Updated last year
- Sparips.jl: Practical sparsification of Rips complexes☆11Jan 21, 2019Updated 7 years ago
- ☆11Dec 9, 2020Updated 5 years ago
- ☆22May 5, 2025Updated last year
- Receiver operating characteristic curve (ROC) computation code in C++☆11Jul 17, 2017Updated 8 years ago
- Libp2p bindings for Python☆12Mar 21, 2026Updated 2 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- prime is a framework for efficient, globally distributed training of AI models over the internet.☆855Nov 16, 2025Updated 6 months ago
- Scaling is a distributed training library and installable dependency designed to scale up neural networks, with a dedicated module for tr…☆66Nov 18, 2025Updated 6 months ago
- Gradients on demand☆34Updated this week
- ☆10Oct 7, 2022Updated 3 years ago
- A library to use `modal` as a backend for `joblib`.☆32Jan 15, 2025Updated last year
- IndexCache: Accelerating Sparse Attention via Cross-Layer Index Reuse☆101Mar 14, 2026Updated 2 months ago
- A 7B parameter model for mathematical reasoning☆42Updated this week
- ☆12Jul 10, 2023Updated 2 years ago
- uct tree search + supervised lerning for atari games☆12Feb 14, 2017Updated 9 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Repository for "I am a Strange Dataset: Metalinguistic Tests for Language Models"☆46Jan 11, 2024Updated 2 years ago
- ☆14Apr 16, 2025Updated last year
- An implementation of the Sequence to Sequence model using the Lasagne library (WIP)☆12Aug 11, 2016Updated 9 years ago
- Queue system for dispatching FFmpeg jobs, used for @uwutube, powered by @fastify and @redis☆10Feb 12, 2022Updated 4 years ago
- ☆16Mar 23, 2023Updated 3 years ago
- Memory-efficient transformer. Work in progress.☆19Sep 17, 2022Updated 3 years ago
- Ship correct and fast LLM kernels to PyTorch☆150Jan 14, 2026Updated 4 months ago