Modded vLLM to run pipeline parallelism over public networks
☆40May 20, 2025Updated 10 months ago
Alternatives and similar repositories for prime-vllm
Users that are interested in prime-vllm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Asynchronous P2P communication backend for decentralized pipeline parallelism☆42Jun 9, 2025Updated 9 months ago
- Solidity contracts for the decentralized Prime Network protocol☆26Jul 6, 2025Updated 8 months ago
- Estimate the throughput of OAI compatible servers☆21Mar 10, 2026Updated 2 weeks ago
- ☆137Mar 20, 2025Updated last year
- Collaborative inference of latent diffusion via hivemind☆12May 29, 2023Updated 2 years ago
- Manage ML configuration with pydantic☆16Updated this week
- A framework for PyTorch to enable fault management for collective communication libraries (CCL) such as NCCL☆20Feb 9, 2026Updated last month
- peer-to-peer compute and intelligence network that enables decentralized AI development at scale☆137Nov 10, 2025Updated 4 months ago
- ☆32Apr 19, 2025Updated 11 months ago
- TOPLOC: is a novel method for verifiable inference that enables users to verify that LLM providers are using the correct model configurat…☆52Apr 14, 2025Updated 11 months ago
- Generic implementation of the Number Theoretic Transform in the context of cryptography applications☆14Aug 13, 2025Updated 7 months ago
- a libp2p-backed daemon wrapping the functionalities of go-libp2p for use in other languages☆11Feb 9, 2025Updated last year
- Lightly-reviewed collection of community environments☆219Mar 12, 2026Updated last week
- Easily benchmark Machine Learning models on selected tasks and datasets☆16May 22, 2023Updated 2 years ago
- ChatGPT connected to the web to have no more restrictions and be able to summarize the latest informations after 2021☆10Mar 3, 2023Updated 3 years ago
- Projects from the Succinct ZK Residency☆20Oct 23, 2024Updated last year
- APAR: LLMs Can Do Auto-Parallel Auto-Regressive Decoding☆14Jul 22, 2024Updated last year
- Pending transaction stream in rust☆27May 31, 2021Updated 4 years ago
- Receiver operating characteristic curve (ROC) computation code in C++☆11Jul 17, 2017Updated 8 years ago
- ☆22May 5, 2025Updated 10 months ago
- prime is a framework for efficient, globally distributed training of AI models over the internet.☆851Nov 16, 2025Updated 4 months ago
- Libp2p bindings for Python☆12Jan 26, 2026Updated last month
- Gradients on demand☆33Updated this week
- Showing how to use CUDA on google colab☆13Feb 24, 2025Updated last year
- A 7B parameter model for mathematical reasoning☆42Feb 17, 2025Updated last year
- uct tree search + supervised lerning for atari games☆12Feb 14, 2017Updated 9 years ago
- fast trainer for educational purposes☆24Mar 12, 2026Updated last week
- An implementation of the Sequence to Sequence model using the Lasagne library (WIP)☆12Aug 11, 2016Updated 9 years ago
- ☆16Mar 23, 2023Updated 3 years ago
- Memory-efficient transformer. Work in progress.☆19Sep 17, 2022Updated 3 years ago
- ☆38Aug 7, 2025Updated 7 months ago
- Barebones Rust EVM Implementation☆12Feb 9, 2022Updated 4 years ago
- Highly concurrent and fast content processing for Mighty Inference Server☆10Feb 6, 2023Updated 3 years ago
- A badge for join telegram chat room or channel.☆15Jan 9, 2016Updated 10 years ago
- Main repo for Datagov Harvester 2.0. Contains the code for Flask API and Harvesting Logic☆16Updated this week
- ☆67Nov 4, 2024Updated last year
- utilities to facilitate working with codebases that don't ascribe to normal package management paradigms, e.g. ML research code that can …☆13Nov 26, 2022Updated 3 years ago
- Blog post: how to do deterministic policy gradient with gumbel softmax and why you should do it.☆12Jun 20, 2017Updated 8 years ago
- ☆18Apr 22, 2023Updated 2 years ago