PrimeIntellect-ai / prime-vllmLinks
Modded vLLM to run pipeline parallelism over public networks
☆39Updated 3 months ago
Alternatives and similar repositories for prime-vllm
Users that are interested in prime-vllm are comparing it to the libraries listed below
Sorting:
- PCCL (Prime Collective Communications Library) implements fault tolerant collective communications over IP☆120Updated last week
- Solidity contracts for the decentralized Prime Network protocol☆25Updated 2 months ago
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆105Updated 6 months ago
- train entropix like a champ!☆20Updated 11 months ago
- A 7B parameter model for mathematical reasoning☆40Updated 7 months ago
- ☆133Updated 5 months ago
- SIMD quantization kernels☆87Updated last week
- look how they massacred my boy☆64Updated 11 months ago
- Plotting (entropy, varentropy) for small LMs☆98Updated 3 months ago
- The Prime Intellect CLI provides a powerful command-line interface for managing GPU resources across various providers☆84Updated this week
- TOPLOC: is a novel method for verifiable inference that enables users to verify that LLM providers are using the correct model configurat…☆42Updated 5 months ago
- DeMo: Decoupled Momentum Optimization☆190Updated 9 months ago
- ☆121Updated last year
- ☆21Updated 8 months ago
- NSA Triton Kernels written with GPT5 and Opus 4.1☆65Updated last month
- Train your own SOTA deductive reasoning model☆106Updated 6 months ago
- smolLM with Entropix sampler on pytorch☆150Updated 10 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆55Updated 7 months ago
- Storing long contexts in tiny caches with self-study☆181Updated this week
- ☆223Updated 2 months ago
- Long context evaluation for large language models☆221Updated 6 months ago
- Real-time terminal monitor for InfiniBand networks - htop for high-speed interconnects☆40Updated 2 weeks ago
- Decentralized RL Training at Scale☆592Updated this week
- ☆14Updated last year
- ☆68Updated 3 months ago
- smol models are fun too☆93Updated 10 months ago
- Compiling useful links, papers, benchmarks, ideas, etc.☆45Updated 6 months ago
- Entropy Based Sampling and Parallel CoT Decoding☆17Updated 11 months ago
- ☆39Updated last year
- A tree-based prefix cache library that allows rapid creation of looms: hierarchal branching pathways of LLM generations.☆74Updated 7 months ago