JannikSt / ibtop
Real-time terminal monitor for InfiniBand networks - htop for high-speed interconnects
☆44 Updated last month
Alternatives and similar repositories for ibtop
Users who are interested in ibtop are comparing it to the repositories listed below
- Modded vLLM to run pipeline parallelism over public networks ☆39 Updated 5 months ago
- SIMD quantization kernels ☆89 Updated last month
- Training-Ready RL Environments + Evals ☆132 Updated last week
- A 7B parameter model for mathematical reasoning ☆40 Updated 8 months ago
- PCCL (Prime Collective Communications Library) implements fault tolerant collective communications over IP ☆133 Updated last month
- Storing long contexts in tiny caches with self-study ☆205 Updated last week
- Long context evaluation for large language models ☆224 Updated 7 months ago
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere) ☆107 Updated 7 months ago
- ☆135 Updated 7 months ago
- ☆21 Updated 9 months ago
- Simple Transformer in Jax ☆139 Updated last year
- Scaling is a distributed training library and installable dependency designed to scale up neural networks, with a dedicated module for tr… ☆64 Updated 3 weeks ago
- Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)* ☆87 Updated last year
- Compiling useful links, papers, benchmarks, ideas, etc. ☆45 Updated 7 months ago
- ☆105 Updated last week
- ☆231 Updated 4 months ago
- Google TPU optimizations for transformers models ☆121 Updated 9 months ago
- ☆142 Updated last month
- NanoGPT-speedrunning for the poor T4 enjoyers ☆72 Updated 6 months ago
- FlexAttention based, minimal vllm-style inference engine for fast Gemma 2 inference. ☆300 Updated 2 months ago
- ☆46 Updated last year
- Simple & Scalable Pretraining for Neural Architecture Research ☆297 Updated 2 months ago
- A zero-to-one guide on scaling modern transformers with n-dimensional parallelism. ☆104 Updated last month
- The Prime Intellect CLI provides a powerful command-line interface for managing GPU resources across various providers ☆100 Updated this week
- peer-to-peer compute and intelligence network that enables decentralized AI development at scale ☆127 Updated 3 months ago
- Load compute kernels from the Hub ☆308 Updated this week
- supporting pytorch FSDP for optimizers ☆83 Updated 10 months ago
- 👷 Build compute kernels ☆163 Updated this week
- j1-micro (1.7B) & j1-nano (600M) are absurdly tiny but mighty reward models. ☆98 Updated 3 months ago
- code for training & evaluating Contextual Document Embedding models ☆199 Updated 5 months ago