JannikSt / ibtopLinks

Real-time terminal monitor for InfiniBand networks - htop for high-speed interconnects

☆48

Alternatives and similar repositories for ibtop

Users that are interested in ibtop are comparing it to the libraries listed below

Sorting:

PrimeIntellect-ai / prime-vllm
Modded vLLM to run pipeline parallelism over public networks
☆40Updated 6 months ago
PrimeIntellect-ai / pi-quant
SIMD quantization kernels
☆92Updated 2 months ago
PrimeIntellect-ai / pccl
PCCL (Prime Collective Communications Library) implements fault tolerant collective communications over IP
☆138Updated 2 months ago
PrimeIntellect-ai / INTELLECT-MATH
A 7B parameter model for mathematical reasoning
☆40Updated 9 months ago
HazyResearch / cartridges
Storing long contexts in tiny caches with self-study
☆217Updated last month
xrsrke / pipegoose
Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)*
☆87Updated last year
PrimeIntellect-ai / prime-environments
Training-Ready RL Environments + Evals
☆182Updated this week
magicproduct / hash-hop
Long context evaluation for large language models
☆224Updated 8 months ago
JoeLi12345 / nGPT
an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)
☆108Updated 8 months ago
nano-R1 / resources
Compiling useful links, papers, benchmarks, ideas, etc.
☆45Updated 8 months ago
google-deepmind / mishax
☆143Updated 2 months ago
athms / mad-lab
A MAD laboratory to improve AI architecture designs 🧪
☆135Updated 11 months ago
yixiaoer / tpux
A set of Python scripts that makes your experience on TPU better
☆54Updated 2 months ago
divyamakkar0 / JAXformer
A zero-to-one guide on scaling modern transformers with n-dimensional parallelism.
☆105Updated 2 months ago
matttreed / diloco-sim
☆21Updated 10 months ago
xjdr-alt / simple_transformer
Simple Transformer in Jax
☆139Updated last year
pyember / ember
☆233Updated 5 months ago
Aleph-Alpha-Research / scaling
Scaling is a distributed training library and installable dependency designed to scale up neural networks, with a dedicated module for tr…
☆66Updated last week
LeonGuertler / UnstableBaselines
☆106Updated last month
huggingface / optimum-tpu
Google TPU optimizations for transformers models
☆122Updated 10 months ago
EleutherAI / nanoGPT-mup
The simplest, fastest repository for training/finetuning medium-sized GPTs.
☆174Updated 5 months ago
PrimeIntellect-ai / genesys
☆136Updated 8 months ago
MatX-inc / seqax
seqax = sequence modeling + JAX
☆168Updated 4 months ago
gautierdag / bpeasy
Fast bare-bones BPE for modern tokenizer training
☆171Updated 5 months ago
NousResearch / StripedHyenaTrainer
☆62Updated last year
samsja / pydantic_config
Manage ML configuration with pydantic
☆16Updated 6 months ago
HazyResearch / train-tk
train with kittens!
☆63Updated last year
ayaka14732 / llama-2-jax
JAX implementation of the Llama 2 model
☆216Updated last year
huggingface / kernels
Load compute kernels from the Hub
☆335Updated this week
HazyResearch / zoology
Understand and test language model architectures on synthetic tasks.
☆240Updated 2 months ago