AI-Hypercomputer / pathways-utils
Package of Pathways-on-Cloud utilities
☆22Updated this week
Alternatives and similar repositories for pathways-utils
Users who are interested in pathways-utils are comparing it to the libraries listed below.
- ☆69Updated last week
- ☆21Updated 10 months ago
- AI-Driven Research Systems (ADRS)☆113Updated 3 weeks ago
- ☆16Updated 2 months ago
- A lightweight, user-friendly data-plane for LLM training.☆38Updated 4 months ago
- PCCL (Prime Collective Communications Library) implements fault tolerant collective communications over IP☆141Updated 4 months ago
- Memory optimized Mixture of Experts☆72Updated 5 months ago
- Official repo of dataset-decomposition paper [NeurIPS 2024]☆20Updated last year
- Write a fast kernel and run it on Discord. See how you compare against the best!☆66Updated last week
- vLLM adapter for a TGIS-compatible gRPC server.☆47Updated this week
- PyTorch centric eager mode debugger☆48Updated last year
- xpk (Accelerated Processing Kit, pronounced x-p-k) is a software tool to help Cloud developers orchestrate training jobs on accelerat…☆159Updated this week
- A collection of lightweight interpretability scripts to understand how LLMs think☆88Updated 2 weeks ago
- Simple repository for training small reasoning models☆47Updated 11 months ago
- LM engine is a library for pretraining/finetuning LLMs☆108Updated this week
- ☆15Updated last year
- ML/DL Math and Method notes☆66Updated 2 years ago
- Parallel framework for training and fine-tuning deep neural networks☆69Updated 2 months ago
- 🚀 Collection of libraries used with fms-hf-tuning to accelerate fine-tuning and training of large models.☆13Updated 3 weeks ago
- ☆16Updated last year
- A collection of reproducible inference engine benchmarks☆38Updated 8 months ago
- Simple and efficient DeepSeek V3 SFT using pipeline parallel and expert parallel, with both FP8 and BF16 trainings☆112Updated 5 months ago
- ☆47Updated last year
- Make triton easier☆50Updated last year
- Train, tune, and infer Bamba model☆137Updated 7 months ago
- Some microbenchmarks and design docs before commencement☆12Updated 4 years ago
- ☆27Updated this week
- The code for the paper ROUTERBENCH: A Benchmark for Multi-LLM Routing System☆153Updated last year
- PyTorch/XLA integration with JetStream (https://github.com/google/JetStream) for LLM inference☆79Updated 3 weeks ago
- A place to store reusable transformer components of my own creation or found on the interwebs☆71Updated this week