AI-Hypercomputer / pathways-utils
Package of Pathways-on-Cloud utilities
☆20 Updated this week
Alternatives and similar repositories for pathways-utils
Users who are interested in pathways-utils are comparing it to the libraries listed below.
- ☆46 Updated last week
- PyTorch-centric eager mode debugger ☆48 Updated 10 months ago
- 🚀 Collection of libraries used with fms-hf-tuning to accelerate fine-tuning and training of large models. ☆13 Updated last week
- ML/DL Math and Method notes ☆64 Updated last year
- Simple repository for training small reasoning models ☆40 Updated 8 months ago
- Cray-LM unified training and inference stack. ☆22 Updated 8 months ago
- Intel Gaudi's Megatron DeepSpeed Large Language Models for training ☆13 Updated 10 months ago
- A collection of reproducible inference engine benchmarks ☆34 Updated 5 months ago
- ☆19 Updated 2 months ago
- xpk (Accelerated Processing Kit, pronounced x-p-k) is a software tool that helps Cloud developers orchestrate training jobs on accelerat… ☆146 Updated this week
- Official repo of dataset-decomposition paper [NeurIPS 2024] ☆20 Updated 9 months ago
- ☆21 Updated 7 months ago
- A collection of lightweight interpretability scripts to understand how LLMs think ☆59 Updated this week
- Train a SmolLM-style LLM on fineweb-edu in JAX/Flax with an assortment of optimizers. ☆18 Updated 2 months ago
- Train, tune, and run inference with the Bamba model ☆134 Updated 4 months ago
- LLM training in simple, raw C/CUDA ☆15 Updated 10 months ago
- A place to store reusable transformer components of my own creation or found on the interwebs ☆60 Updated this week
- ☆77 Updated last month
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera… ☆96 Updated last week
- ☆15 Updated last week
- Slides and recordings of talks hosted by our community ☆20 Updated last year
- PyTorch/XLA integration with JetStream (https://github.com/google/JetStream) for LLM inference ☆74 Updated last month
- Benchmark suite for LLMs from Fireworks.ai ☆83 Updated 2 weeks ago
- ☆46 Updated last year
- Google TPU optimizations for transformers models ☆120 Updated 8 months ago
- OLMost every training recipe you need to perform data interventions with the OLMo family of models. ☆50 Updated last week
- 👷 Build compute kernels ☆158 Updated this week
- Simple and efficient DeepSeek V3 SFT using pipeline parallel and expert parallel, with both FP8 and BF16 training ☆85 Updated 2 months ago
- ☆77 Updated this week
- Load compute kernels from the Hub ☆299 Updated this week