AI-Hypercomputer / pathways-utils
Package of Pathways-on-Cloud utilities
☆20Updated 3 weeks ago
Alternatives and similar repositories for pathways-utils
Users who are interested in pathways-utils are comparing it to the libraries listed below
- ☆51Updated last week
- Simple repository for training small reasoning models☆44Updated 9 months ago
- ☆21Updated 8 months ago
- ☆46Updated last year
- ML/DL Math and Method notes☆64Updated last year
- Seamless interface for using PyTorch distributed with Jupyter notebooks☆53Updated last month
- PyTorch centric eager mode debugger☆48Updated 10 months ago
- A sample pattern for running CI tests on Modal☆18Updated 6 months ago
- Train a SmolLM-style LLM on fineweb-edu in JAX/Flax with an assortment of optimizers.☆18Updated 3 months ago
- Intel Gaudi's Megatron DeepSpeed Large Language Models for training☆14Updated 10 months ago
- ☆19Updated 2 weeks ago
- ☆15Updated 11 months ago
- Official repo of dataset-decomposition paper [NeurIPS 2024]☆20Updated 10 months ago
- The official evaluation suite and dynamic data release for MixEval.☆11Updated last year
- A tool for analyzing LLM generations.☆40Updated 3 weeks ago
- torchax is a PyTorch frontend for JAX. It gives JAX the ability to author JAX programs using familiar PyTorch syntax. It also provides JA…☆117Updated this week
- ☆15Updated 2 weeks ago
- train with kittens!☆63Updated last year
- Train, tune, and infer Bamba model☆135Updated 5 months ago
- ☆15Updated 5 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆58Updated 3 weeks ago
- NanoGPT (124M) quality in 2.67B tokens☆28Updated last month
- ☆55Updated last year
- Slide decks, coding exercises, and quick references for learning the JAX AI Stack☆65Updated this week
- Tensor-Slayer : Manipulate weights and tensors of LLMs to achieve performance upgrades and introduce a novel inferenceless mechanistic in…☆25Updated 5 months ago
- A collection of lightweight interpretability scripts to understand how LLMs think☆61Updated last week
- a pipeline for using api calls to agnostically convert unstructured data into structured training data☆31Updated last year
- [ICLR'25] Code for KaSA, an official implementation of "KaSA: Knowledge-Aware Singular-Value Adaptation of Large Language Models"☆20Updated 9 months ago
- All information and news with respect to Falcon-H1 series☆93Updated 3 weeks ago
- ☆11Updated 7 months ago