☆16Jul 8, 2024Updated last year
Alternatives and similar repositories for tpu-training-example
Users that are interested in tpu-training-example are comparing it to the libraries listed below
Sorting:
- JAX implementation of the Mistral 7b v0.2 model☆35Jul 3, 2024Updated last year
- A set of Python scripts that makes your experience on TPU better☆56Sep 18, 2025Updated 5 months ago
- JAX implementation of the Mistral 7b v0.1 model☆13Mar 27, 2024Updated last year
- Einsum-like high-level array sharding API for JAX☆34Jul 16, 2024Updated last year
- DiT (training + flow matching) in Jax☆11Jan 5, 2025Updated last year
- ☆15Feb 24, 2026Updated last week
- Benchmark tests supporting the TiledCUDA library.☆18Nov 19, 2024Updated last year
- JAX implementation of the T5 model: Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer☆24Jun 10, 2023Updated 2 years ago
- ☆32Jul 2, 2025Updated 8 months ago
- ESM2 protein language models in JAX/Flax☆18Oct 10, 2022Updated 3 years ago
- Minimal yet performant LLM examples in pure JAX☆240Jan 14, 2026Updated last month
- ☆22May 4, 2021Updated 4 years ago
- ☆44Updated this week
- Supplemental materials for The ASPLOS 2025 / EuroSys 2025 Contest on Intra-Operator Parallelism for Distributed Deep Learning☆25May 12, 2025Updated 9 months ago
- GPU Performance Advisor☆66Jul 25, 2022Updated 3 years ago
- An experimental communicating attention kernel based on DeepEP.☆35Jul 29, 2025Updated 7 months ago
- A library to extract plaintexts from the JSON dump file of namu wiki☆26Oct 6, 2022Updated 3 years ago
- Free ChatGPT API Key,免费ChatGPT API,支持GPT4 API,ChatGPT国内可用免费转发API,直连无需代理。☆13Aug 28, 2024Updated last year
- A FlashAttention implementation for JAX with support for efficient document mask computation and context parallelism.☆158Nov 11, 2025Updated 3 months ago
- [ACL 2023] Code for ContraCLM: Contrastive Learning For Causal Language Model☆35Dec 20, 2023Updated 2 years ago
- ☆33Nov 4, 2024Updated last year
- Everything you want to know about Google Cloud TPU☆566Jul 16, 2024Updated last year
- Minimal but scalable implementation of large language models in JAX☆35Nov 28, 2025Updated 3 months ago
- supporting pytorch FSDP for optimizers☆84Dec 8, 2024Updated last year
- Using spaCy and NLTK along with a Bag of Words approach, this repo automates the process of combing through FOMC minutes transcripts to d…☆12Feb 11, 2021Updated 5 years ago
- ☆28Dec 3, 2025Updated 3 months ago
- A TUI-based utility for real-time monitoring of InfiniBand traffic and performance metrics on the local node☆63Dec 19, 2025Updated 2 months ago
- Simple implementation of muP, based on Spectral Condition for Feature Learning. The implementation is SGD only, dont use it for Adam☆86Jul 28, 2024Updated last year
- ☆10Jun 8, 2024Updated last year
- Make Git monorepos like a boss.☆13Dec 18, 2025Updated 2 months ago
- Asynchronous Wiki Engine☆10Sep 4, 2023Updated 2 years ago
- 기획자와 마케터를 위한 이벤트 댓글 분석 - feat. 인프런 새해 다짐 이벤트☆11Apr 22, 2020Updated 5 years ago
- 235,886 Words for Go☆12Nov 16, 2018Updated 7 years ago
- Visualizing 230 years of US Census data☆12Feb 23, 2020Updated 6 years ago
- ☆116May 16, 2025Updated 9 months ago
- JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs wel…☆414Jan 5, 2026Updated 2 months ago
- seqax = sequence modeling + JAX☆186Jul 23, 2025Updated 7 months ago
- Neural likelihood-free methods in PyTorch.☆39Feb 11, 2020Updated 6 years ago
- ☆52Jun 10, 2024Updated last year