Pytorch/XLA SPMD Test code in Google TPU
☆23Apr 3, 2024Updated 2 years ago
Alternatives and similar repositories for torch-xla-SPMD
Users that are interested in torch-xla-SPMD are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- See details in https://github.com/pytorch/xla/blob/r1.12/torch_xla/distributed/fsdp/README.md☆25Dec 22, 2022Updated 3 years ago
- A simple cycle-accurate DaDianNao simulator☆13Mar 27, 2019Updated 7 years ago
- A flexible and efficient implementation of Flash Attention 2.0 for JAX, supporting multiple backends (GPU/TPU/CPU) and platforms (Triton/…☆34Mar 4, 2025Updated last year
- 0-Shot Tokenizer Transplant☆14May 16, 2025Updated 11 months ago
- implementation of https://arxiv.org/pdf/2312.09299☆21Jul 3, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- (EasyDel Former) is a utility library designed to simplify and enhance the development in JAX☆30Updated this week
- Accelerate, Optimize performance with streamlined training and serving options with JAX.☆355Updated this week
- [ACL 2024] FLEUR: An Explainable Reference-Free Evaluation Metric for Image Captioning Using a Large Multimodal Model☆17Apr 28, 2025Updated 11 months ago
- 4-bit Shampoo for Memory-Efficient Network Training (NeurIPS 2024)☆13Feb 13, 2025Updated last year
- ☆13Apr 25, 2024Updated last year
- KANs and MLPs☆12Jun 7, 2024Updated last year
- Deep Learning Gravity Optimizer Source Code Repository☆13Jul 26, 2021Updated 4 years ago
- Train to 94% on CIFAR-10 in 4.4 seconds on a single A100☆12Dec 30, 2023Updated 2 years ago
- Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers.☆19Jul 24, 2025Updated 8 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆13May 21, 2024Updated last year
- 🤗 Transformers: State-of-the-art Natural Language Processing for TensorFlow 2.0 and PyTorch.☆17Jun 5, 2025Updated 10 months ago
- Maximal Update Parametrization (μP) with Flax & Optax.☆16Dec 27, 2023Updated 2 years ago
- An implementation of DecorrelatedBN by tensorflow☆13Jun 30, 2022Updated 3 years ago
- ☆17Apr 3, 2026Updated 2 weeks ago
- Bullseye Polytope Clean-Label Poisoning Attack☆15Nov 5, 2020Updated 5 years ago
- ☆11Jul 11, 2023Updated 2 years ago
- 트랜스포머 블록을 활용한 상품명 자연어처리 기반 카테고리 분류 모델☆10Dec 5, 2022Updated 3 years ago
- ☆21Sep 6, 2021Updated 4 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆21Jan 23, 2024Updated 2 years ago
- Neural network density models for speech separation.☆20Nov 26, 2020Updated 5 years ago
- PhoneGap NFC peer to peer demo☆22Jan 6, 2017Updated 9 years ago
- [NeurIPS 2025] MergeBench: A Benchmark for Merging Domain-Specialized LLMs☆46Feb 11, 2026Updated 2 months ago
- ☆14Dec 9, 2021Updated 4 years ago
- Orthogonal Matching Pursuit, parallelized on both CPU and GPU. 100x+ Speedup☆16Mar 30, 2026Updated 2 weeks ago
- Implementation of Bitune: Bidirectional Instruction-Tuning☆27Jun 19, 2025Updated 10 months ago
- The source code of our work "Prepacking: A Simple Method for Fast Prefilling and Increased Throughput in Large Language Models" [AISTATS …☆60Oct 11, 2024Updated last year
- Demo project for Cordova Host Card Emulation (HCE) plugin☆11Dec 7, 2015Updated 10 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- YouTube Assistant☆12May 15, 2023Updated 2 years ago
- JAX Implementations of Descript Audio Codec and EnCodec☆35Mar 30, 2025Updated last year
- This project is the official implementation of our ACM MM 2024 paper, OmniStitch: Depth-aware Stitching Framework for Omnidirectional Vis…☆18Aug 5, 2024Updated last year
- [ICLR 2025] Official code repository for "TULIP: Token-length Upgraded CLIP"☆32Jan 26, 2026Updated 2 months ago
- [Findings of NAACL2022] A Dog Is Passing Over The Jet? A Text-Generation Dataset for Korean Commonsense Reasoning and Evaluation☆11May 27, 2022Updated 3 years ago
- ☆19Sep 19, 2022Updated 3 years ago
- A simple React component that handles file drag and drop.☆12Sep 3, 2017Updated 8 years ago