See details in https://github.com/pytorch/xla/blob/r1.12/torch_xla/distributed/fsdp/README.md
☆25Dec 22, 2022Updated 3 years ago
Alternatives and similar repositories for vit_10b_fsdp_example
Users that are interested in vit_10b_fsdp_example are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Fast and easy distributed model training examples.☆12Nov 26, 2024Updated last year
- Implementation of VQ-VAE with a GPT-style sampler in the JAX and Haiku ecosystem.☆12Nov 23, 2023Updated 2 years ago
- (EasyDel Former) is a utility library designed to simplify and enhance the development in JAX☆29Mar 20, 2026Updated last week
- ☆16Apr 10, 2022Updated 3 years ago
- Google TPU optimizations for transformers models☆136Jan 23, 2026Updated 2 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Pytorch/XLA SPMD Test code in Google TPU☆23Apr 3, 2024Updated last year
- Document parameters using comments☆10Aug 6, 2021Updated 4 years ago
- PyTorch distributed training acceleration framework☆54Aug 13, 2025Updated 7 months ago
- Tool to parse wiki tables from the HTML dump of Wikipedia☆11Jun 12, 2022Updated 3 years ago
- The benchmark for "Video Object Segmentation in Panoptic Wild Scenes".☆12Oct 17, 2023Updated 2 years ago
- Machine Learning eXperiment Utilities☆48Jul 29, 2025Updated 7 months ago
- This repository contains example code to build models on TPUs☆30Feb 17, 2023Updated 3 years ago
- JAX notebook showing how to LoRA + GPTQ arbitrary models☆10Aug 8, 2023Updated 2 years ago
- A compiler written in Java to compile a subset of instructions called MiniJava.☆10Apr 20, 2015Updated 10 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A Simple Adaptive Unfolding Network for Hyperspectral Image Reconstruction☆32Feb 1, 2023Updated 3 years ago
- This repository contains scripts for conversion of data required for most commonly found Machine Learning tasks to TFRecords☆13Mar 6, 2021Updated 5 years ago
- VIT inference in triton because, why not?☆36May 31, 2024Updated last year
- hllama is a library which aims to provide a set of utility tools for large language models.☆10Apr 16, 2024Updated last year
- Unofficial entropix impl for Gemma2 and Llama and Qwen2 and Mistral☆17Jan 12, 2025Updated last year
- Code of "NeuSample: Neural Sample Field for Efficient View Synthesis"☆37Oct 10, 2022Updated 3 years ago
- A simple library for scaling up JAX programs☆146Nov 4, 2025Updated 4 months ago
- ViT trained on COYO-Labeled-300M dataset☆33Nov 24, 2022Updated 3 years ago
- Creates a Docker image with all the prerequisites needed to run the projects of the Udacity Robotics Nanodegree.☆13Feb 13, 2018Updated 8 years ago
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- Solution of Kaggle competition: MAP - Charting Student Math Misunderstandings☆26Oct 25, 2025Updated 5 months ago
- Demo project for Cordova Host Card Emulation (HCE) plugin☆11Dec 7, 2015Updated 10 years ago
- The official github repo for MixEval-X, the first any-to-any, real-world benchmark.☆17Feb 15, 2025Updated last year
- Code for the paper "TD or not TD: Analyzing the Role of Temporal Differencing in Deep Reinforcement Learning", Artemij Amiranashvili, Ale…☆12Aug 24, 2018Updated 7 years ago
- Implementation of numerous Vision Transformers in Google's JAX and Flax.☆22Aug 30, 2022Updated 3 years ago
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax☆16Jun 16, 2024Updated last year
- TPU에서 한국어용 LLM 추론을 위한 Jax/Flax 구현체입니다.☆12Jun 12, 2023Updated 2 years ago
- A set of Python scripts that makes your experience on TPU better☆56Sep 18, 2025Updated 6 months ago
- Pile Deduplication Code☆18May 15, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆13May 7, 2023Updated 2 years ago
- Featurized Query R-CNN☆45Jun 17, 2022Updated 3 years ago
- Python compiler that utilizes PLY and llvmlite☆12Apr 5, 2018Updated 7 years ago
- Instruction Following Eval☆16Jan 16, 2025Updated last year
- Please visit https://github.com/HKUSTDial/NL2SQL360 to get the official code!☆10Sep 1, 2024Updated last year
- Complex-Edit: CoT-Like Instruction Generation for Complexity-Controllable Image Editing Benchmark☆28Apr 22, 2025Updated 11 months ago
- ☆57Apr 23, 2024Updated last year