Automatically take good care of your preemptible TPUs
☆37 May 15, 2023 Updated 2 years ago
Alternatives and similar repositories for tpucare
Users that are interested in tpucare are comparing it to the libraries listed below
- Implementation of PSGD optimizer in JAX ☆35 Dec 31, 2024 Updated last year
- Train a SmolLM-style LLM on fineweb-edu in JAX/Flax with an assortment of optimizers. ☆18 Jul 24, 2025 Updated 7 months ago
- ☆18 Aug 24, 2024 Updated last year
- HomebrewNLP in JAX flavour for maintainable TPU-Training ☆51 Jan 20, 2024 Updated 2 years ago
- PyTorch interface for TrueGrad Optimizers ☆43 Aug 8, 2023 Updated 2 years ago
- An LLM inference engine, written in C++ ☆18 Feb 5, 2026 Updated 3 weeks ago
- DiT (training + flow matching) in JAX ☆11 Jan 5, 2025 Updated last year
- Maximal Update Parametrization (μP) with Flax & Optax. ☆16 Dec 27, 2023 Updated 2 years ago
- Masking Strategies for Background Bias Removal in Computer Vision Models (ICCVW OODCV 2023 paper) ☆16 Jul 3, 2025 Updated 7 months ago
- Synthetic Data Generation for Evaluation ☆13 Feb 21, 2025 Updated last year
- ☆16 Dec 30, 2024 Updated last year
- ☆21 Jan 23, 2024 Updated 2 years ago
- Virtual Adversarial Training (VAT) techniques in PyTorch ☆17 Jul 19, 2022 Updated 3 years ago
- Exploring finetuning public checkpoints on filter 8K sequences on Pile ☆115 Mar 22, 2023 Updated 2 years ago
- Easily turn large sets of audio urls to an audio dataset. ☆21 Dec 27, 2022 Updated 3 years ago
- Reimplementation of `Improving language models by retrieving from trillions of tokens` ☆19 Nov 16, 2022 Updated 3 years ago
- Machine Learning eXperiment Utilities ☆48 Jul 29, 2025 Updated 7 months ago
- ☆25 Dec 12, 2025 Updated 2 months ago
- This is the official repo for Gradient Agreement Filtering (GAF). ☆24 Jan 27, 2025 Updated last year
- Train vision models using JAX and 🤗 transformers ☆100 Dec 14, 2025 Updated 2 months ago
- Optimized library for large-scale extraction of frames and audio from video. ☆201 Sep 11, 2023 Updated 2 years ago
- Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in PyTorch ☆88 Dec 3, 2021 Updated 4 years ago
- A six-dimensional evaluation framework for drama script continuation with interactive leaderboard and case studies ☆82 Jan 1, 2026 Updated 2 months ago
- Implementation of Diffusion Transformers and Rectified Flow in JAX ☆27 Jul 9, 2024 Updated last year
- Generate visual podcasts about novels using open source models ☆26 Feb 15, 2023 Updated 3 years ago
- Script and models for clustering LAION-400m CLIP embeddings. ☆26 Jan 10, 2022 Updated 4 years ago
- This repository contains the data and code for the paper "Diverse Text Generation via Variational Encoder-Decoder Models with Gaussian Pr… ☆26 Jun 27, 2022 Updated 3 years ago
- PyTorch implementation of HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models ☆28 Mar 22, 2024 Updated last year
- Efficient optimizers ☆285 Dec 20, 2025 Updated 2 months ago
- ☆29 Sep 30, 2025 Updated 5 months ago
- CLOOB Conditioned Latent Diffusion training and inference code ☆111 Apr 15, 2022 Updated 3 years ago
- ☆32 Jul 24, 2023 Updated 2 years ago
- No Parameters Left Behind: Sensitivity Guided Adaptive Learning Rate for Training Large Transformer Models (ICLR 2022) ☆29 Feb 9, 2022 Updated 4 years ago
- The official repo of continuous speculative decoding ☆31 Mar 28, 2025 Updated 11 months ago
- Codebase for fine-tuning Llama2 70B to generate math test questions and answers. ☆11 Aug 30, 2024 Updated last year
- Focused on fast experimentation and simplicity ☆80 Dec 24, 2024 Updated last year
- Deep Networks Grok All the Time and Here is Why ☆38 May 18, 2024 Updated last year
- XFT: Unlocking the Power of Code Instruction Tuning by Simply Merging Upcycled Mixture-of-Experts ☆35 Jul 2, 2024 Updated last year
- ☆33 Nov 4, 2024 Updated last year