ztjhz / t5-jax
JAX implementation of the T5 model: Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
☆22 Updated last year
Alternatives and similar repositories for t5-jax
Users interested in t5-jax are comparing it to the libraries listed below.
- Implementation of VQ-VAE with a GPT-style sampler in the JAX and Haiku ecosystem. ☆12 Updated last year
- Accelerated replay buffers in JAX ☆41 Updated 2 years ago
- HomebrewNLP in JAX flavour for maintainable TPU training ☆50 Updated last year
- Implementation of CASCADE in Learning General World Models in a Handful of Reward-Free Deployments (NeurIPS 22). ☆29 Updated 2 years ago
- GPT implementation in Flax ☆18 Updated 3 years ago
- Official code for the paper "Context-Aware Language Modeling for Goal-Oriented Dialogue Systems" ☆34 Updated 2 years ago
- Machine Learning eXperiment Utilities ☆46 Updated 11 months ago
- Repo to reproduce the First-Explore paper results ☆37 Updated 4 months ago
- TPU pod commander is a package for managing and launching jobs on Google Cloud TPU pods. ☆20 Updated 10 months ago
- Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents" ☆31 Updated last year
- ☆18 Updated last year
- Code for Discovered Policy Optimisation (NeurIPS 2022) ☆10 Updated last year
- Generative cellular automaton-like learning environments for RL. ☆19 Updated 3 months ago
- Causal Analysis of Agent Behavior for AI Safety ☆18 Updated last year
- ☆22 Updated last year
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format ☆27 Updated last year
- Lightweight wrapper of the official ChatGPT API in your terminal ☆43 Updated 2 years ago
- Implementation of a holodeck, written in Pytorch ☆18 Updated last year
- Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing ☆50 Updated 3 years ago
- AGaLiTe: Approximate Gated Linear Transformers for Online Reinforcement Learning (Published in TMLR) ☆18 Updated 7 months ago
- ☆28 Updated 2 years ago
- General Modules for JAX ☆66 Updated last month
- Implementation of Diffusion Transformers and Rectified Flow in Jax ☆23 Updated 10 months ago
- Jax like function transformation engine but micro, microjax ☆32 Updated 6 months ago
- ☆17 Updated last year
- Codes accompanying the paper "LaProp: a Better Way to Combine Momentum with Adaptive Gradient" ☆28 Updated 4 years ago
- ☆13 Updated 10 months ago
- ☆31 Updated last month
- ☆31 Updated 2 years ago
- A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks ☆33 Updated 6 months ago