ztjhz / t5-jax
JAX implementation of the T5 model: Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
☆20Updated last year
Alternatives and similar repositories for t5-jax:
Users that are interested in t5-jax are comparing it to the libraries listed below
- Machine Learning eXperiment Utilities☆45Updated 7 months ago
- Code for the examples presented in the talk "Training a Llama in your backyard: fine-tuning very large models on consumer hardware" given…☆14Updated last year
- Jax like function transformation engine but micro, microjax☆30Updated 3 months ago
- Repo to reproduce the First-Explore paper results☆37Updated last month
- ☆29Updated last month
- Official code for the paper "Context-Aware Language Modeling for Goal-Oriented Dialogue Systems"☆34Updated 2 years ago
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format☆27Updated last year
- Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents"☆31Updated last year
- Implementation of Direct Preference Optimization☆15Updated last year
- Transformer with Mu-Parameterization, implemented in Jax/Flax. Supports FSDP on TPU pods.☆30Updated last month
- ☆31Updated last year
- TPU pod commander is a package for managing and launching jobs on Google Cloud TPU pods.☆18Updated 7 months ago
- ☆22Updated last year
- JAX implementation of the Mistral 7b v0.2 model☆35Updated 6 months ago
- ☆58Updated 2 years ago
- Engineering the state of RNN language models (Mamba, RWKV, etc.)☆32Updated 8 months ago
- Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers.☆18Updated last week
- Official repository for the paper "Approximating Two-Layer Feedforward Networks for Efficient Transformers"☆36Updated last year
- [ICML 2024] Official code release accompanying the paper "diff History for Neural Language Agents" (Piterbarg, Pinto, Fergus)☆20Updated 5 months ago
- Code for Discovered Policy Optimisation (NeurIPS 2022)☆9Updated last year
- Clean RL implementation using MLX☆28Updated 10 months ago
- ☆30Updated 2 months ago
- Causal Analysis of Agent Behavior for AI Safety☆17Updated last year
- ☆25Updated 4 months ago
- ☆45Updated 10 months ago
- Minimal but scalable implementation of large language models in JAX☆28Updated 2 months ago
- ☆19Updated 7 months ago
- Code for minimum-entropy coupling.☆31Updated 7 months ago
- Codes and files for the paper Are Emergent Abilities in Large Language Models just In-Context Learning☆33Updated 3 weeks ago
- Public Inflection Benchmarks☆69Updated 10 months ago