ztjhz / t5-jaxLinks

JAX implementation of the T5 model: Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

☆23

Alternatives and similar repositories for t5-jax

Users that are interested in t5-jax are comparing it to the libraries listed below

Sorting:

Sea-Snell / CALM-Dialogue
Official code for the paper "Context-Aware Language Modeling for Goal-Oriented Dialogue Systems"
☆34Updated 2 years ago
vvvm23 / mamba-jax
Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX
☆84Updated last year
andylolu2 / jax-vqvae-gpt
Implementation of VQ-VAE with a GPT-style sampler in the JAX and Haiku ecosystem.
☆12Updated last year
btnorman / First-Explore
Repo to reproduce the First-Explore paper results
☆37Updated 6 months ago
young-geng / mlxu
Machine Learning eXperiment Utilities
☆46Updated 3 weeks ago
CEC-Agent / CEC
Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents"
☆31Updated last year
ml-jku / LRAM
A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks
☆33Updated 8 months ago
microsoft / RLHF-APA
RL algorithm: Advantage induced policy alignment
☆65Updated last year
mansimov / chatgpt_cli
Lightweight wrapper of the official ChatGPT API in your terminal
☆43Updated 2 years ago
abaheti95 / LoL-RL
Advantage Leftover Lunch Reinforcement Learning (A-LoL RL): Improving Language Models with Advantage-based Offline Policy Gradients
☆26Updated 10 months ago
young-geng / tpu_pod_commander
TPU pod commander is a package for managing and launching jobs on Google Cloud TPU pods.
☆20Updated last year
sholtodouglas / scalingExperiments
☆61Updated 3 years ago
CarperAI / treasure_trove
☆22Updated last year
scottlogic-alex / prm800k-denorm
Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format
☆27Updated 2 years ago
radarFudan / mamba-minimal-jax
☆31Updated 7 months ago
andrew-silva / clean-rl-mlx
Clean RL implementation using MLX
☆32Updated last year
andyljones / boardlaw
Scaling scaling laws with board games.
☆49Updated 2 years ago
microsoft / Intrepid
INTeractive learning via REPresentatIon Discovery
☆34Updated last year
FLAIROx / cultural-accumulation
☆13Updated last year
prajjwal1 / rl_paradigm
☆17Updated last year
google-deepmind / dks
Multi-framework implementation of Deep Kernel Shaping and Tailored Activation Transformations, which are methods that modify neural netwo…
☆71Updated 2 weeks ago
smearle / autoverse
Generative cellular automaton-like learning environments for RL.
☆19Updated 5 months ago
google-deepmind / agent_debugger
Causal Analysis of Agent Behavior for AI Safety
☆18Updated 2 years ago
okarthikb / DPO
Implementation of Direct Preference Optimization
☆16Updated 2 years ago
CarperAI / squeakily
A library for squeakily cleaning and filtering language datasets.
☆47Updated 2 years ago
upiterbarg / diff_history
[ICML 2024] Official code release accompanying the paper "diff History for Neural Language Agents" (Piterbarg, Pinto, Fergus)
☆20Updated 11 months ago
carlini / chess-llm
Play chess against large language models.
☆47Updated last year
google-research / precondition
☆31Updated last month
NousResearch / StripedHyenaTrainer
☆61Updated last year
google-deepmind / enn_acme
☆31Updated 2 years ago