brentyi/transformer-exercises-jax

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/brentyi/transformer-exercises-jax)

brentyi / transformer-exercises-jax

☆18

Alternatives and similar repositories for transformer-exercises-jax

Users that are interested in transformer-exercises-jax are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

yuqingd / cusp
View on GitHub
☆15Sep 7, 2022Updated 3 years ago
vikashplus / unitree_sim
View on GitHub
MuJoCo models for Unitree Robots
☆12Nov 24, 2021Updated 4 years ago
keraJLi / synthetic-gymnax
View on GitHub
Drop-in environment replacements that make your RL algorithm train faster.
☆22Jun 19, 2024Updated 2 years ago
bmazoure / ppo_jax
View on GitHub
Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…
☆62Aug 4, 2022Updated 3 years ago
henry-prior / multimodal-rl
View on GitHub
Solving reinforcement learning tasks which require language and vision
☆33Apr 4, 2023Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
Boyiliee / ITP-BobaRobot
View on GitHub
Code for "Interactive Task Planning with Language Models"
☆34Jan 12, 2026Updated 6 months ago
maxreciprocate / offline
View on GitHub
Offline RL experiments
☆15Oct 1, 2022Updated 3 years ago
ethanluoyc / corax
View on GitHub
Corax: Core RL in JAX
☆41Feb 22, 2024Updated 2 years ago
kevinzakka / nanorl
View on GitHub
A tiny reinforcement learning codebase for continuous control, built on top of JAX.
☆15Mar 28, 2023Updated 3 years ago
ToruOwO / mimex
View on GitHub
MIMEx: Intrinsic Rewards from Masked Input Modeling [NeurIPS 2023]
☆16May 17, 2023Updated 3 years ago
young-geng / JaxCQL
View on GitHub
Conservative Q learning in Jax
☆58Feb 7, 2023Updated 3 years ago
ikostrikov / jaxrl2
View on GitHub
☆58Jan 20, 2023Updated 3 years ago
young-geng / SimpleSAC
View on GitHub
A simple and easy to use implementation of the soft actor-critic algorithm.
☆15Sep 2, 2022Updated 3 years ago
brentyi / jax-ekf
View on GitHub
Generic EKF, with support for non-Euclidean manifolds
☆25Apr 6, 2022Updated 4 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Ending2015a / unstable_baselines
View on GitHub
A TF2.0 implementation of RL baselines.
☆10Sep 24, 2021Updated 4 years ago
TrentBrick / RewardConditionedUDRL
View on GitHub
Open source code combining implementations of Upside Down Reinforcement Learning and Reward Conditioned Policies
☆19Mar 10, 2021Updated 5 years ago
henry-prior / jax-rl
View on GitHub
JAX implementations of core Deep RL algorithms
☆84May 2, 2022Updated 4 years ago
rraileanu / idaac
View on GitHub
☆55Feb 28, 2024Updated 2 years ago
jeffdonahue / CS280MiniPlaces
View on GitHub
Homework 3 for Berkeley CS 280: our version of the MIT Mini Places challenge
☆12Mar 5, 2016Updated 10 years ago
pickxiguapi / Clean-Offline-RLHF
View on GitHub
Offline RLHF codebase implementation for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human …
☆42Mar 26, 2024Updated 2 years ago
davidbrandfonbrener / imitation_pretraining
View on GitHub
☆20May 30, 2023Updated 3 years ago
facebookresearch / mtm
View on GitHub
MTM Masked Trajectory Models for Prediction, Representation, and Control.
☆166Dec 16, 2025Updated 7 months ago
LAMDA-RL / PRDC
View on GitHub
Author's PyTorch implementation of ICML'23 paper "Policy Regularization with Dataset Constraint for Offline Reinforcement Learning" for D…
☆18Nov 8, 2024Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
Lifelong-ML / offline-compositional-rl-datasets
View on GitHub
☆21Mar 19, 2024Updated 2 years ago
clvrai / create
View on GitHub
CREATE Environment for long-horizon physics-puzzle tasks with diverse tools
☆18Nov 22, 2022Updated 3 years ago
ethanluoyc / magi
View on GitHub
Reinforcement learning library in JAX.
☆102Oct 22, 2023Updated 2 years ago
chungmin99 / jaxmp
View on GitHub
New version: https://github.com/chungmin99/pyroki
☆20Apr 13, 2025Updated last year
kevinzakka / dm_env_wrappers
View on GitHub
Standalone library of frequently-used wrappers for dm_env environments.
☆19Jul 9, 2024Updated 2 years ago
jkulhanek / gym-deepmindlab-env
View on GitHub
Gym implementation of connector to Deepmind lab
☆12Mar 26, 2019Updated 7 years ago
brentyi / tilted
View on GitHub
Canonical Factors for Hybrid Neural Fields @ ICCV 2023
☆107Jan 21, 2025Updated last year
Egiob / DiversityIsAllYouNeed-SB3
View on GitHub
Implementation of Diversity Is All You Need (DIAYN) on top of Stable Baselines 3.
☆13Jul 11, 2022Updated 4 years ago
danijar / elements
View on GitHub
Building blocks for productive research
☆72Mar 26, 2026Updated 3 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
frt03 / jax_dt
View on GitHub
Minimal Decision Transformer Implementation written in Jax (Flax).
☆18Aug 8, 2022Updated 3 years ago
frost-beta / train-model-with-js
View on GitHub
Train text generation model with JavaScript.
☆15Jul 14, 2024Updated 2 years ago
yiyixuxu / denoising-diffusion-flax
View on GitHub
Implementing the Denoising Diffusion Probabilistic Model in Flax
☆161Nov 1, 2022Updated 3 years ago
nissymori / remax-rl
View on GitHub
[ICML2026] Official JAX code for Emergence of Exploration in Policy Gradient Reinforcement Learning via Retrying
☆15Jul 3, 2026Updated 2 weeks ago
wangshusen / PyRLA
View on GitHub
Randomized Linear Algebra in Python
☆13Mar 21, 2017Updated 9 years ago
smonsays / metax
View on GitHub
flexible meta-learning in jax
☆16Oct 19, 2023Updated 2 years ago
ikostrikov / dmcgym
View on GitHub
☆23Aug 19, 2022Updated 3 years ago