machelreid/can-wikipedia-help-offline-rl

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/machelreid/can-wikipedia-help-offline-rl)

machelreid / can-wikipedia-help-offline-rl

Official code for "Can Wikipedia Help Offline Reinforcement Learning?" by Machel Reid, Yutaro Yamada and Shixiang Shane Gu

☆105

Alternatives and similar repositories for can-wikipedia-help-offline-rl

Users that are interested in can-wikipedia-help-offline-rl are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

mila-iqia / SGI
View on GitHub
Official code for "Pretraining Representations For Data-Efficient Reinforcement Learning" (NeurIPS 2021)
☆56Jul 27, 2021Updated 5 years ago
htdt / lwm
View on GitHub
Latent World Models For Intrinsically Motivated Exploration | Official repository
☆23Apr 28, 2021Updated 5 years ago
daniellawson9999 / online-decision-transformer
View on GitHub
An unofficial implementation for online decision transformer
☆41Sep 20, 2022Updated 3 years ago
jon--lee / decision-pretrained-transformer
View on GitHub
Implemention of the Decision-Pretrained Transformer (DPT) from the paper Supervised Pretraining Can Learn In-Context Reinforcement Learni…
☆79May 28, 2024Updated 2 years ago
google-deepmind / dm_fast_mapping
View on GitHub
☆55Oct 28, 2021Updated 4 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
LAMDA-RL / ACT
View on GitHub
Official code for ACT: Empowering Decision Transformer with Dynamic Programming via Advantage Conditioning (AAAI'24)
☆17Feb 10, 2024Updated 2 years ago
mxu34 / prompt-dt
View on GitHub
Official code repository for Prompt-DT.
☆123Aug 3, 2022Updated 3 years ago
dunnolab / laom
View on GitHub
Official implementation of "Latent Action Learning Requires Supervision in the Presence of Distractors", ICML 2025
☆39Jul 8, 2025Updated last year
suraj-nair-1 / lorel
View on GitHub
☆38Mar 10, 2022Updated 4 years ago
ShuangLI59 / Pre-Trained-Language-Models-for-Interactive-Decision-Making
View on GitHub
Pre-Trained Language Models for Interactive Decision-Making [NeurIPS 2022]
☆131Jun 8, 2022Updated 4 years ago
tinkoff-ai / cnf
View on GitHub
Official implementation for "Let Offline RL Flow: Training Conservative Agents in the Latent Space of Normalizing Flows", NeurIPS 2022, O…
☆12Jan 31, 2023Updated 3 years ago
ahjwang / messenger-emma
View on GitHub
Implements the Messenger environment and EMMA model.
☆25Jun 14, 2023Updated 3 years ago
ml-jku / helm
View on GitHub
☆57Nov 5, 2024Updated last year
RyanNavillus / reward-surfaces
View on GitHub
☆19Apr 22, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
aliang8 / varibad_jax
View on GitHub
☆10Jun 27, 2024Updated 2 years ago
acmi-lab / pretraining-with-nonsense
View on GitHub
Pretraining summarization models using a corpus of nonsense
☆13Sep 28, 2021Updated 4 years ago
tinkoff-ai / lb-sac
View on GitHub
Official implementation for "Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch Size", NeurIPS 2022, Offline RL Worksho…
☆21Feb 27, 2023Updated 3 years ago
RajGhugare19 / stitching-is-combinatorial-generalisation
View on GitHub
[ICLR 2024] Closing the Gap between TD Learning and Supervised Learning - A Generalisation Point of View.
☆25Apr 19, 2024Updated 2 years ago
amazon-science / embert
View on GitHub
Code for EmBERT, a transformer model for embodied, language-guided visual task completion.
☆60Apr 10, 2024Updated 2 years ago
zhxieml / PDT
View on GitHub
Implementation of ICML 2023 paper: Future-conditioned Unsupervised Pretraining for Decision Transformer
☆29Jul 25, 2023Updated 3 years ago
srzer / LaMo-2023
View on GitHub
Official code for "Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning".
☆53Apr 11, 2024Updated 2 years ago
dunnolab / harmony
View on GitHub
[ICML 2026 GenBio Workshop] Official Implementation for "Harmonic Torsional Diffusion for Protein-Ligand Flexible Docking"
☆15Jun 30, 2026Updated 3 weeks ago
microsoft / smart
View on GitHub
Codebase for ICLR 2023 paper, "SMART: Self-supervised Multi-task pretrAining with contRol Transformers"
☆54Jan 26, 2024Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
keynans / HypeRL
View on GitHub
Authors' PyTorch implementation of 'Recomposing the Reinforcement Learning Building-Blocks with Hypernetworks' (HypeRL)
☆26Jun 9, 2021Updated 5 years ago
mserranunes / action-inference-for-video-prediction-benchmarking
View on GitHub
Evaluating video predictions from the standpoint of a robot making action decisions
☆13May 28, 2020Updated 6 years ago
hychen-naza / LEAP
View on GitHub
☆17Sep 28, 2023Updated 2 years ago
google-research / deep_ope
View on GitHub
☆88Jul 30, 2024Updated last year
flowersteam / playground_env
View on GitHub
Implementation of the Playground environment from the paper Language as a Cognitive Tool to Imagine Goals inCuriosity-Driven Exploration.
☆11Mar 5, 2021Updated 5 years ago
corl-team / ad-eps
View on GitHub
Official Implementation for "In-Context Reinforcement Learning from Noise Distillation"
☆35Sep 18, 2024Updated last year
dunnolab / vintix
View on GitHub
Vintix: Action Model via In-Context Reinforcement Learning - - — ICML 2025
☆51May 23, 2025Updated last year
frt03 / generalized_dt
View on GitHub
Generalized Decision Transformer for Offline Hindsight Information Matching (ICLR2022)
☆70Aug 8, 2022Updated 3 years ago
Howuhh / faster-trajectory-transformer
View on GitHub
Implementation of Trajectory Transformer with attention caching and batched beam search
☆118Apr 27, 2023Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
andravin / algebranets
View on GitHub
Unofficial Experiments with AlgebraNets
☆17Jun 17, 2020Updated 6 years ago
denisyarats / proto
View on GitHub
Proto-RL: Reinforcement Learning with Prototypical Representations
☆88Jun 12, 2022Updated 4 years ago
ademiadeniji / lamp
View on GitHub
☆47Jan 29, 2024Updated 2 years ago
rll-research / teachable
View on GitHub
☆17Oct 12, 2023Updated 2 years ago
kzl / decision-transformer
View on GitHub
Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.
☆2,823Apr 29, 2024Updated 2 years ago
webstorms / Blocks
View on GitHub
A new model for quickly training and simulating adaptive leaky integrate-and-fire spiking neural networks.
☆14Apr 9, 2024Updated 2 years ago
paulorauber / hpg
View on GitHub
Hindsight policy gradients
☆46Jan 31, 2020Updated 6 years ago