chongyi-zheng / td_infonceLinks

Implementations of Temporal Difference InfoNCE (TD InfoNCE)

☆30

Alternatives and similar repositories for td_infonce

Users that are interested in td_infonce are comparing it to the libraries listed below

Sorting:

kvfrans / fre
Code for "Unsupervised Zero-Shot RL via Functional Reward Representations"
☆57Updated last year
ahmed-touati / controllable_agent
☆47Updated 2 years ago
facebookresearch / controllable_agent
The Controllable Agent project trains RL Agents able to optimize any reward function specified in real time, without any further learning…
☆66Updated 2 years ago
seohongpark / METRA
METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024)
☆71Updated last year
micahcarroll / uniMASK
Codebase for "Uni[MASK]: Unified Inference in Sequential Decision Problems"
☆56Updated last year
ml-jku / L2M
Learning to Modulate pre-trained Models in RL (Decision Transformer, LoRA, Fine-tuning)
☆59Updated 10 months ago
facebookresearch / gen_dgrl
Official codebase for "The Generalization Gap in Offline Reinforcement Learning" accepted to ICLR 2024
☆28Updated 11 months ago
facebookresearch / mtm
MTM Masked Trajectory Models for Prediction, Representation, and Control.
☆157Updated 2 years ago
seohongpark / PMA
Predictable MDP Abstraction for Unsupervised Model-Based RL (ICML 2023)
☆32Updated 2 years ago
chandar-lab / Recall2Imagine
Recall to Imagine, a model-based RL algorithm with superhuman memory. Oral (1.2%) @ ICLR 2024
☆70Updated last year
rll-research / cic
CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery
☆81Updated 3 years ago
denisyarats / proto
Proto-RL: Reinforcement Learning with Prototypical Representations
☆82Updated 3 years ago
danijar / elements
Building blocks for productive research
☆59Updated 6 months ago
vivekmyers / contrastive_metrics
Code for the paper "Learning Temporal Distances: Contrastive Successor Features Can Provide a Metric Structure for Decision-Making"
☆27Updated last year
scottemmons / rvs
Reinforcement Learning via Supervised Learning
☆71Updated 3 years ago
denisyarats / exorl
ExORL: Exploratory Data for Offline Reinforcement Learning
☆115Updated 3 years ago
danijar / ninjax
General Modules for JAX
☆66Updated 4 months ago
ml-jku / helm
☆54Updated 9 months ago
microsoft / segar
Sandbox environment for generalizable agent research
☆26Updated 2 years ago
mila-iqia / SGI
Official code for "Pretraining Representations For Data-Efficient Reinforcement Learning" (NeurIPS 2021)
☆54Updated 4 years ago
quasimetric-learning / torch-quasimetric
PyTorch Package For Quasimetric Learning
☆42Updated 9 months ago
danijar / director
Deep Hierarchical Planning from Pixels
☆107Updated 2 years ago
young-geng / JaxCQL
Conservative Q learning in Jax
☆54Updated 2 years ago
orybkin / lexa
Discovering and Achieving Goals via World Models, NeurIPS 2021
☆85Updated last year
vwxyzjn / cleanba
CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL
☆114Updated 11 months ago
machelreid / can-wikipedia-help-offline-rl
Official code for "Can Wikipedia Help Offline Reinforcement Learning?" by Machel Reid, Yutaro Yamada and Shixiang Shane Gu
☆105Updated 3 years ago
seohongpark / HILP
Foundation Policies with Hilbert Representations (ICML 2024)
☆90Updated last year
dojeon-ai / SimbaV2
Code for "SimbaV2: Hyperspherical Normalization for Scalable Deep Reinforcement Learning"
☆58Updated 2 months ago
ademiadeniji / irm
Intrinsic Reward Matching (IRM) implementation (from Adeniji and Xie et al 2022)
☆43Updated last year
Div99 / XQL
Extreme Q-Learning: Max Entropy RL without Entropy
☆87Updated 2 years ago