tianjunz/NovelD

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/tianjunz/NovelD)

tianjunz / NovelD

☆40

Alternatives and similar repositories for NovelD

Users that are interested in NovelD are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

tedmoskovitz / TOP
View on GitHub
Implementation of Tactical Optimistic and Pessimistic value estimation
☆25Jul 18, 2023Updated 3 years ago
swan-utokyo / deir
View on GitHub
DEIR: Efficient and Robust Exploration through Discriminative-Model-Based Episodic Intrinsic Rewards
☆26May 6, 2024Updated 2 years ago
sparisi / cbet
View on GitHub
Change-Based Exploration Transfer
☆35Apr 24, 2022Updated 4 years ago
facebookresearch / impact-driven-exploration
View on GitHub
impact-driven-exploration
☆136Oct 3, 2023Updated 2 years ago
tianjunz / MADE
View on GitHub
☆19Jul 18, 2021Updated 5 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
facebookresearch / e3b
View on GitHub
Official repo for the E3B algorithm described in the paper "Exploration via Elliptical Episodic Bonuses".
☆87Mar 22, 2024Updated 2 years ago
kingdy2002 / VCSE
View on GitHub
☆18Jun 8, 2023Updated 3 years ago
yfletberliac / adversarially-guided-actor-critic
View on GitHub
AGAC: Adversarially Guided Actor-Critic
☆47Sep 16, 2021Updated 4 years ago
snu-mllab / EMI
View on GitHub
Implementation for ICML 2019 paper, EMI: Exploration with Mutual Information.
☆37Dec 7, 2020Updated 5 years ago
Stanford-ILIAD / ELLA
View on GitHub
Reward shaping approach for instruction following settings, leveraging language at multiple levels of abstraction.
☆21Mar 9, 2021Updated 5 years ago
rraileanu / auto-drac
View on GitHub
Automatic Data-Regularized Actor-Critic (Auto-DrAC)
☆104Mar 24, 2023Updated 3 years ago
mcmachado / count_based_exploration_sr
View on GitHub
☆31Jul 1, 2019Updated 7 years ago
sfujim / SR-DICE
View on GitHub
Author's PyTorch implementation of SR-DICE for marginalized importance sampling
☆28Dec 7, 2021Updated 4 years ago
YuhangSong / DEHRL
View on GitHub
Diversity−Driven Extensible Hierarchical Reinforcement Learning. AAAI 2019.
☆49Feb 23, 2019Updated 7 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
microsoft / segar
View on GitHub
Sandbox environment for generalizable agent research
☆27Aug 19, 2022Updated 3 years ago
htdt / lwm
View on GitHub
Latent World Models For Intrinsically Motivated Exploration | Official repository
☆23Apr 28, 2021Updated 5 years ago
ml-jku / helm
View on GitHub
☆57Nov 5, 2024Updated last year
princeton-nlp / XTX
View on GitHub
[ICLR 2022 Spotlight] Multi-Stage Episodic Control for Strategic Exploration in Text Games
☆15Feb 8, 2026Updated 5 months ago
facebookresearch / adversarially-motivated-intrinsic-goals
View on GitHub
This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals".
☆65Sep 6, 2023Updated 2 years ago
facebookresearch / CollaQ
View on GitHub
A code implementation for our arXiv paper "Multi-agent Adhoc Team Play using Decompositional Q function"
☆132Aug 14, 2023Updated 2 years ago
holarissun / PCHID_code
View on GitHub
Code for [NeurIPS'2019 Spotlight] Policy Continuation with Hindsight Inverse Dynamics
☆15Jan 7, 2020Updated 6 years ago
lektor-lol / lektor-lol.github.io
View on GitHub
a minimal website to get the diff of llm rewrites
☆11Dec 11, 2024Updated last year
sahandrez / homomorphic_policy_gradient
View on GitHub
Author's PyTorch Implementation of Deep Homomorphic Policy Gradient (DHPG) - NeurIPS 2022 and JMLR 2024
☆24Apr 8, 2024Updated 2 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
tohid-yousefi / Meta-Heuristics
View on GitHub
In this section, I share the Meta-Heuristic algorithm codes that I wrote myself
☆13Apr 6, 2023Updated 3 years ago
denisyarats / proto
View on GitHub
Proto-RL: Reinforcement Learning with Prototypical Representations
☆87Jun 12, 2022Updated 4 years ago
allenai / robustnav
View on GitHub
Evaluating pre-trained navigation agents under corruptions
☆31Sep 7, 2021Updated 4 years ago
yifan12wu / rl-laplacian
View on GitHub
Learning Laplacian Representations in Reinforcement Learning
☆18Jan 2, 2021Updated 5 years ago
younggyoseo / RE3
View on GitHub
RE3: State Entropy Maximization with Random Encoders for Efficient Exploration
☆69Jul 29, 2021Updated 4 years ago
hwang-ua / inac_pytorch
View on GitHub
☆20Jun 25, 2023Updated 3 years ago
paulorauber / hpg
View on GitHub
Hindsight policy gradients
☆46Jan 31, 2020Updated 6 years ago
tigerneil / reinforcementlearning.today
View on GitHub
Made for a reading group at the Center for Safe AGI.
☆12Feb 23, 2026Updated 5 months ago
automl / hypersweeper
View on GitHub
Hydra sweeper integration of our favorite optimization packages, utilizing ask-and-tell interfaces.
☆16Nov 14, 2025Updated 8 months ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
luwei-fu / Nomad-Algorithm
View on GitHub
A testing platform for intelligent optimization algorithm based on Matlab with CEC2013 benchmark
☆12Jan 26, 2021Updated 5 years ago
zhaoyi11 / tcrl
View on GitHub
☆26Jan 26, 2024Updated 2 years ago
MZhouke / RL-Scheduling
View on GitHub
Code base for publication: Reinforcement Learning Approach for Multi-Agent Flexible Scheduling Problems
☆10Feb 1, 2023Updated 3 years ago
4rChon / NL-FuN
View on GitHub
N-Layered FeUdal Networks based on FeUdal Networks adapted to suit PySC2 observations
☆19Sep 17, 2019Updated 6 years ago
shihui2010 / symbolic_simplifier
View on GitHub
PyTorch implementation for the Deep Symbolic Simplification Without Human Knowledge
☆14Feb 25, 2021Updated 5 years ago
Baichenjia / PBRL
View on GitHub
Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning
☆29Feb 21, 2022Updated 4 years ago
ShibiHe / Model-Free-Episodic-Control
View on GitHub
This is the implementation of paper Model Free Episodic Control
☆34Sep 30, 2019Updated 6 years ago