samlobel/CFN

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/samlobel/CFN)

samlobel / CFN

Accompanying Code for "Flipping Coins to Estimate Pseudocounts for Exploration in Reinforcement Learning", ICML 2023

☆25

Alternatives and similar repositories for CFN

Users that are interested in CFN are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

camall3n / onager
View on GitHub
Lightweight python library for launching experiments and tuning hyperparameters, either locally or on a cluster
☆24Sep 29, 2023Updated 2 years ago
facebookresearch / e3b
View on GitHub
Official repo for the E3B algorithm described in the paper "Exploration via Elliptical Episodic Bonuses".
☆87Mar 22, 2024Updated 2 years ago
breez3young / DIMA
View on GitHub
[NIPS'25] Official Implementation of "Revisiting Multi-Agent World Modeling from a Diffusion-Inspired Perspective" in PyTorch.
☆17Nov 11, 2025Updated 8 months ago
lilucse / SparseNetwork4DRL
View on GitHub
[ICML 2025 oral] Network Sparsity Unlocks the Scaling Potential of Deep Reinforcement Learning
☆41Jun 5, 2025Updated last year
epignatelli / navix
View on GitHub
Accelerated minigrid environments with JAX
☆175Oct 20, 2025Updated 9 months ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
tianjunz / NovelD
View on GitHub
☆40Nov 23, 2021Updated 4 years ago
Baichenjia / COPO
View on GitHub
Online Preference Alignment for Language Models via Count-based Exploration
☆21Jan 14, 2025Updated last year
deep-skill-chaining / deep-skill-chaining
View on GitHub
Implementation of the skill discovery algorithm described in ICLR submission "Option Discovery using Deep Skill Chaining"
☆30Sep 24, 2019Updated 6 years ago
mttga / purejaxql
View on GitHub
Simple single-file baselines for Q-Learning in pure-GPU setting
☆242Nov 24, 2025Updated 7 months ago
htdt / lwm
View on GitHub
Latent World Models For Intrinsically Motivated Exploration | Official repository
☆23Apr 28, 2021Updated 5 years ago
frt03 / jax_dt
View on GitHub
Minimal Decision Transformer Implementation written in Jax (Flax).
☆18Aug 8, 2022Updated 3 years ago
ziyadsheeba / qfat
View on GitHub
[NeurIPS 2025, Spotlight] An official implementation of the paper Quantization-Free Autoregressive Action Transformer
☆11Mar 3, 2026Updated 4 months ago
danijar / ninjax
View on GitHub
General Modules for JAX
☆74Apr 7, 2026Updated 3 months ago
51616 / marl-lipo
View on GitHub
Official codebase for Generating Diverse Cooperative Agents by Learning Incompatible Policies (notable-top-25% @ ICLR 2023)
☆19May 10, 2024Updated 2 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
tedmoskovitz / TOP
View on GitHub
Implementation of Tactical Optimistic and Pessimistic value estimation
☆25Jul 18, 2023Updated 3 years ago
eilab-gt / NovGrid
View on GitHub
Novelty MiniGrid--NovGrid--is an extension of MiniGrid environment that allows for the world properties and dynamics to change according …
☆34May 21, 2024Updated 2 years ago
google-deepmind / pushworld
View on GitHub
PushWorld: A benchmark for manipulation planning with tools and movable obstacles
☆95May 5, 2026Updated 2 months ago
scascin0 / alphazero
View on GitHub
A working AlphaZero implementation that's simple enough to be able to understand what's going on at a quick glance, without sacrificing t…
☆14Mar 23, 2023Updated 3 years ago
Reytuag / transformerXL_PPO_JAX
View on GitHub
☆96Feb 16, 2026Updated 5 months ago
marcharper / pyed
View on GitHub
Computes trajectories for evolutionary dynamics.
☆15Oct 6, 2020Updated 5 years ago
taodav / pobax
View on GitHub
Partially Observable Benchmarks in JAX
☆25Apr 30, 2026Updated 2 months ago
remosasso / PSDRL
View on GitHub
Code for Posterior Sampling for Deep Reinforcement Learning, ICML 2023
☆28Mar 7, 2024Updated 2 years ago
RLE-Foundation / Plasticine
View on GitHub
Accelerating Research in Plasticity-Motivated Deep Reinforcement Learning.
☆44Feb 9, 2026Updated 5 months ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
sail-sg / ContinualBench
View on GitHub
☆25May 20, 2025Updated last year
twni2016 / Memory-RL
View on GitHub
When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)
☆73Apr 26, 2026Updated 2 months ago
coallaoh / OpenReviewAC
View on GitHub
☆37Nov 14, 2025Updated 8 months ago
NoSavedDATA / PyTorch-BBF-Bigger-Better-Faster-Atari-100k
View on GitHub
☆17Nov 18, 2024Updated last year
luchris429 / popjaxrl
View on GitHub
Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)]
☆116Dec 5, 2023Updated 2 years ago
philipjball / TD3_PyTorch
View on GitHub
♊ Minimal PyTorch Twin Delayed DDPG (TD3) implementation
☆10Jun 20, 2021Updated 5 years ago
roger-creus / Wave-Defense-Learning-Environment
View on GitHub
A videogame made with PyGame turned into an Open AI Gym Learning Environment for Reinforcement Learning agents.
☆14Jan 3, 2023Updated 3 years ago
twitter-research / hyperbolic-rl
View on GitHub
☆60Sep 22, 2022Updated 3 years ago
HiddenBeginner / Deep-Reinforcement-Learnings
View on GitHub
심층강화학습 책 https://hiddenbeginner.github.io/Deep-Reinforcement-Learnings
☆11May 10, 2024Updated 2 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
hwang-ua / inac_pytorch
View on GitHub
☆20Jun 25, 2023Updated 3 years ago
hafezgh / seq-jepa
View on GitHub
seq-JEPA: Autoregressive Predictive Learning of Invariant-Equivariant World Models
☆16Jan 26, 2026Updated 5 months ago
facebookresearch / impact-driven-exploration
View on GitHub
impact-driven-exploration
☆136Oct 3, 2023Updated 2 years ago
oxwhirl / opiq
View on GitHub
Code for Optimistic Exploration even with a Pessimistic Initialisation
☆14Aug 4, 2020Updated 5 years ago
yayayacc / TIDE
View on GitHub
☆18Feb 4, 2026Updated 5 months ago
AZMCode / git-credential-bw
View on GitHub
Rewrite of git-credential-bw-shell in Typescript.
☆12Apr 16, 2026Updated 3 months ago
StoneT2000 / robojax
View on GitHub
A high-performance reinforcement learning library in jax specialized for robotic learning
☆22Sep 4, 2023Updated 2 years ago