Baichenjia/Contrastive-UCB

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Baichenjia/Contrastive-UCB)

Baichenjia / Contrastive-UCB

Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning

☆12

Alternatives and similar repositories for Contrastive-UCB

Users that are interested in Contrastive-UCB are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ec2604 / ContraBAR
View on GitHub
☆13May 21, 2023Updated 3 years ago
srnbckr / ebpf-network-emulation
View on GitHub
☆12Aug 12, 2022Updated 3 years ago
Kaixhin / GUDRL
View on GitHub
Generalised UDRL
☆37May 12, 2022Updated 4 years ago
ademiadeniji / irm
View on GitHub
Intrinsic Reward Matching (IRM) implementation (from Adeniji and Xie et al 2022)
☆42Jan 13, 2024Updated 2 years ago
Ethos-lab / ares
View on GitHub
A System-Oriented Wargame Framework for Adversarial ML
☆10Apr 24, 2023Updated 3 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
ramp-kits / rl_simulator
View on GitHub
Model-based reinforcement learning (generative simulator models and planning agents)
☆16Mar 13, 2026Updated 4 months ago
taodav / nsrs
View on GitHub
Code for the paper Novelty Search in Representational Space for Sample Efficient Exploration presented at NeurIPS 2020.
☆14Jul 16, 2024Updated 2 years ago
Sandholm-Lab / ESCHER
View on GitHub
☆16Jul 13, 2022Updated 4 years ago
liuruoze / HierNet-SC2
View on GitHub
(AAAI'2019) The codes, models, logs, and data for an extended paper of the original paper "On Reinforcement Learning for Full-length Game…
☆35Oct 5, 2022Updated 3 years ago
Thinklab-SJTU / BiLAF
View on GitHub
Official implementation of Our NeurIPS 2024 Paper "Boundary Matters: A Bi-Level Active Finetuning Method"
☆14Feb 11, 2025Updated last year
ryylcc / OWSOL
View on GitHub
☆15Feb 18, 2024Updated 2 years ago
Acciorocketships / pymarl2
View on GitHub
Fine-tuned MARL algorithms on SMAC (100% win rates on most scenarios)
☆12Jul 29, 2023Updated 3 years ago
YUE-FAN / SSB
View on GitHub
☆10Mar 24, 2024Updated 2 years ago
anandcu3 / Federated-Learning-for-Remote-Sensing
View on GitHub
Federated Learning Experiments for Remote Sensing image data using convolution neural networks
☆17Aug 5, 2021Updated 4 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
StoneT2000 / trajectorytranslation
View on GitHub
Code for Abstract-to-Executable Trajectory Translation for One Shot Task Generalization (ICML 2023)
☆23May 12, 2023Updated 3 years ago
rrti / maxq
View on GitHub
hierarchical Q-learning implementation
☆11Jun 9, 2015Updated 11 years ago
daniellawson9999 / online-decision-transformer
View on GitHub
An unofficial implementation for online decision transformer
☆41Sep 20, 2022Updated 3 years ago
nerdimite / ntm
View on GitHub
A PyTorch Implementation of Neural Turing Machine
☆14Jul 24, 2020Updated 6 years ago
alexfanjn / GANI
View on GitHub
The relevant codes for "GANI: Global Attacks on Graph Neural Networks via Imperceptible Node Injections".
☆14Mar 21, 2024Updated 2 years ago
fusion-ml / trajectory-information-rl
View on GitHub
Bayesian active RL (BARL) and trajectory information planning (TIP)
☆26Oct 11, 2022Updated 3 years ago
hxt-tg / cimnet
View on GitHub
A pure C++ library for simulations on complex networks. It follow the standard of C++11.
☆14Nov 26, 2022Updated 3 years ago
saper0 / adversarial_training
View on GitHub
Codebase used to generate the results for NeurIPS23 "Adversarial Training for Graph Neural Networks: Pitfalls, Solutions, and New Directi…
☆13Dec 8, 2023Updated 2 years ago
kleinzcy / Cr-KD-NCD
View on GitHub
☆16Nov 15, 2024Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
danijar / crafter-baselines
View on GitHub
Docker containers of baseline agents for the Crafter environment
☆30Dec 14, 2021Updated 4 years ago
Div-Infinity / LISA
View on GitHub
(NeurIPS '22) LISA: Learning Interpretable Skill Abstractions - A framework for unsupervised skill learning using Imitation
☆29Feb 22, 2023Updated 3 years ago
camall3n / markov-state-abstractions
View on GitHub
Image-based gridworld experiment for learning Markov state abstractions
☆20Sep 16, 2024Updated last year
aadharna / UntouchableThunder
View on GitHub
Co-evolution of agents and environments in GVG-AI
☆17Aug 12, 2021Updated 4 years ago
jparkerholder / DvD_ES
View on GitHub
Code from the paper "Effective Diversity in Population Based Reinforcement Learning", presented as a spotlight at NeurIPS 2020. This is t…
☆45Oct 29, 2020Updated 5 years ago
shivamsaboo17 / Policy-Gradient-PyTorch
View on GitHub
Implementation of vanilla stochaistic (categorical) policy gradient algorithm to play cartpole.
☆16Apr 1, 2021Updated 5 years ago
sbelharbi / fcam-wsol
View on GitHub
Pytorch implementation of F-CAM. Paper: "F-CAM: Full Resolution Class Activation Maps via Guided Parametric Upscaling".
☆15Jan 21, 2023Updated 3 years ago
sgvaze / clevr4
View on GitHub
Starter notebook and utilities for the Clevr-4 dataset
☆17Nov 1, 2023Updated 2 years ago
facebookresearch / cascade
View on GitHub
Implementation of CASCADE in Learning General World Models in a Handful of Reward-Free Deployments (NeurIPS 22).
☆30Oct 25, 2022Updated 3 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
ryanxhr / POR
View on GitHub
[NeurIPS 2022 Oral] The official implementation of POR in "A Policy-Guided Imitation Approach for Offline Reinforcement Learning"
☆58Apr 6, 2023Updated 3 years ago
danijar / director
View on GitHub
Deep Hierarchical Planning from Pixels
☆122Dec 21, 2022Updated 3 years ago
RajGhugare19 / alm
View on GitHub
Simplifying Model-based RL: Learning Representations, Latent-space Models and Policies with One Objective
☆82Mar 9, 2023Updated 3 years ago
seoulai / gym
View on GitHub
Seoul AI Gym is a toolkit for developing AI algorithms.
☆31Dec 15, 2018Updated 7 years ago
zjs123 / EvoExplore
View on GitHub
☆17Jul 13, 2022Updated 4 years ago
ruizhaogit / music
View on GitHub
Mutual Information State Intrinsic Control (ICLR 2021 Spotlight)
☆39Mar 1, 2021Updated 5 years ago
pairlab / d2rl
View on GitHub
Code for the paper "D2RL: Deep Dense Architectures for Reinforcement Learning"
☆40Jan 22, 2021Updated 5 years ago