facebookresearch / macta
MACTA: A Multi-agent Reinforcement Learning Approach for Cache Timing Attacks and Detection
☆46Updated 2 years ago
Alternatives and similar repositories for macta
Users that are interested in macta are comparing it to the libraries listed below
Sorting:
- Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents"☆31Updated last year
- Semi-Supervised Offline Reinforcement Learning with Action-Free Trajectories☆42Updated last year
- This code accompanies the paper "Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration."☆28Updated 6 months ago
- Implementation of BC-IRL and other IRL baselines☆28Updated last year
- AgentHive provides the primitives and helpers for a seamless usage of robohive within TorchRL.☆33Updated last year
- Open source code for paper "On the Learning and Learnability of Quasimetrics".☆32Updated 2 years ago
- Causal Analysis of Agent Behavior for AI Safety☆18Updated last year
- This repository is the official implementation of the TRAC optimizer in Fast TRAC: A Parameter-Free Optimizer for Lifelong Reinforcement …☆25Updated 2 weeks ago
- Implemenation of the HIERarchical imagionation On Structured State Space Sequence Models (HIEROS) paper☆16Updated 10 months ago
- Official Code Repo for the paper "Learning to Play Atari in a World of Tokens" accepted at ICML, 2024☆11Updated 11 months ago
- Learn online intrinsic rewards from LLM feedback☆37Updated 5 months ago
- This is code to accompany the paper "Accelerating Exploration with Unlabeled Prior Data".☆22Updated last year
- Offline RLHF codebase implementation for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human …☆37Updated last year
- Repo to reproduce the First-Explore paper results☆37Updated 4 months ago
- ☆28Updated 10 months ago
- Advantage Leftover Lunch Reinforcement Learning (A-LoL RL): Improving Language Models with Advantage-based Offline Policy Gradients☆26Updated 8 months ago
- Galactic Scaling End-to-End Reinforcement Learning for Rearrangement at 100k Steps-Per-Second☆84Updated last year
- Pytorch Implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observ…☆27Updated 2 years ago
- ☆34Updated 2 years ago
- ☆14Updated last year
- PyTorch implementation for "Discovery of Incremental Skills" (DISk) algorithm from ICLR 2022 paper "One After Another: Learning Increment…☆19Updated 3 years ago
- Code for "Unsupervised Zero-Shot RL via Functional Reward Representations"☆55Updated last year
- Official code for TLDR: Unsupervised Goal-Conditioned RL via Temporal Distance-Aware Representations☆31Updated 7 months ago
- Official code for "World Models via Policy-Guided Trajectory Diffusion", TMLR 2024☆62Updated last year
- ☆33Updated last year
- Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models☆57Updated 2 months ago
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models☆41Updated 11 months ago
- MTM Masked Trajectory Models for Prediction, Representation, and Control.☆152Updated 2 years ago
- ☆43Updated 9 months ago
- Recall to Imagine, a model-based RL algorithm with superhuman memory. Oral (1.2%) @ ICLR 2024☆66Updated 11 months ago