facebookresearch / macta
MACTA: A Multi-agent Reinforcement Learning Approach for Cache Timing Attacks and Detection
☆46Updated last year
Alternatives and similar repositories for macta:
Users that are interested in macta are comparing it to the libraries listed below
- Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents"☆31Updated last year
- Implementation of BC-IRL and other IRL baselines☆27Updated last year
- Semi-Supervised Offline Reinforcement Learning with Action-Free Trajectories☆42Updated last year
- AgentHive provides the primitives and helpers for a seamless usage of robohive within TorchRL.☆33Updated last year
- Learn online intrinsic rewards from LLM feedback☆35Updated 3 months ago
- Official Code Repo for the paper "Learning to Play Atari in a World of Tokens" accepted at ICML, 2024☆10Updated 9 months ago
- Code for "Unsupervised Zero-Shot RL via Functional Reward Representations"☆53Updated last year
- Causal Analysis of Agent Behavior for AI Safety☆17Updated last year
- Code for the paper "Understanding RL Vision"☆46Updated 2 years ago
- Generative cellular automaton-like learning environments for RL.☆19Updated 2 months ago
- Open source code for paper "On the Learning and Learnability of Quasimetrics".☆32Updated 2 years ago
- This is code to accompany the paper "Accelerating Exploration with Unlabeled Prior Data".☆22Updated last year
- Minimal code for A Generalist Agent☆39Updated 2 years ago
- This repository is the official implementation of the TRAC optimizer in Fast TRAC: A Parameter-Free Optimizer for Lifelong Reinforcement …☆23Updated 5 months ago
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models☆41Updated 9 months ago
- Repo to reproduce the First-Explore paper results☆37Updated 3 months ago
- This code accompanies the paper "Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration."☆27Updated 5 months ago
- ☆27Updated 9 months ago
- Pytorch implementation of the Gato paper from Deepmind☆13Updated 2 years ago
- ☆31Updated last year
- Code for Discovered Policy Optimisation (NeurIPS 2022)☆9Updated last year
- ☆26Updated 11 months ago
- Galactic Scaling End-to-End Reinforcement Learning for Rearrangement at 100k Steps-Per-Second☆83Updated last year
- ☆34Updated 2 years ago
- Official code for TLDR: Unsupervised Goal-Conditioned RL via Temporal Distance-Aware Representations☆30Updated 5 months ago
- Public code for "Reinforcement Learning from Passive Data via Latent Intentions"☆88Updated last year
- A web based platform for collecting human actions in reinforcement learning environments☆28Updated last year
- Offline RLHF codebase implementation for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human …☆35Updated last year
- VC-FB and MC-FB algorithms from "Zero-Shot Reinforcement Learning from Low Quality Data" (NeurIPS 2024)☆13Updated 2 months ago
- Official code for "World Models via Policy-Guided Trajectory Diffusion", TMLR 2024☆62Updated last year