facebookresearch / macta
MACTA: A Multi-agent Reinforcement Learning Approach for Cache Timing Attacks and Detection
☆45 Updated last year
Related projects
Alternatives and complementary repositories for macta
- Implementation of BC-IRL and other IRL baselines ☆25 Updated last year
- Causal Analysis of Agent Behavior for AI Safety ☆17 Updated last year
- Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents" ☆31 Updated last year
- Semi-Supervised Offline Reinforcement Learning with Action-Free Trajectories ☆41 Updated last year
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models ☆37 Updated 5 months ago
- ☆22 Updated 4 months ago
- AgentHive provides the primitives and helpers for seamless use of robohive within TorchRL. ☆32 Updated 10 months ago
- This code accompanies the paper "Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration." ☆23 Updated 3 weeks ago
- Efficient World Models with Context-Aware Tokenization. ICML 2024 ☆84 Updated 2 months ago
- Code for "Unsupervised Zero-Shot RL via Functional Reward Representations" ☆52 Updated 7 months ago
- ☆16 Updated last month
- ☆34 Updated last year
- ☆17 Updated 5 months ago
- ☆15 Updated last month
- Code repository for the paper "LLM-Coordination: Evaluating and Analyzing Multi-agent Coordination Abilities in Large Language Models" ☆23 Updated last month
- Official code for "Reward-Free Curricula for Training Robust World Models", ICLR 2024. ☆26 Updated 9 months ago
- A vast array of Multi-Modal Embodied Robotic Foundation Models! ☆24 Updated 8 months ago
- MTM: Masked Trajectory Models for Prediction, Representation, and Control. ☆149 Updated last year
- Repo to reproduce the First-Explore paper results ☆36 Updated 2 weeks ago
- Code for the paper "Understanding RL Vision" ☆43 Updated last year
- Galactic: Scaling End-to-End Reinforcement Learning for Rearrangement at 100k Steps-Per-Second ☆82 Updated last year
- ☆36 Updated 4 months ago
- A web based platform for collecting human actions in reinforcement learning environments ☆27 Updated last year
- Advantage Leftover Lunch Reinforcement Learning (A-LoL RL): Improving Language Models with Advantage-based Offline Policy Gradients ☆26 Updated 2 months ago
- This is code to accompany the paper "Accelerating Exploration with Unlabeled Prior Data". ☆20 Updated 11 months ago
- ☆25 Updated 2 months ago
- OMNI: Open-endedness via Models of human Notions of Interestingness ☆39 Updated 11 months ago
- ☆68 Updated 2 months ago
- ☆39 Updated 10 months ago