M-CURL: Masked Contrastive Representation Learning for Reinforcement Learning
☆28Nov 5, 2020Updated 5 years ago
Alternatives and similar repositories for m-curl
Users that are interested in m-curl are comparing it to the libraries listed below
Sorting:
- ☆20Feb 26, 2021Updated 5 years ago
- Multi-task Multi-agent Soft Actor Critic for SMAC☆15Jan 18, 2022Updated 4 years ago
- Tailoring Molecules for Protein Pockets: a Transformer-based Generative Solution for Structured-based Drug Design☆20Jul 26, 2023Updated 2 years ago
- MetaLadder: Ascending Mathematical Solution Quality via Analogical-Problem Reasoning Transfer (EMNLP 2025)☆11Apr 18, 2025Updated 10 months ago
- Trust Region Preference Approximation: A simple and stable reinforcement learning algorithm for LLM reasoning☆14Jun 28, 2025Updated 8 months ago
- ☆10Mar 22, 2021Updated 4 years ago
- Extend bert-nmt to context-aware translation.☆11May 24, 2021Updated 4 years ago
- ACL19_Depth_Growing_for_Neural_Machine_Translation☆23Jul 6, 2019Updated 6 years ago
- Multi-objective reinforcement learning for covid-19 control☆12Aug 12, 2021Updated 4 years ago
- ☆10Aug 17, 2022Updated 3 years ago
- Solution of KDD cup 2021☆11Jun 16, 2021Updated 4 years ago
- ☆11Oct 19, 2020Updated 5 years ago
- ☆11Apr 23, 2021Updated 4 years ago
- arxiv-daily☆13Jan 25, 2023Updated 3 years ago
- implementation of our self-guided and self-regularized actor-critic algorithm☆30Jan 1, 2023Updated 3 years ago
- This is the code for GA-DRL-Aubo paper☆14Apr 8, 2022Updated 3 years ago
- V-MPO torch version with DMLab30 and GTrXL☆13Mar 1, 2021Updated 5 years ago
- This package is an implementation of Dexterous Ungrasping, which refers to the task of securely transferring an object from the gripper t…☆12Feb 4, 2022Updated 4 years ago
- ☆14Feb 9, 2023Updated 3 years ago
- Code for paper "Bridging Imagination and Reality for Model-Based Deep Reinforcement Learning".☆14May 23, 2021Updated 4 years ago
- Implementation code for GraphMIX: Graph Convolutional Value Decomposition in Multi-Agent Reinforcement Learning☆36Feb 13, 2021Updated 5 years ago
- Self-Supervised Domain Adaptation with Consistency Training☆19Oct 28, 2020Updated 5 years ago
- ☆15Oct 27, 2020Updated 5 years ago
- Code for the paper "D2RL: Deep Dense Architectures for Reinforcement Learning"☆39Jan 22, 2021Updated 5 years ago
- Pytorch GUI(demo) implementation of CVPR2021 paper and ECCV2020 paper, "Guided Interactive Video Object Segmentation Using Reliability-B…☆18May 3, 2022Updated 3 years ago
- An adaptive training algorithm for residual network☆17Aug 22, 2020Updated 5 years ago
- Code to reproduce Supervised Policy Update (ICLR 2019)☆17Dec 8, 2022Updated 3 years ago
- ☆17Jun 3, 2024Updated last year
- Motion imitation with deep reinforcement learning.☆13Jul 24, 2019Updated 6 years ago
- This repository is an implementation of "MASER: Multi-Agent Reinforcement Learning with Subgoals Generated from Experience Replay Buffer"…☆22Jul 6, 2023Updated 2 years ago
- simple demo codes for Learning to Teach with Dynamic Loss Functions☆17Oct 22, 2019Updated 6 years ago
- Federated learning is a distributed learning method that trains a deep network on user devices without collecting data from central serve…☆14Jul 7, 2020Updated 5 years ago
- "Gaussian RAM: Lightweight Image Classification via Stochastic Retina Inspired Glimpse and Reinforcement Learning" (ICCAS 2020)☆16Jun 7, 2022Updated 3 years ago
- Deep learning implementations (Asynchronous Deep Q-Learning) of multiple Game Theory algorithms for adversarial learning (WoLF-PHC, GIGA-…☆15Sep 19, 2017Updated 8 years ago
- Author's PyTorch Implementation of Deep Homomorphic Policy Gradient (DHPG) - NeurIPS 2022 and JMLR 2024☆24Apr 8, 2024Updated last year
- This repository collects various works that reproduce DeepSeek R1, as well as works related to DeepSeek R1 and the DeepSeek series.☆19Apr 27, 2025Updated 10 months ago
- ☆22May 20, 2021Updated 4 years ago
- ☆24Oct 26, 2021Updated 4 years ago
- Repository containing code for the paper "Meta-Learning with Sparse Experience Replay for Lifelong Language Learning".☆22Jun 12, 2023Updated 2 years ago