facebookresearch / taskmet
TaskMet: Task-driven Metric Learning for Model Learning
☆20Updated last year
Alternatives and similar repositories for taskmet
Users who are interested in taskmet are comparing it to the repositories listed below.
- Neural Fixed-Point Acceleration for Convex Optimization☆29Updated 3 years ago
- ☆18Updated 4 years ago
- ☆19Updated 4 years ago
- Semi-Supervised Offline Reinforcement Learning with Action-Free Trajectories☆42Updated 2 years ago
- Generalised UDRL☆37Updated 3 years ago
- [AutoML'22] Bayesian Generational Population-based Training (BG-PBT)☆29Updated 3 years ago
- Code for the Distill article "Understanding RL Vision"☆25Updated 2 years ago
- Variational Reinforcement Learning☆16Updated last year
- Decentralized Reinforcement Learning: Global Decision-Making via Local Economic Transactions (ICML 2020)☆43Updated 3 years ago
- Repo for the paper "Landscape Surrogate: Learning Decision Losses for Mathematical Optimization Under Partial Information"☆38Updated 2 years ago
- This is the pytorch implementation of the UAI2023 paper "A Trajectory is Worth Three Sentences: Multimodal Transformer for Offline Reinf…☆11Updated 2 years ago
- ☆13Updated last year
- Supplementary Data for Evolving Reinforcement Learning Algorithms☆47Updated 4 years ago
- Repo for the ICML'23 paper "SurCo: Learning Linear Surrogates For Combinatorial Nonlinear Optimization Problems"☆19Updated 2 years ago
- 🤖 Reinforcement Learning paper summaries, notebooks, and articles.☆26Updated 5 years ago
- GBRL-based Actor-Critic algorithms implemented in stable-baselines3☆41Updated this week
- ☆16Updated 3 years ago
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a…☆23Updated 5 years ago
- Representation Learning in RL☆13Updated 3 years ago
- Automatic Integration for Neural Spatio-Temporal Point Process models (AI-STPP) is a new paradigm for exact, efficient, non-parametric inf…☆25Updated last year
- Repo to reproduce the First-Explore paper results☆38Updated 11 months ago
- ☆29Updated 3 years ago
- Implementations of Curious Replay for model-based adaptation.☆43Updated 2 years ago
- INTeractive learning via REPresentatIon Discovery☆37Updated last year
- A2C is a special case of PPO!☆22Updated 3 years ago
- An adaptive training algorithm for residual networks☆17Updated 5 years ago
- ☆10Updated 2 years ago
- Open source code for paper "On the Learning and Learnability of Quasimetrics".☆32Updated 3 years ago
- Official codebase for Improving Computational Efficiency in Visual Reinforcement Learning via Stored Embeddings.☆21Updated 4 years ago
- Source code of "Grid-to-Graph: Flexible Spatial Relational Inductive Biases for Reinforcement Learning" (AAMAS 2021).☆28Updated 4 years ago