Official implementation of the NeurIPS 2023 paper "Discovering General Reinforcement Learning Algorithms with Adversarial Environment Design"
☆35Jun 28, 2024Updated last year
Alternatives and similar repositories for groove
Users that are interested in groove are comparing it to the libraries listed below
- Code of the paper: Debiasing Meta-Gradient Reinforcement Learning by Learning the Outer Value Function☆13Nov 22, 2022Updated 3 years ago
- VC-FB and MC-FB algorithms from "Zero-Shot Reinforcement Learning from Low Quality Data" (NeurIPS 2024)☆25Jan 14, 2025Updated last year
- Official codebase for "Sampling For Learnability", published at NeurIPS 2024☆20Oct 21, 2025Updated 4 months ago
- Official implementation of *A Unified Hard-Constraint Framework for Solving Geometrically Complex PDEs*☆18Mar 4, 2023Updated 2 years ago
- 🐝 SwarmBench: Benchmarking LLMs' Swarm Intelligence☆29May 21, 2025Updated 9 months ago
- Implementing Controlled Monte Carlo Diffusions (ICLR 2024)☆17Sep 30, 2024Updated last year
- PyTorch implementation of Learning Latent Dynamic Robust Representations for World Models☆24May 11, 2024Updated last year
- Biased matrix factorisation using TensorFlow☆19Jun 30, 2016Updated 9 years ago
- ☆92Jan 21, 2026Updated last month
- Official Implementation of "Can Learned Optimization Make Reinforcement Learning Less Difficult"☆30Dec 15, 2025Updated 2 months ago
- ☆58Sep 23, 2024Updated last year
- Ranking Policy Gradient☆23Nov 27, 2019Updated 6 years ago
- Official implementation of the δ-model presented in the ICML 2024 paper "A Distributional Analogue to the Successor Representation".☆25Nov 8, 2024Updated last year
- ☆28Dec 29, 2023Updated 2 years ago
- Code for "Counterfactual Token Generation in Large Language Models", arXiv 2024.☆32Nov 7, 2024Updated last year
- OMNI-EPIC: Open-endedness via Models of human Notions of Interestingness with Environments Programmed in Code (ICLR 2025).☆73Dec 26, 2024Updated last year
- A Jax/Stax implementation of the general meta learning paper: Oh, J., Hessel, M., Czarnecki, W.M., Xu, Z., van Hasselt, H.P., Singh, S. a…☆23Dec 22, 2020Updated 5 years ago
- Reinforcement Learning inside a 3D soccer simulation☆37Sep 15, 2024Updated last year
- (Crafter + NetHack) in JAX. ICML 2024 Spotlight.☆374Feb 10, 2026Updated 3 weeks ago
- General-Sum variant of the game Diplomacy for evaluating AIs.☆34Apr 2, 2024Updated last year
- Unified Implementations of Offline Reinforcement Learning Algorithms☆199Dec 19, 2025Updated 2 months ago
- Target-driven visual navigation using deep reinforcement learning, implemented in PyTorch☆31Jun 22, 2023Updated 2 years ago
- Simple single-file baselines for Q-Learning in pure-GPU setting☆235Nov 24, 2025Updated 3 months ago
- Baselines for gymnax 🤖☆74Apr 3, 2023Updated 2 years ago
- This repository contains data and code for our EMNLP 2018 paper "BanditSum: Extractive Summarization as a Contextual Bandit".☆29Oct 9, 2019Updated 6 years ago
- Implementation of the PCA algorithm using the Gram-Schmidt modification of NIPALS☆10Jun 13, 2015Updated 10 years ago
- ☆13Dec 16, 2022Updated 3 years ago
- A Python implementation of the Hopfield network used to solve the traveling salesman problem☆10Apr 11, 2019Updated 6 years ago
- INTeractive learning via REPresentatIon Discovery☆36Jun 2, 2024Updated last year
- ☆12Apr 28, 2025Updated 10 months ago
- Solver for Inverse PDE Problems☆13Nov 17, 2024Updated last year
- ☆10Jul 8, 2021Updated 4 years ago
- Official implementation of the paper "RaceMOP: Mapless Online Path Planning for Multi-Agent Autonomous Racing using Residual Policy Learn…☆10Oct 23, 2024Updated last year
- ☆16Feb 22, 2025Updated last year
- Book: Practical Probabilistic Machine Learning in Python☆10Apr 3, 2021Updated 4 years ago
- Run large scale tensor and coupled matrix-tensor factorization on top of stock Hadoop.☆18Dec 28, 2017Updated 8 years ago
- ☆10Nov 15, 2023Updated 2 years ago
- DreamSmooth: Improving Model-Based RL with Reward Smoothing (ICLR 2024)☆12May 6, 2024Updated last year
- Factorized Personalized Markov Chains for Next Basket Recommendation in R and Python☆13Jan 7, 2018Updated 8 years ago