rlcourse-march-17-hugobb created by GitHub Classroom
☆15Jul 3, 2024Updated last year
Alternatives and similar repositories for rlcourse-march-17-hugobb
Users that are interested in rlcourse-march-17-hugobb are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for Sibling Rivalry and experiments presented in associated paper☆18May 1, 2025Updated last year
- Project on Successor Features in Deep Reinforcement Learning and Transfer Learning☆24Feb 5, 2018Updated 8 years ago
- Design good curriculums for deep reinforcement learning☆14May 18, 2016Updated 10 years ago
- Julia Implementation of the POMCP algorithm for solving POMDPs☆12Aug 6, 2021Updated 4 years ago
- OpenAI gym environments for goal-conditioned and language-conditioned reinforcement learning☆14Jan 27, 2026Updated 5 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Markovian State and Action Abstractions for MDPs via Hierarchical MCTS within a POMDP Formulation☆11Jul 26, 2016Updated 9 years ago
- ☆20Jan 31, 2018Updated 8 years ago
- Code to reproduce the results in the "Unsupervised Learning of Goal Spaces for Intrinsically Motivated Exploration"☆21Feb 14, 2018Updated 8 years ago
- Reproducing Policy Distillation (DeepMind paper ICLR 2016)☆22Feb 17, 2020Updated 6 years ago
- Simulates agent path planning using A* and Q-Learning in a 2D grid☆12Apr 5, 2014Updated 12 years ago
- ☆10May 13, 2018Updated 8 years ago
- Code for paper "IntPhys: A Benchmark and Dataset for Intuitive Physics".☆29Nov 18, 2019Updated 6 years ago
- Sparse Graphical Memory for Robust Planning☆29Nov 21, 2022Updated 3 years ago
- hierarchical Q-learning implementation☆11Jun 9, 2015Updated 11 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆142Feb 26, 2019Updated 7 years ago
- Learning Domain-Independent Planning Heuristics over Hypergraphs (ICAPS'20)☆15Mar 21, 2025Updated last year
- Library for action model acquisition from state trace data.☆25Jan 7, 2025Updated last year
- Conditional Random Fields implemented as Lasagne layer☆10Jul 22, 2016Updated 9 years ago
- Lagrangian VAE☆28Jul 27, 2018Updated 7 years ago
- Official PyTorch implementation of Rethinking Guidance Information to Utilize Unlabeled Samples: A Label-Encoding Perspective.☆19Sep 27, 2024Updated last year
- Contrastive Distillation for Incremental Class Learning in Semantic Segmentation☆14Dec 13, 2021Updated 4 years ago
- ☆37Nov 10, 2016Updated 9 years ago
- Layered distributions using FLAX/JAX☆10Dec 13, 2020Updated 5 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆11Oct 5, 2020Updated 5 years ago
- Simple bit flipping with sparse rewards using HER, similarly to the original paper☆39Feb 25, 2019Updated 7 years ago
- pytorch implementation of Curiosity-driven Exploration by Self-supervised Prediction☆80Jan 5, 2019Updated 7 years ago
- Energy-Based Hindsight Experience Prioritization (CoRL 2018) Oral presentation (7%)☆35Nov 28, 2018Updated 7 years ago
- The code of LLaVO☆19Oct 21, 2025Updated 8 months ago
- Domain independent implementation of Monte Carlo Tree Search methods.☆18Jul 30, 2018Updated 7 years ago
- CarND Capstone☆10Apr 2, 2018Updated 8 years ago
- Fuzzy Logic "Fuzzy Associative Memory" (FAM) for fuzzy control systems, decision-making, artificial intelligence / AI, game agents & bots…☆62Dec 14, 2015Updated 10 years ago
- Implementation of Dueling Network Architectures for Deep Reinforcement Learning paper with Pytorch☆14Sep 26, 2020Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Converts Quora's new NLU dataset to SNLI txt/jsonl format, plus test/dev split, tokenization.☆14Jan 27, 2017Updated 9 years ago
- ☆15Sep 5, 2016Updated 9 years ago
- OpenLock Environment for OpenAI Gym☆19Feb 16, 2021Updated 5 years ago
- Monte Carlo Tree Search Mario AI☆31Dec 28, 2013Updated 12 years ago
- ☆16Mar 8, 2022Updated 4 years ago
- Code for the paper "A Boolean Task Algebra For Reinforcement Learning"☆11Dec 8, 2022Updated 3 years ago
- Model-Free-Episodic-Control implementation.☆17Jun 3, 2019Updated 7 years ago