Implementation of Scheduled Policy Optimization for task-oriented language grouding
☆29Jul 16, 2018Updated 7 years ago
Alternatives and similar repositories for walk_the_blocks
Users that are interested in walk_the_blocks are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Blocks World -- Simulator, Code, and Models (Misra et al. EMNLP 2017)☆40Feb 7, 2019Updated 7 years ago
- Tree-LSTM + Self-Structured Attention -- a method to summarize textual data by topics☆10Apr 26, 2018Updated 8 years ago
- A TensorFlow implementation of dependency-based word embeddings (dependency-based word2vec)☆12Jan 26, 2016Updated 10 years ago
- Code for the paper "TD or not TD: Analyzing the Role of Temporal Differencing in Deep Reinforcement Learning", Artemij Amiranashvili, Ale…☆12Aug 24, 2018Updated 7 years ago
- Code for Emergent Translation in Multi-Agent Communication☆82Jun 6, 2018Updated 8 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Cornell Instruction Following Framework☆35Oct 11, 2021Updated 4 years ago
- RL framework for embodied agents based on PyTorch☆11Apr 11, 2019Updated 7 years ago
- ☆10Sep 29, 2017Updated 8 years ago
- ☆16Mar 2, 2019Updated 7 years ago
- RC-NFQ: Regularized Convolutional Neural Fitted Q Iteration. A batch algorithm for deep reinforcement learning. Incorporates dropout regu…☆12Mar 17, 2021Updated 5 years ago
- Weakly Supervised Topic Segmentation and Labeling☆32Jan 16, 2022Updated 4 years ago
- Real-time price prediction in P2P energy systems☆12May 6, 2022Updated 4 years ago
- Official implementation for the paper: "Shallow Updates for Deep Reinforcement Learning"☆18Nov 2, 2017Updated 8 years ago
- Professor Forcing, NIPS'16☆45Apr 3, 2017Updated 9 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Separating value functions across time-scales.☆17May 13, 2019Updated 7 years ago
- Deep Reinforcement Learning for Multi Agent Soccer☆16Dec 15, 2016Updated 9 years ago
- Source code for the AI2 Reasoning Challenge (ARC) submission.☆16Dec 8, 2022Updated 3 years ago
- A (mixed integer) linear optimisation model for local energy systems☆13May 27, 2021Updated 5 years ago
- Training neural networks with back-prop, feedback-alignment and direct feedback-alignment☆11Mar 20, 2017Updated 9 years ago
- Feature Control as Intrinsic Motivation for Hierarchical Reinforcement Learning☆81Nov 22, 2017Updated 8 years ago
- An aspiring attempt to generate a continuous space of sentences with DenseNet☆26May 4, 2017Updated 9 years ago
- energy management codes developed in the past☆20Sep 26, 2021Updated 4 years ago
- A2C for GVG-AI☆22Nov 7, 2018Updated 7 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Implementation of the Hierarchical and Interpretable Skill Acquisition in Multi-task Reinforcement Learning by Tianmin Shu, Caiming Xiong…☆11Jun 18, 2018Updated 8 years ago
- ☆29Jan 25, 2018Updated 8 years ago
- A batch (multiple concurrent sequence pairs) implementation of Dynamic Time Warping (DTW) in Theano☆10Sep 13, 2015Updated 10 years ago
- Implementation of Multi-Agent Deep Deterministic Policy Gradients☆39Mar 28, 2018Updated 8 years ago
- course project: a simple implementation of Q learning and MPC☆19May 26, 2021Updated 5 years ago
- Attention is All You Need in Sonnet☆38Aug 30, 2017Updated 8 years ago
- in progress☆60Feb 19, 2016Updated 10 years ago
- Keras Implementation of TD3(Twin Delayed DDPG) with PER(Prioritized Experience Replay) option on OpenAI gym framework☆11May 29, 2021Updated 5 years ago
- Code for Learning idiolectal style variation in online register☆10May 18, 2023Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Reinforcement learning models in ViZDoom environment☆130Mar 9, 2022Updated 4 years ago
- Multi agent energy sharing in zero energy communities using Deep Reinforcement Learning☆11Aug 27, 2018Updated 7 years ago
- Atari gauntlet for RL agents☆29Mar 18, 2017Updated 9 years ago
- Repository containing supplementary materials and code for "JuMP: A Modeling Language for Mathematical Optimization"☆27Mar 1, 2016Updated 10 years ago
- A curated list of awesome Artificial Life simulators, papers and resources.☆15Dec 9, 2025Updated 6 months ago
- Distributed A3C☆34Dec 22, 2017Updated 8 years ago
- Contains all Kaggle meetup documents: tutorials, examples etc.☆12Jul 3, 2015Updated 10 years ago