the solustion to https://openai.com/requests-for-research
☆12Mar 23, 2017Updated 9 years ago
Alternatives and similar repositories for RFR-solution
Users that are interested in RFR-solution are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- RC-NFQ: Regularized Convolutional Neural Fitted Q Iteration. A batch algorithm for deep reinforcement learning. Incorporates dropout regu…☆12Mar 17, 2021Updated 5 years ago
- Python Reinforcement Learning Algorithms for the Arcade Learning Environment☆12Jun 15, 2015Updated 10 years ago
- ☆19Apr 25, 2016Updated 9 years ago
- TensorFlow implementation of Pointer Networks☆12Aug 30, 2016Updated 9 years ago
- Parallelized Cross Entropy Method☆14Jul 26, 2023Updated 2 years ago
- ☆23Oct 7, 2018Updated 7 years ago
- Deep reinforcement learning in ViZDoom (using Tensorflow)☆19Jan 25, 2018Updated 8 years ago
- Tensorflow implementation of the map reading algorithm described in ‘Teaching a Machine to Read Maps with Deep Reinforcement Learning’☆32Nov 14, 2017Updated 8 years ago
- This is the code for "Learning Sentiment Memories for Sentiment Modification without Parallel Data".☆55Dec 18, 2018Updated 7 years ago
- ☆12Sep 17, 2022Updated 3 years ago
- A2C, ACKTR and A2T implementations for ViZDoom☆10Dec 18, 2017Updated 8 years ago
- LLMRouterBench: A Massive Benchmark and Unified Framework for LLM Routing☆44Jan 30, 2026Updated last month
- Stochastic Markov Games☆12Oct 5, 2017Updated 8 years ago
- 3D learning environment with rigid body simulation for Linux/MacOSX☆14Dec 24, 2021Updated 4 years ago
- Keras implementation of guide actor-critic for continuous control☆11Mar 12, 2018Updated 8 years ago
- Model-Free Episodic Control☆14Jan 12, 2017Updated 9 years ago
- Tensorflow Implementation for "Noisy network for exploration"☆31Jul 17, 2017Updated 8 years ago
- Maddpg_flight code☆11Jul 4, 2018Updated 7 years ago
- Chinese Natural Language Correction via Language Model☆15Sep 14, 2017Updated 8 years ago
- A simple middleware to improving GPU utilization then speedup online inference.☆19Feb 22, 2021Updated 5 years ago
- in progress☆108Jun 11, 2017Updated 8 years ago
- Proceedings of ICML 2018☆39Feb 23, 2026Updated last month
- WIP implementation of "The Predictron: End-To-End Learning and Planning" (http://arxiv.org/abs/1612.08810) in Chainer☆11Dec 31, 2016Updated 9 years ago
- PyOblige is Python wrapper for OBLIGE - random level generator for Doom☆11Jul 2, 2018Updated 7 years ago
- ☆14Jun 21, 2016Updated 9 years ago
- Implement Google Deep Minds DQN for multiple agents for a grid world environment where vehicles must pick up customers.☆29Mar 7, 2018Updated 8 years ago
- An implementation of "Pointer Networks" in Tensorflow☆44Jul 21, 2017Updated 8 years ago
- graduation thesis latex template for sustcer☆13Apr 6, 2015Updated 10 years ago
- ☆47Jun 19, 2018Updated 7 years ago
- Implementation for ACER in tensorflow and sonnet by deepmind☆11Aug 28, 2017Updated 8 years ago
- ☆12Feb 26, 2024Updated 2 years ago
- learning to play atari games with reinforcement learning☆10Jan 4, 2016Updated 10 years ago
- ☆10Mar 24, 2020Updated 5 years ago
- This is a browser extension to show the "table of content" of *.md page☆11Jan 4, 2023Updated 3 years ago
- Progressive Attention Networks☆12Oct 25, 2016Updated 9 years ago
- Separating value functions across time-scales.☆17May 13, 2019Updated 6 years ago
- This project was created for Unity ML-Agents Challenge - https://connect.unity.com/challenges/ml-agents-1☆12Aug 15, 2020Updated 5 years ago
- ROS package for robot learning☆17Oct 16, 2019Updated 6 years ago
- Environments with IC3Net paper☆15Jan 8, 2019Updated 7 years ago