Neuronal Circuit Policies
☆41Jul 21, 2022Updated 3 years ago
Alternatives and similar repositories for ordinary_neural_circuits
Users that are interested in ordinary_neural_circuits are comparing it to the libraries listed below
Sorting:
- Implementation of Selective Clustering Annotated using Modes of Projections☆11May 19, 2020Updated 5 years ago
- Implicit Distributional Actor Critic☆11Dec 8, 2021Updated 4 years ago
- Counterfactual Evaluation and Learning for Interactive Systems: Foundations, Implementations, and Recent Advances☆12Aug 14, 2022Updated 3 years ago
- ☆13Mar 11, 2018Updated 7 years ago
- Code used in our paper "Robust Deep Reinforment Learning through Adversarial Loss"☆33Oct 3, 2023Updated 2 years ago
- Deep neural architecture research framework☆12Mar 24, 2023Updated 2 years ago
- Evolution of Discrete data with Reinforcement Learning☆13Dec 8, 2019Updated 6 years ago
- An artificial life screensaver.☆14Dec 5, 2018Updated 7 years ago
- An extensible, dynamic and blazing fast derivatives trading engine☆12Feb 27, 2023Updated 3 years ago
- examples of Neural Autoregressive Flows☆13Aug 14, 2018Updated 7 years ago
- f-divergence based t-SNE☆16Oct 9, 2022Updated 3 years ago
- Github Repo for CARL: Cautious Adaptation for RL in Safety Critical Settings☆14Nov 22, 2022Updated 3 years ago
- 网格交易计算服务(django)☆13Mar 12, 2019Updated 6 years ago
- Variance Networks: When Expectation Does Not Meet Your Expectations, ICLR 2019☆39Jan 31, 2020Updated 6 years ago
- A PyTorch implementation of SEED, originally created by Google Research for TensorFlow 2.☆15Dec 8, 2020Updated 5 years ago
- Code that goes along with paper http://arxiv.org/abs/1801.01952☆46Mar 6, 2018Updated 8 years ago
- time-dependent Hamilton-Jacobi PDEs (http://www.cs.columbia.edu/~cxz/TimeDepHJB/)☆14Feb 5, 2017Updated 9 years ago
- Recurrent Discounted Attention unit (RDA) for Tensorflow☆22Mar 12, 2018Updated 7 years ago
- ☆19Dec 19, 2018Updated 7 years ago
- ☆22Aug 10, 2022Updated 3 years ago
- Implementation for "Statistical arbitrage in the US equities market" by Marco Avellaneda and Jeong-hyun Lee☆27Dec 10, 2018Updated 7 years ago
- Contextual Bandits Action Elimination DQN☆21Jun 25, 2018Updated 7 years ago
- Randomized Value Functions via Multiplicative Normalizing Flows☆17Jan 1, 2023Updated 3 years ago
- Code accompanying the paper "Action Robust Reinforcement Learning and Applications in Continuous Control" https://arxiv.org/abs/1901.0918…☆48Apr 14, 2019Updated 6 years ago
- Explore the optimization landscape for direct policy learning reinforcement learning.☆51Jan 16, 2019Updated 7 years ago
- Ranking Policy Gradient☆23Nov 27, 2019Updated 6 years ago
- D ratio is a performance metric to analyse the efficiency of algorithms that predict asset return or asset prices☆25Feb 22, 2024Updated 2 years ago
- Neurosymbolic transformers for multi-agent communication.☆22Oct 22, 2020Updated 5 years ago
- Disentangling Motion, Foreground and Background Features in Videos☆26Dec 13, 2017Updated 8 years ago
- This is a repository for enabling collaborative and proper practices for financial machine learning.☆29Updated this week
- PyTorch - Implicit Quantile Networks - Quantile Regression - C51☆22Jul 26, 2019Updated 6 years ago
- ☆28Nov 28, 2021Updated 4 years ago
- Sample BitMEX Market Making Bot☆28Apr 8, 2021Updated 4 years ago
- ☆23Jun 8, 2021Updated 4 years ago
- Official repo for our AAAI'21 paper, https://arxiv.org/abs/2007.12354☆28Jul 14, 2021Updated 4 years ago
- Map-Elites based on Evolution Strategies☆33Feb 11, 2022Updated 4 years ago
- PyTorch implementation of D4PG with the SOTA IQN Critic instead of C51. Implementation includes also the extensions Munchausen RL and D2R…☆24Apr 7, 2021Updated 4 years ago
- Repository for studying distributional rl☆30Feb 2, 2025Updated last year
- Proximal Policy Option-Critic☆26Jan 4, 2019Updated 7 years ago