Keras Implementation of DDPG(Deep Deterministic Policy Gradient) with PER(Prioritized Experience Replay) option on OpenAI gym framework
☆13Mar 25, 2023Updated 3 years ago
Alternatives and similar repositories for gym-ddpg-keras
Users that are interested in gym-ddpg-keras are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A Python library for learning and verification of neural networks and other machine learning models☆14Sep 18, 2025Updated 6 months ago
- TaskMet Task-driven Metric Learning for Model Learning☆19Feb 9, 2024Updated 2 years ago
- A LLM Multi-Agent Framework toward Ultra Large-Scale Code Generation and Optimization☆17Dec 22, 2024Updated last year
- Construction of a lumbar weapon detection system through CCTV cameras with the help of artificial intelligence neural network (YOLO_v5), …☆18Apr 15, 2022Updated 3 years ago
- ☆10Mar 13, 2023Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- ☆11Oct 5, 2020Updated 5 years ago
- ☆11Nov 18, 2023Updated 2 years ago
- Towards Visual Explanations for Convolutional Neural Networks via Input Resampling☆13Aug 16, 2017Updated 8 years ago
- Code for the paper "Refining Language Model with Compositional Explanation" (NeurIPS 2021)☆11Oct 25, 2021Updated 4 years ago
- Comparing sequential forecasters via confidence sequences & e-processes☆10Oct 24, 2023Updated 2 years ago
- Code for ICML 25 paper "Metadata Conditioning Accelerates Language Model Pre-training (MeCo)"☆50Jun 30, 2025Updated 9 months ago
- ☆13Feb 10, 2023Updated 3 years ago
- Code for the paper "Distinguishing the Knowable from the Unknowable with Language Models"☆11Apr 15, 2024Updated last year
- Code for ICML 2021 paper "Regularizing towards Causal Invariance: Linear Models with Proxies" (ICML 2021)☆11Mar 14, 2022Updated 4 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Teaching a humanoid to walk(ish), then displaying in your browser (using tensorflow.js and reinforcement learning)☆10Sep 7, 2020Updated 5 years ago
- CascadER: Cross-Modal Cascading for Knowledge Graph Link Prediction (arXiv 22)☆13Jun 17, 2022Updated 3 years ago
- ☆13Jul 26, 2023Updated 2 years ago
- Code for the paper "Optimal Off-Policy Evaluation from Multiple Logging Policies"☆15Jul 17, 2021Updated 4 years ago
- Public repo containing code to train, visualize, and evaluate semi-supervised topic models and baselines for regression/classification on…☆11Apr 15, 2020Updated 5 years ago
- ICLR 2023: Learning to Extrapolate: A Transductive Approach☆11Aug 15, 2023Updated 2 years ago
- Our paper is titled "NUS-IDS at FinCausal 2021: Dependency Tree in Graph Neural Networks for better Cause-Effect Span Detection".☆13Feb 11, 2022Updated 4 years ago
- A set of methods for finding an appropriate number of topics in a text collection☆15Apr 14, 2025Updated 11 months ago
- ☆11Feb 27, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Offline Policy Evaluation via Adaptive Weighting with Data from Contextual Bandits☆10Oct 21, 2024Updated last year
- paper on dexpilot☆15Oct 14, 2019Updated 6 years ago
- Software package for intertemporal pricing optimization under reference effects and consumer heterogeneity estimation. Please see REAMDE.…☆10Mar 7, 2024Updated 2 years ago
- LLM Skirmish☆45Feb 3, 2026Updated last month
- Code for the paper "Deep FTRL-ORW: An Efficient Deep Reinforcement Learning Algorithm for Solving Imperfect Information Extensive-Form Ga…☆11Dec 1, 2022Updated 3 years ago
- Provable Worst Case Guarantees for the Detection of Out-of-Distribution Data☆13Sep 20, 2022Updated 3 years ago
- Pretraining summarization models using a corpus of nonsense☆13Sep 28, 2021Updated 4 years ago
- SODEN: A Scalable Continuous-Time Survival Model through Ordinary Differential Equation Networks☆14Mar 2, 2023Updated 3 years ago
- ☆11Dec 14, 2022Updated 3 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- CSS-LM: Contrastive Semi-supervised Fine-tuning of Pre-trained Language Models☆12Jul 1, 2023Updated 2 years ago
- Resources for our AAAI 2022 paper: "Unsupervised Editing for Counterfactual Stories".☆11Oct 25, 2022Updated 3 years ago
- Model Primitive Hierarchical Reinforcement Learning☆13Dec 8, 2022Updated 3 years ago
- ☆11Apr 23, 2023Updated 2 years ago
- Objective metrics for measuring visual texture similarity using STSIM features. Supervised by Thrasos Pappas.☆14Oct 4, 2023Updated 2 years ago
- ☆10Oct 26, 2022Updated 3 years ago
- This repository reproduces the results in the paper "How expressive are transformers in spectral domain for graphs?"(published in TMLR)☆11Jul 10, 2022Updated 3 years ago