Implementation of Eligibility Traces with Neural Networks in PyTorch and TensorFlow 2.0
☆26 · Sep 10, 2021 · Updated 4 years ago
Alternatives and similar repositories for Deep-Eligibility-Traces
Users who are interested in Deep-Eligibility-Traces are comparing it to the libraries listed below.
- Energy-based Surprise Minimization for Multi-Agent Value Factorization ☆12 · Oct 20, 2023 · Updated 2 years ago
- Implementation of OpenAI's Evolution Strategies in PyTorch. ☆20 · Apr 22, 2020 · Updated 6 years ago
- Hierarchical Attention in Reinforcement Learning for Stock Order Executions ☆32 · Apr 7, 2021 · Updated 5 years ago
- ☆27 · Mar 11, 2025 · Updated last year
- A reinforcement learning package implemented in Torch ☆11 · Jan 24, 2016 · Updated 10 years ago
- ☆14 · Nov 28, 2022 · Updated 3 years ago
- Code to reproduce results from the paper: Prediction and Control in Continual Reinforcement Learning, NeurIPS 2023. ☆13 · May 10, 2024 · Updated 2 years ago
- Experimentation with Streamlit for personal LLM tool ☆15 · Jun 19, 2023 · Updated 2 years ago
- Implementation of Proximal Policy Optimization in Jax+Flax ☆21 · May 18, 2023 · Updated 3 years ago
- A tutorial on doing RL research in Julia using both Jupyter notebooks and normal project structures. ☆10 · Jun 23, 2021 · Updated 4 years ago
- Official repository for the paper "Exploring the Promise and Limits of Real-Time Recurrent Learning" (ICLR 2024) ☆13 · Jun 11, 2025 · Updated 11 months ago
- Pytorch Implementation for First Order Constrained Optimization in Policy Space (FOCOPS). ☆29 · Dec 9, 2021 · Updated 4 years ago
- C++ Thread Pool implementation based on POSIX pthread ☆14 · Mar 17, 2015 · Updated 11 years ago
- Leveraging Recursive Gumbel-Max Trick for Approximate Inference in Combinatorial Spaces, NeurIPS 2021 ☆14 · Dec 11, 2021 · Updated 4 years ago
- ☆11 · Jul 25, 2021 · Updated 4 years ago
- Variational Reinforcement Learning ☆17 · Jul 25, 2024 · Updated last year
- Code for magnetic mirror descent. ☆18 · Oct 5, 2023 · Updated 2 years ago
- Implementation of the psquare algorithm for quantile value estimation ☆10 · Apr 21, 2024 · Updated 2 years ago
- The Atlas Built For Humanoid Enthusiasts ☆83 · May 12, 2026 · Updated last week
- Progress, Notes, Summaries and a lot of Questions on Machine Learning ☆55 · Jan 22, 2020 · Updated 6 years ago
- ☆11 · Feb 13, 2021 · Updated 5 years ago
- Code for reproducing the results from the paper Avoiding Side Effects in Complex Environments ☆12 · Jun 3, 2021 · Updated 4 years ago
- A set of solutions to ETHZ ROS lectures ☆13 · Jul 19, 2017 · Updated 8 years ago
- JAX implementation of the T5 model: Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer ☆24 · Jun 10, 2023 · Updated 2 years ago
- Dreamer on JAX ☆16 · Jan 19, 2022 · Updated 4 years ago
- This repository demonstrates the application of our proposed task-free continual learning method on a synthetic experiment. ☆13 · Jun 24, 2019 · Updated 6 years ago
- Co-training for Policy Learning ☆13 · Aug 8, 2019 · Updated 6 years ago
- ☆17 · Mar 23, 2025 · Updated last year
- Stock Trading Model using Q Learning ☆10 · Dec 16, 2020 · Updated 5 years ago
- Montvieux has developed “The hunting of the PLARK” Artificial Intelligence (AI) testbed to support a Hackathon activity at the Alan Turin… ☆18 · May 22, 2023 · Updated 2 years ago
- Repository for "Scaling Evaluation-time Compute with Reasoning Models as Process Evaluators" ☆12 · Mar 25, 2025 · Updated last year
- ☆10 · Nov 23, 2020 · Updated 5 years ago
- Robust Optimal Control for Flight Planning ☆13 · May 12, 2025 · Updated last year
- A Python implementation of the SARSA λ reinforcement learning algorithm ☆12 · Mar 6, 2019 · Updated 7 years ago
- ☆23 · Nov 9, 2021 · Updated 4 years ago
- Code for [NeurIPS'2019 Spotlight] Policy Continuation with Hindsight Inverse Dynamics ☆15 · Jan 7, 2020 · Updated 6 years ago
- MetaArcade is a configurable environment suite for meta-learning ☆16 · Oct 19, 2022 · Updated 3 years ago
- ☆10 · Jun 5, 2025 · Updated 11 months ago
- Code for SyncTwin: Treatment Effect Estimation with Longitudinal Outcomes (NeurIPS 2021) ☆12 · Nov 30, 2021 · Updated 4 years ago