karush17 / Deep-Eligibility-TracesView external linksLinks
Implementation of Eligibility Traces with Neural Networks in PyTorch and Tensorflow 2.0
☆26Sep 10, 2021Updated 4 years ago
Alternatives and similar repositories for Deep-Eligibility-Traces
Users that are interested in Deep-Eligibility-Traces are comparing it to the libraries listed below
Sorting:
- Energy-based Surprise Minimization for Multi-Agent Value Factorization☆12Oct 20, 2023Updated 2 years ago
- Evolution-based Soft Actor-Critic (ESAC)☆42Jul 25, 2024Updated last year
- ☆27Mar 11, 2025Updated 11 months ago
- Pytorch Implementation for First Order Constrained Optimization in Policy Space (FOCOPS).☆29Dec 9, 2021Updated 4 years ago
- A library for developing and applying Seldonian algorithms☆12Jan 13, 2024Updated 2 years ago
- Real-Time RTUs☆11Jan 2, 2025Updated last year
- A reinforcement learning package implemented in Torch☆11Jan 24, 2016Updated 10 years ago
- Set of utilities for using Quantum-Espresso with ASE and ipython notebooks.☆12Jul 31, 2017Updated 8 years ago
- A WIP library to control B1500 and similar testers via the VISA protocol, built on pyvisa☆11Nov 4, 2016Updated 9 years ago
- ☆10Jun 5, 2025Updated 8 months ago
- Source code for Interpretable Reward Redistribution in Reinforcement Learning: A Causal Approach (NeurIPS 2023)☆10Dec 12, 2023Updated 2 years ago
- ☆11Jul 3, 2023Updated 2 years ago
- Repository for "Scaling Evaluation-time Compute with Reasoning Models as Process Evaluators"☆12Mar 25, 2025Updated 10 months ago
- ☆11Jul 25, 2021Updated 4 years ago
- A tutorial on doing RL research in Julia using both Jupyter notebooks and normal project structures.☆10Jun 23, 2021Updated 4 years ago
- Progress, Notes, Summaries and a lot of Questions on Machine Learning☆55Jan 22, 2020Updated 6 years ago
- A GPU-accelerated toolbox for hyperbolic PDEs in a weaker (viscosity) sense. It leverages the integral to the solution of the conservatio…☆11Jan 30, 2026Updated 2 weeks ago
- MetaArcade is a configurable environment suite for meta-learning☆16Oct 19, 2022Updated 3 years ago
- PyTorch implementation of DreamerV3 from "Mastering Diverse Domains with World Models"☆14Aug 8, 2025Updated 6 months ago
- Stock Trading Model using Q Learning☆10Dec 16, 2020Updated 5 years ago
- Leveraging Recursive Gumbel-Max Trick for Approximate Inference in Combinatorial Spaces, NeurIPS 2021☆13Dec 11, 2021Updated 4 years ago
- Experimentation with Streamlit for personal LLM tool☆15Jun 19, 2023Updated 2 years ago
- this is for the ACM MM paper---Backdoor Attack on Crowd Counting☆17Jul 10, 2022Updated 3 years ago
- Code to reproduce results from the paper: Prediction and Control in Continual Reinforcement Learning, NeurIPS 2023.☆12May 10, 2024Updated last year
- A set of solutions to ETHZ ROS lectures☆13Jul 19, 2017Updated 8 years ago
- ☆11Feb 13, 2021Updated 5 years ago
- High-Performance Machine Learning Primitives☆13Apr 17, 2021Updated 4 years ago
- Reinforcement learning on gridworld with Q-learning☆10Jan 28, 2017Updated 9 years ago
- ☆16Feb 15, 2023Updated 2 years ago
- ☆14Nov 28, 2022Updated 3 years ago
- Official implementation of Neural Episodic Control with State Abstraction☆13Aug 3, 2023Updated 2 years ago
- Comprehensive Implementation of Proximal Policy Optimization☆12Aug 3, 2021Updated 4 years ago
- ☆23Nov 9, 2021Updated 4 years ago
- ☆14Nov 21, 2022Updated 3 years ago
- Code for [NeurIPS'2019 Spotlight] Policy Continuation with Hindsight Inverse Dynamics☆15Jan 7, 2020Updated 6 years ago
- ☆15Mar 28, 2022Updated 3 years ago
- Co-training for Policy Learning☆13Aug 8, 2019Updated 6 years ago
- Boids are a way of modeling the complex flocking behavior of birds as well as many marine life including schools of fish; the simple rule…☆19Dec 31, 2019Updated 6 years ago
- PyTorch implementation of the Option-Critic framework, Harb et al. 2016☆143Aug 2, 2024Updated last year