Episodic Policy Gradient Training
☆17Mar 1, 2022Updated 4 years ago
Alternatives and similar repositories for EPGT
Users that are interested in EPGT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆14Dec 16, 2024Updated last year
- Memory-augmented Encoder Decoder Architecture☆14May 18, 2020Updated 6 years ago
- ☆14Jan 29, 2024Updated 2 years ago
- Model-based Episodic Control & Complementary Learning Systems☆17Dec 13, 2021Updated 4 years ago
- Neural Stored-program Memory☆27Dec 8, 2022Updated 3 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Dual Memory Neural Computer☆29Nov 8, 2021Updated 4 years ago
- Variational Memory Encoder-Decoder☆33May 30, 2019Updated 7 years ago
- Uniform Writing & Cached Uniform Writing☆28Feb 26, 2019Updated 7 years ago
- Source code for Stable Hadamard Memory☆24May 6, 2025Updated last year
- Reinforcement Learning (PPO) applied to a multiplayer simple card game (Witches)☆10Jun 7, 2020Updated 6 years ago
- Reinforcement Learning☆12Jun 22, 2017Updated 9 years ago
- Implementation of "Decoding-time Realignment of Language Models", ICML 2024.☆21Jun 17, 2024Updated 2 years ago
- Running RL algorithms on the fish/shark aquarium environment to find unexpected biological insights.☆10Nov 30, 2021Updated 4 years ago
- PyTorch implementations of Reinforcement Learning algorithms in less than 200 lines☆10Apr 3, 2020Updated 6 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Domain-Robust Visual Imitation Learning with Mutual Information Constraints code☆19Mar 1, 2021Updated 5 years ago
- Constrained Decoding Project☆20Nov 10, 2023Updated 2 years ago
- Code repository for the research project "You Play Ball, I Play Ball: Bayesian Multi-Agent Reinforcement Learning for Slime Volleyball", …☆17Nov 15, 2020Updated 5 years ago
- Pytorch implementation of InfoGAIL and WGAIL☆19Oct 7, 2022Updated 3 years ago
- Course material for the Intro to SQL Course☆13Mar 15, 2026Updated 3 months ago
- The Easiest Pytorch Implementation of Branching-DQN☆12Feb 10, 2021Updated 5 years ago
- 📴 OffCon^3: SOTA PyTorch SAC and TD3 Implementations (arxiv: 2101.11331)☆25Jun 20, 2021Updated 5 years ago
- UNSW - Data Structures and Algorithms (Computing 2)☆14Sep 27, 2017Updated 8 years ago
- Graph-based Reinforcement Learning☆16Jul 9, 2018Updated 7 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Code for paper "Successor Uncertainties: Exploration and Uncertainty in Temporal Difference Learning" by David Janz*, Jiri Hron*, Przemys…☆21Feb 24, 2023Updated 3 years ago
- A reference operating system for embedded platforms, with initial bring-up on Beaglebone Black (ARM Cortex-A8).☆93Updated this week
- Code for the paper "Spectral Editing of Activations for Large Language Model Alignments"☆31Dec 20, 2024Updated last year
- Nowadays Using machine learning methods at simulations systems has been gaining importance with spreading and growing machine learning me…☆25Nov 4, 2025Updated 7 months ago
- [WACV 2024] Domain Generalisation via Risk Distribution Matching☆23Sep 19, 2024Updated last year
- Inverse Constrained Reinforcement Learning (ICML 2021)☆28Aug 18, 2021Updated 4 years ago
- COMP9313 Big Data Management☆10Feb 11, 2018Updated 8 years ago
- Official implementation of "Know Your Action Set: Learning Action Relations for Reinforcement Learning", Jain et al., ICLR 2022.☆18Mar 16, 2022Updated 4 years ago
- (ICLR 2021) Learning to Represent Action Values as a Hypergraph on the Action Vertices☆23Jun 22, 2021Updated 5 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- The official code release for Don't Trust: Verify -- Grounding LLM Quantitative Reasoning with Autoformalization☆36Mar 9, 2025Updated last year
- Code implementation of: "Graying the black box: Understanding DQNs"☆20Feb 23, 2017Updated 9 years ago
- A2C is a special case of PPO!☆23May 20, 2022Updated 4 years ago
- MatchFlow is a full-stack invoice processing and purchase-order matching system—built on Clean Architecture with a React+Nginx front-end,…☆39Jul 3, 2025Updated last year
- MADRL project solving chess environment using PPO with two different methods: 2 agents/networks and a single agent/network.☆23Apr 1, 2023Updated 3 years ago
- Python code for robot dynamic simulation, analysis, control and planning☆24Apr 10, 2024Updated 2 years ago
- Personalized Steering of Large Language Models: Versatile Steering Vectors Through Bi-directional Preference Optimization☆47Jul 28, 2024Updated last year