thaihungle/EPGT

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/thaihungle/EPGT)

thaihungle / EPGT

Episodic Policy Gradient Training

☆17

Alternatives and similar repositories for EPGT

Users that are interested in EPGT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

thaihungle / PANM
View on GitHub
Source code for paper "Plug, Play, and Generalize: Length Extrapolation with Pointer-Augmented Neural Memory"
☆13Oct 7, 2024Updated last year
thaihungle / MRPO
View on GitHub
☆14Dec 16, 2024Updated last year
thaihungle / MAED
View on GitHub
Memory-augmented Encoder Decoder Architecture
☆14May 18, 2020Updated 6 years ago
thaihungle / AJCAI22-Tutorial
View on GitHub
Demo code for AJCAI22-Tutorial
☆11Dec 7, 2022Updated 3 years ago
thaihungle / Neurocoder
View on GitHub
Code of Neurocoder paper
☆15Feb 17, 2023Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
thaihungle / NSM
View on GitHub
Neural Stored-program Memory
☆27Dec 8, 2022Updated 3 years ago
thaihungle / DMNC
View on GitHub
Dual Memory Neural Computer
☆29Nov 8, 2021Updated 4 years ago
thaihungle / SHM
View on GitHub
Source code for Stable Hadamard Memory
☆24May 6, 2025Updated last year
thaihungle / SAM
View on GitHub
Self-attentive Associative Memory & SAM-based Two-Memory Model
☆61May 4, 2022Updated 4 years ago
DA2I2-SLM / DAR
View on GitHub
Source code for the paper: Hear Both Sides: Efficient Multi-Agent Debate via Diversity-Aware Message Retention
☆18Apr 16, 2026Updated 3 months ago
dickreuter / tf_rl
View on GitHub
Refinforcement learning framework
☆15Mar 25, 2023Updated 3 years ago
NathanHerr / LLM-First-Search
View on GitHub
☆17Jun 9, 2025Updated last year
yaolu / ordered-prompt
View on GitHub
☆13Dec 13, 2022Updated 3 years ago
vwxyzjn / gym-pysc2
View on GitHub
Gym wrapper for pysc2
☆10Sep 16, 2022Updated 3 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
ShawK91 / MERL
View on GitHub
☆11Sep 4, 2019Updated 6 years ago
pucrs-ai-cs / reinforcement
View on GitHub
Reinforcement Learning
☆12Jun 22, 2017Updated 9 years ago
chiajuichuang / COMP9313P
View on GitHub
COMP9313 Big Data Management
☆10Feb 11, 2018Updated 8 years ago
MouseHu / GEM
View on GitHub
☆16Jul 1, 2021Updated 5 years ago
ajaysub110 / RLin200Lines
View on GitHub
PyTorch implementations of Reinforcement Learning algorithms in less than 200 lines
☆10Apr 3, 2020Updated 6 years ago
Aladoro / domain-robust-visual-il
View on GitHub
Domain-Robust Visual Imitation Learning with Mutual Information Constraints code
☆19Mar 1, 2021Updated 5 years ago
MtSomeThree / constrDecoding
View on GitHub
Constrained Decoding Project
☆20Nov 10, 2023Updated 2 years ago
ermongroup / dail
View on GitHub
The Official Implementation of Domain Adaptive Imitation Learning (DAIL)
☆25Oct 26, 2020Updated 5 years ago
Davido111200 / QuestionAnswering_demoVbdi
View on GitHub
This project aims to build an English Question Answering web application. Instructions are given below. Have fun using our program :D
☆19Nov 13, 2022Updated 3 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
tgangwani / RL-Indirect-imitation
View on GitHub
Pytorch code for "State-only Imitation with Transition Dynamics Mismatch" (ICLR 2020)
☆20Feb 29, 2020Updated 6 years ago
aws-samples / amazon-sagemaker-tsp-deep-rl
View on GitHub
train, deploy, and make inferences using deep reinforcement learning to solve the Travelling Salesperson Problem
☆19Dec 22, 2023Updated 2 years ago
FangchenLiu / SAIL
View on GitHub
Code for Paper "State Alignment-based Imitation Learning". Under maintenance
☆17May 1, 2020Updated 6 years ago
JSBSim-Team / aeromatic
View on GitHub
A web application to generate configuration files for the JSBSim Flight Dynamics Model.
☆17Jul 6, 2019Updated 7 years ago
geoaigroup / nasa_harvest_boundary_detection_challenge
View on GitHub
☆16Apr 2, 2024Updated 2 years ago
jetnew / SlimeRL
View on GitHub
Code repository for the research project "You Play Ball, I Play Ball: Bayesian Multi-Agent Reinforcement Learning for Slime Volleyball", …
☆17Nov 15, 2020Updated 5 years ago
LJC-FVNR / In-context-Time-Series-Predictor
View on GitHub
Implementation of the paper "In-context Time Series Predictor" (ICLR 2025)
☆16Feb 11, 2025Updated last year
gkswamy98 / pillbox
View on GitHub
Contains implementation of AdVIL, AdRIL, and DAeQuIL algorithms from the ICML '21 Paper Of Moments and Matching.
☆21Apr 18, 2022Updated 4 years ago
twneale / citation-network-analysis
View on GitHub
Materials for my PyData Boston 2013 talk
☆16Sep 26, 2013Updated 12 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
frycast / SQL_course
View on GitHub
Course material for the Intro to SQL Course
☆13Mar 15, 2026Updated 4 months ago
seolhokim / BipedalWalker-BranchingDQN
View on GitHub
The Easiest Pytorch Implementation of Branching-DQN
☆12Feb 10, 2021Updated 5 years ago
alperenkbd / AircraftFighterSimulationUsingMachineLearning
View on GitHub
Nowadays Using machine learning methods at simulations systems has been gaining importance with spreading and growing machine learning me…
☆26Nov 4, 2025Updated 8 months ago
philipjball / OffCon3
View on GitHub
📴 OffCon^3: SOTA PyTorch SAC and TD3 Implementations (arxiv: 2101.11331)
☆25Jun 20, 2021Updated 5 years ago
greydanus / ncf
View on GitHub
Nature's Cost Function (NCF). Finding paths of least action with gradient descent.
☆18Mar 30, 2023Updated 3 years ago
KaiyuanGao / Kick_AI_Interview
View on GitHub
整理了机器学习、深度学习、自然语言处理面试中高频知识点，并提供个人答案（仅供参考）。
☆26Aug 8, 2020Updated 5 years ago
jaromiru / NASimEmu-agents
View on GitHub
Deep RL agents for NASimEmu. See also https://github.com/jaromiru/NASimEmu.
☆15Jul 16, 2024Updated 2 years ago