lns/memoire

README badge preview

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/lns/memoire)


☆18

Alternatives and similar repositories for memoire

Users who are interested in memoire are comparing it to the libraries listed below.


lns / dapo
Source code for the paper "Divergence-Augmented Policy Optimization"
☆37 · Nov 28, 2019 · Updated 6 years ago
behaviorguidedRL / BGRL
Open source demo for the paper "Learning to Score Behaviors for Guided Policy Optimization"
☆24 · Jun 24, 2020 · Updated 6 years ago
hughperkins / torch-modder-notes
Notes for torch maintainers/modders
☆10 · Mar 29, 2016 · Updated 10 years ago
tencent-ailab / tleague_projpage
☆151 · Dec 9, 2024 · Updated last year
phraust1612 / MinervaSc2
Machine learning project using DeepMind's PySc2
☆12 · Aug 29, 2017 · Updated 8 years ago
IouJenLiu / HTS-RL
☆21 · Dec 22, 2020 · Updated 5 years ago
rlseminar / rlseminar.github.io
Reinforcement Learning Seminar at the Chinese University of Hong Kong, Shenzhen, China.
☆21 · Nov 17, 2023 · Updated 2 years ago
jparkerholder / PB2
Code for the Population-Based Bandits Algorithm, presented at NeurIPS 2020.
☆20 · Apr 13, 2021 · Updated 5 years ago
aypan17 / reward-misspecification
☆10 · Mar 13, 2023 · Updated 3 years ago
tianbingsz / SVRG
Stochastic Variance Reduction Policy Gradient Estimation
☆11 · Nov 6, 2018 · Updated 7 years ago
LinZichuan / AdMRL
Code for paper "Model-based Adversarial Meta-Reinforcement Learning" (https://arxiv.org/abs/2006.08875)
☆35 · Mar 6, 2021 · Updated 5 years ago
sii-yingwen / rommeo
IJCAI 2019 - Regularized Opponent Model with Maximum Entropy Objective (ROMMEO)
☆23 · Dec 8, 2022 · Updated 3 years ago
bkj / pbt
Population Based Training, Figure 2
☆25 · Dec 2, 2017 · Updated 8 years ago
sjtu-marl / bd_rd_psro
Code for "Towards Unifying Behavioral and Response Diversity for Open-ended Learning in Zero-sum Games"
☆24 · Feb 27, 2022 · Updated 4 years ago
ArmstrongWall / lsd-slam
☆12 · May 8, 2020 · Updated 6 years ago
diversepsro / diverse_psro
☆22 · May 20, 2021 · Updated 5 years ago
muupan / predictron
WIP implementation of "The Predictron: End-To-End Learning and Planning" (http://arxiv.org/abs/1612.08810) in Chainer
☆11 · Dec 31, 2016 · Updated 9 years ago
JBLanier / pipeline-psro
Official Code Release for Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games
☆57 · Aug 30, 2024 · Updated last year
ruizhaogit / music
Mutual Information State Intrinsic Control (ICLR 2021 Spotlight)
☆39 · Mar 1, 2021 · Updated 5 years ago
lamda-bbo / madac
Official implementation of NeurIPS22 paper “Multi-agent Dynamic Algorithm Configuration”
☆26 · Mar 6, 2023 · Updated 3 years ago
shaktikshri / adaptiveSystems
RL CIRL Research
☆13 · Dec 8, 2022 · Updated 3 years ago
tsinghua-fib-lab / UGI
Urban Generative Intelligence (UGI): A Foundational Platform for Embodied Agent and Future City
☆12 · Dec 17, 2023 · Updated 2 years ago
sauxpa / Quant
Quant finance scripts
☆15 · Apr 13, 2025 · Updated last year
flint-xf-fan / Federated-RLHF
[AAMAS 2025] Privacy-preserving and Personalized RLHF, with convergence guarantees. The Code contains experiments for training multiple i…
☆16 · Apr 16, 2025 · Updated last year
Breakend / ReproducibilityInContinuousPolicyGradientMethods
These are experiments for examining reproducibility in Policy Gradient RL algorithms in Continuous domains. Mainly using the Rllab implem…
☆17 · Sep 20, 2017 · Updated 8 years ago
noambrown / acpc_poker_gui_client
Rails application that allows humans to play poker matches managed by the Annual Computer Poker Competition's Dealer program in a web GUI…
☆11 · Apr 25, 2015 · Updated 11 years ago
flowersteam / curious
Implementation of CURIOUS: Intrinsically Motivated Modular Multi-Goal Reinforcement Learning
☆27 · May 15, 2020 · Updated 6 years ago
SUBER-Team / SUBER
This repository accompanies our research paper titled "An LLM-based Recommender System Environment".
☆17 · Jul 15, 2024 · Updated 2 years ago
wisnunugroho21 / reinforcement_learning_v_mpo
Deep Reinforcement Learning by using an on-policy adaptation of Maximum a Posteriori Policy Optimization (MPO)
☆16 · Oct 23, 2021 · Updated 4 years ago
alihussainmeer / Data-Analysis-to-predict-the-CO2-Emission
We have a dataset which contains various features of the car based on which we predict the Carbon dioxide emission.
☆15 · Oct 3, 2018 · Updated 7 years ago
sunkairan / MapReduce-Based-Deep-Learning
2013 Fall Cloud Computing Project for Nerve Cloud group: MapReduce-Based Deep Learning
☆15 · Dec 2, 2013 · Updated 12 years ago
PhilippeMorere / EMU-Q
Exploring by Minimizing Uncertainty of Q values (EMU-Q) as presented in "Bayesian RL for Goal-Only Rewards" at CoRL'18.
☆10 · Nov 8, 2018 · Updated 7 years ago
elsheikh21 / population-based-training-of-NNs
Applying PBT optimization technique to different domains
☆10 · Oct 16, 2019 · Updated 6 years ago
atupal / seismicCompetition
sc14 matlab application
☆14 · Nov 24, 2014 · Updated 11 years ago
lilydjwg / udt_py
Python UDT
☆16 · Oct 13, 2012 · Updated 13 years ago
maohangyu / PDiT
PDiT: Interleaving Perception and Decision-making Transformers for Deep Reinforcement Learning. AAMAS 2024 (full paper with oral presenta…
☆10 · Dec 27, 2023 · Updated 2 years ago
ankitkv / TD-VAE
TD-VAE in PyTorch
☆10 · May 28, 2019 · Updated 7 years ago
zt95 / infinite-horizon-off-policy-estimation
☆13 · Apr 3, 2019 · Updated 7 years ago
YRussac / WeightedLinearBandits
Code associated with the NeurIPS19 paper "Weighted Linear Bandits in Non-Stationary Environments"
☆17 · Nov 14, 2019 · Updated 6 years ago