ying-wen/rlchina_pbl

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ying-wen/rlchina_pbl)

ying-wen / rlchina_pbl

☆10

Alternatives and similar repositories for rlchina_pbl

Users that are interested in rlchina_pbl are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

jidiai / SummerCourse2022
View on GitHub
☆90Aug 23, 2022Updated 3 years ago
microsoft / strategically_efficient_rl
View on GitHub
More efficient exploration for reinforcement learning in two-player, zero-sum game
☆21Jul 30, 2024Updated last year
wsjeon / multiagent-gail
View on GitHub
multiagent-gail working with multiagent-particle-env-v2 (which was modified by magail authors)
☆13Aug 17, 2019Updated 6 years ago
ryanxhr / BEAR
View on GitHub
Pytorch implementation of BEAR in "Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction"
☆11Oct 29, 2019Updated 6 years ago
Snnzhao / DAHCR
View on GitHub
This is the official implementation for IJCAI 2023 Paper: Towards Hierarchical Policy Learning for Conversational Recommendation with Hyp…
☆12Sep 19, 2023Updated 2 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
E-qin / GEAR
View on GitHub
Open-source code for GEAR
☆16Dec 3, 2025Updated 7 months ago
brenting / ANL-2023-example-agent
View on GitHub
☆12Apr 12, 2023Updated 3 years ago
WangXFng / NFARec
View on GitHub
[SIGIR 2024] NFARec: A Negative Feedback-Aware Recommender Model.
☆13Jan 9, 2025Updated last year
DorGetter / Autonomus-Car-CS-Final-Project
View on GitHub
This repository, "Autonomous Driving System On Various Platforms", details the exploration and implementation of autonomous driving syste…
☆10Aug 16, 2021Updated 4 years ago
tsinghua-fib-lab / WTG-DVR
View on GitHub
The official implementation of "DVR: Micro-Video Recommendation Optimizing Watch-Time-Gain under Duration Bias" (MM '22)
☆18Oct 15, 2022Updated 3 years ago
Csh090501 / BookManage
View on GitHub
早期做的一个基于SSH框架的图书管理系统，作为学习了Struts2，Spring4，Hibernate的初学者第一个开发的整合项目来说，应该具备的一些技能。
☆16Aug 14, 2017Updated 8 years ago
ciwang / policydistillation
View on GitHub
Reproducing Policy Distillation (DeepMind paper ICLR 2016)
☆22Feb 17, 2020Updated 6 years ago
nigelyaoj / Quality-Similar-Diversity
View on GitHub
Official Implementation for Quality-Similar Diversity via Population Based Reinforcement Learning
☆19Dec 26, 2025Updated 7 months ago
icesit / sjtu_drone
View on GitHub
ardrone simulation in gazebo(for kinetic and gazebo 7). Now it can run.
☆10Oct 27, 2017Updated 8 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
skezle / owl
View on GitHub
☆17Jun 30, 2022Updated 4 years ago
vitchyr / torch-rl
View on GitHub
A reinforcement learning package implemented in Torch
☆11Jan 24, 2016Updated 10 years ago
gokererdogan / Notebooks
View on GitHub
IPython Notebooks on various things
☆14Dec 4, 2017Updated 8 years ago
menglinjian / Deep-FTRL-ORW
View on GitHub
Code for the paper "Deep FTRL-ORW: An Efficient Deep Reinforcement Learning Algorithm for Solving Imperfect Information Extensive-Form Ga…
☆11Dec 1, 2022Updated 3 years ago
Simplified-Reasoning / TRM
View on GitHub
Code repository for the ICML 2026 Oral paper "Characterizing, Evaluating, and Optimizing Complex Reasoning".
☆17Jun 21, 2026Updated last month
BTelsang / Model-free-Control
View on GitHub
This code is to implement the model-free control algorithm as introduced in the paper Model-free control by Michel Fliess and Cedric Join…
☆13Nov 29, 2017Updated 8 years ago
MaxMassi / Face-Swapper
View on GitHub
Generate a diverse dataset of 100 face-swapped images using the Inswapper model for training robust face-swap detection classifiers. 🖼️�…
☆20Updated this week
kazuhirobben / MADQN_for_Global_Routing
View on GitHub
☆11Feb 1, 2022Updated 4 years ago
DavidMChan / MAPFpython
View on GitHub
The MAPFpython library is designed for rapid research into multi-agent pathfinding domains.
☆12May 18, 2017Updated 9 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
luiscruzga / stable-difussion-js
View on GitHub
Allows to create images from a given text using the stable-diffusion sdk
☆16Sep 1, 2022Updated 3 years ago
yannbouteiller / gym-airsimdroneracinglab
View on GitHub
☆13Feb 10, 2021Updated 5 years ago
npvoid / OnlineDoubleOracle
View on GitHub
☆10Apr 23, 2021Updated 5 years ago
AgFeather / Movie_Recommendation_System
View on GitHub
This is a movie recommendation system with tensorflow. Dataset is MovieLens.
☆20Aug 9, 2018Updated 7 years ago
maartenbuyl / rankformer
View on GitHub
RankFormer: Listwise Learning-to-Rank Using Listwide Labels (KDD 2023).
☆26Sep 12, 2023Updated 2 years ago
jackielinxiao / TPM
View on GitHub
Codes for TPM, a tree based model for watch time prediction
☆25Apr 18, 2023Updated 3 years ago
wingsweihua / gym_cityflow
View on GitHub
Adds CityFlow to Gym
☆33Nov 15, 2021Updated 4 years ago
jidiai / ai_lib
View on GitHub
☆174Oct 9, 2023Updated 2 years ago
xiaoshijiu333 / JavaWeb---BookManagementSystem
View on GitHub
JavaWeb，图书管理系统
☆18Oct 28, 2018Updated 7 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Theohhhu / CloseAirCombat_baseline
View on GitHub
An environment based on JSBSIM aimed at one-to-one close air combat.
☆14May 15, 2023Updated 3 years ago
AlexMan2000 / UCB_EECS_127
View on GitHub
☆11Jan 15, 2024Updated 2 years ago
aerdem4 / google-football
View on GitHub
Solution to Kaggle's Google Research Football Competition
☆14Dec 2, 2020Updated 5 years ago
tigert1998 / rl-gobang
View on GitHub
AlphaZero implementation on Gomoku
☆18Feb 26, 2025Updated last year
liangqi / XQStudio
View on GitHub
Xiangqi Notation Software, not my own code.
☆15Oct 11, 2011Updated 14 years ago
getlantern / proxy
View on GitHub
Golang library for core proxying logic
☆21Mar 28, 2024Updated 2 years ago
KellyReddington / TumblrLikesDownloader
View on GitHub
☆13Dec 19, 2018Updated 7 years ago