☆10Aug 18, 2022Updated 3 years ago
Alternatives and similar repositories for rlchina_pbl
Users who are interested in rlchina_pbl are comparing it to the libraries listed below.
- ☆90Aug 23, 2022Updated 3 years ago
- More efficient exploration for reinforcement learning in two-player, zero-sum games☆21Jul 30, 2024Updated last year
- multiagent-gail working with multiagent-particle-env-v2 (which was modified by magail authors)☆13Aug 17, 2019Updated 6 years ago
- Pytorch implementation of BEAR in "Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction"☆11Oct 29, 2019Updated 6 years ago
- This is the official implementation for IJCAI 2023 Paper: Towards Hierarchical Policy Learning for Conversational Recommendation with Hyp…☆12Sep 19, 2023Updated 2 years ago
- Open-source code for GEAR☆13Dec 3, 2025Updated 3 months ago
- The official implementation of "DVR: Micro-Video Recommendation Optimizing Watch-Time-Gain under Duration Bias" (MM '22)☆18Oct 15, 2022Updated 3 years ago
- [SIGIR 2024] NFARec: A Negative Feedback-Aware Recommender Model.☆12Jan 9, 2025Updated last year
- Reproducing Policy Distillation (DeepMind paper ICLR 2016)☆22Feb 17, 2020Updated 6 years ago
- Official Implementation for Quality-Similar Diversity via Population Based Reinforcement Learning☆19Dec 26, 2025Updated 3 months ago
- A reinforcement learning package implemented in Torch☆11Jan 24, 2016Updated 10 years ago
- ardrone simulation in Gazebo (for Kinetic and Gazebo 7). Now it can run.☆10Oct 27, 2017Updated 8 years ago
- ☆12Apr 12, 2023Updated 2 years ago
- AlphaZero implementation on Gomoku☆18Feb 26, 2025Updated last year
- ☆16Jun 30, 2022Updated 3 years ago
- IPython Notebooks on various things☆14Dec 4, 2017Updated 8 years ago
- Repository for the "Chain-of-Thought Reasoning In The Wild Is Not Always Faithful" paper☆31Nov 28, 2025Updated 4 months ago
- Code for the paper "Deep FTRL-ORW: An Efficient Deep Reinforcement Learning Algorithm for Solving Imperfect Information Extensive-Form Ga…☆11Dec 1, 2022Updated 3 years ago
- Official implementation of HEGNN, a novel high-degree equivariant graph neural network proposed in the NeurIPS 2024 paper 'Are High-Degre…☆33Nov 8, 2024Updated last year
- ☆10Apr 23, 2021Updated 4 years ago
- Black-box Bayesian inference for agent-based models☆32Aug 25, 2024Updated last year
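- An early library management system built on the SSH framework; as the first integrated project by a beginner who had learned Struts2, Spring4, and Hibernate, it shows the skills such a project should cover.☆16Aug 14, 2017Updated 8 years ago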
- ☆17Oct 12, 2023Updated 2 years ago
- LaTeX Template Satisfying McMaster Thesis Formatting Requirements☆16May 6, 2014Updated 11 years ago
- ☆13Feb 10, 2021Updated 5 years ago
- This code is to implement the model-free control algorithm as introduced in the paper Model-free control by Michel Fliess and Cedric Join…☆13Nov 29, 2017Updated 8 years ago
- Creates images from a given text using the stable-diffusion SDK☆16Sep 1, 2022Updated 3 years ago
- The MAPFpython library is designed for rapid research into multi-agent pathfinding domains.☆12May 18, 2017Updated 8 years ago
- The official repository for guided jailbreak benchmark☆29Jul 28, 2025Updated 8 months ago
- An environment based on JSBSIM aimed at one-to-one close air combat.☆13May 15, 2023Updated 2 years ago
- ☆174Oct 9, 2023Updated 2 years ago
- Solution to Kaggle's Google Research Football Competition☆14Dec 2, 2020Updated 5 years ago
- This repository consists of the implementations of the Distance Correlation (DC) and Information Over Bias (IOB) metrics proposed in [li…☆23Oct 16, 2021Updated 4 years ago
- This is a movie recommendation system with tensorflow. Dataset is MovieLens.☆20Aug 9, 2018Updated 7 years ago
- Codes for TPM, a tree based model for watch time prediction☆25Apr 18, 2023Updated 2 years ago
- This repository, "Autonomous Driving System On Various Platforms", details the exploration and implementation of autonomous driving syste…☆10Aug 16, 2021Updated 4 years ago
- RankFormer: Listwise Learning-to-Rank Using Listwide Labels (KDD 2023).☆27Sep 12, 2023Updated 2 years ago
- Adds CityFlow to Gym☆32Nov 15, 2021Updated 4 years ago
- JavaWeb library management system☆18Oct 28, 2018Updated 7 years ago