VishalPallagani / LLMsforPlanningLab-AAAI24Links
Harnessing Large Language Models for Planning: A Lab on Strategies for Success and Mitigation of Pitfalls @ AAAI-24
☆16Updated 11 months ago
Alternatives and similar repositories for LLMsforPlanningLab-AAAI24
Users that are interested in LLMsforPlanningLab-AAAI24 are comparing it to the libraries listed below
Sorting:
- A set of RL experiments. Currently including: (1) the MDP rank experiment, based on policy gradient algorithm☆27Updated 4 years ago
- A comparison of Google SlateQ algorithm with traditional Reinforcement Learning algorithms☆39Updated 3 years ago
- NLPGym - A toolkit to develop RL agents to solve NLP tasks.☆202Updated 3 years ago
- EasyRL: An easy-to-use and comprehensive reinforcement learning package.☆253Updated 3 years ago
- Ray RLlib tutorial material☆122Updated 3 years ago
- Real-Time Bidding by Reinforcement Learning in Display Advertising☆188Updated 5 years ago
- A simple framework for experimenting with Reinforcement Learning in Python.☆327Updated last year
- A Real-World Benchmark for Reinforcement Learning based Recommender System☆232Updated 2 years ago
- Python Implementations of Monte Carlo Tree Search☆320Updated 4 years ago
- Awesome Deep Reinforcement Learning papers for industrial Search, Recommendation and Advertising.☆219Updated 4 years ago
- Implementations of Reinforcement Learning Algorithm☆44Updated 7 years ago
- the benchmark for finance☆10Updated 2 years ago
- Pointer NN differs from the previous attention attempts in that, instead of using attention to weight hidden units of an encoder, it uses…☆42Updated 4 years ago
- MovieLens recommendation system using reinforcement learning (GYM + PPO)☆50Updated 5 years ago
- Policy Gradient is all you need! A step-by-step tutorial for well-known PG methods.☆992Updated 8 months ago
- Learning to Rank in PyTorch☆91Updated 2 years ago
- A LLM training and evaluation benchmark for credit scoring☆65Updated 2 years ago
- ☆10Updated 8 years ago
- Pointer Networks Implementation in Keras☆155Updated 3 years ago
- ☆19Updated 5 years ago
- A collection of research and survey papers of reforcement learning (RL) based recommender system techniques.☆73Updated 5 years ago
- working example of a contextual multi-armed bandit☆55Updated 6 years ago
- Stable-Baselines tutorial for Journées Nationales de la Recherche en Robotique 2019☆727Updated 2 years ago
- A collection of 100+ pre-trained RL agents using Stable Baselines, training and hyperparameter optimization included.☆1,196Updated 3 years ago
- Deep Double Q-Learning implementation introduced by Hasselt et al in this paper: https://arxiv.org/abs/1509.06461. It's interfacing with…☆30Updated 9 years ago
- A bot for financial signal☆62Updated 8 years ago
- Python and R tutorial for RLCard in Jupyter Notebook☆97Updated 3 years ago
- Implementation of Schmidhuber's Upside Down Reinforcement Learning paper in PyTorch☆27Updated 6 years ago
- Deep Dyna-Q: Integrating Planning for Task-Completion Dialogue Policy Learning☆153Updated 7 years ago
- Offline Reinforcement Learning (aka Batch Reinforcement Learning) on Atari 2600 games☆559Updated 2 years ago