A set of RL experiments. Currently including: (1) the MDP rank experiment, based on policy gradient algorithm
☆27Feb 7, 2022Updated 4 years ago
Alternatives and similar repositories for RL
Users that are interested in RL are comparing it to the libraries listed below
Sorting:
- k-means++: a C++ version implement☆18Dec 26, 2017Updated 8 years ago
- ☆22Dec 7, 2023Updated 2 years ago
- ☆23Dec 31, 2020Updated 5 years ago
- Dota 2 API for machine learning☆25Dec 13, 2018Updated 7 years ago
- Off-policy Learning in Two-stage Recommender Systems. https://dl.acm.org/doi/pdf/10.1145/3366423.3380130☆30Jun 11, 2020Updated 5 years ago
- A tool to paste Excel ranges to Reddit☆11Sep 20, 2025Updated 5 months ago
- Big Data Analysis of Tinder done at Universitat Rovira i Virgili and Universitat Politècnica de Catalunya · BarcelonaTech☆13Jan 3, 2023Updated 3 years ago
- Comparative Study and Implementation of Five Factor Model and Myers-Briggs Type Indicator Model☆11Sep 28, 2023Updated 2 years ago
- Multiprocessing in python☆10Aug 20, 2021Updated 4 years ago
- Predict and recommend the news articles, user is most likely to click in real time.☆32Apr 3, 2018Updated 7 years ago
- 李鲁鲁老师的 Copilot-Python 学习。和ChatGPT等大语言模型协同进化。☆10Jun 3, 2025Updated 9 months ago
- A comparison of Google SlateQ algorithm with traditional Reinforcement Learning algorithms☆39Dec 27, 2022Updated 3 years ago
- Dataset and codes for SEntFiN☆10May 31, 2023Updated 2 years ago
- 股票/基金/债券的相关信息的协助应用。开发原因主要是不想装太多app,比如集思录,蛋卷之类的,把他们部分数据集合到这个app上☆11Sep 15, 2021Updated 4 years ago
- A distributed Oracle system for IoT data☆11Apr 12, 2023Updated 2 years ago
- Code for Beyond Generic: Enhancing Image Captioning with Real-World Knowledge using Vision-Language Pre-Training Model☆13Feb 15, 2024Updated 2 years ago
- 小鸡词典🐤的Alfred🎩插件 咯咯咯☆11Apr 19, 2023Updated 2 years ago
- Inspirational post ids collected from Reddit using pushift.io and RoBERTa☆10Jan 18, 2024Updated 2 years ago
- 记录有用的Git repos☆12Jul 28, 2024Updated last year
- YouTube Assistant☆12May 15, 2023Updated 2 years ago
- DNH Werewolf Discord bot☆13Dec 19, 2024Updated last year
- A Multi Layer Perceptron (MLP) Artificial Neural Network (ANN) Framework Developed in C for Machine Learning (ML) and Deep Learning (DL)☆11May 4, 2025Updated 9 months ago
- ☆13Nov 15, 2017Updated 8 years ago
- ☆11Feb 23, 2026Updated last week
- This project aims to find "what are the trending techs on Data Science jobs?" using NER.☆12Mar 13, 2022Updated 3 years ago
- A mathematical model for Fibonacci Retracement and location entry and exit formulation using ML☆10Aug 2, 2022Updated 3 years ago
- ☆11Jun 5, 2024Updated last year
- x-transformers-paddle 2.x version☆10May 28, 2023Updated 2 years ago
- This repository contains dataset for paper FedNLP: An interpretable NLP System to Decode Federal Reserve Communications, published in SIG…☆15Feb 7, 2024Updated 2 years ago
- ☆14Sep 30, 2022Updated 3 years ago
- My learning note in monash FIT course include fit9131 fit9132 fit9136 fit5032 fit5057 fit5136 fit5125☆15Nov 3, 2022Updated 3 years ago
- Language Modelling, CMI vs Perplexity☆11Mar 17, 2018Updated 7 years ago
- ACL Rolling Review website☆11Feb 24, 2026Updated last week
- 使用自然语言绘制流程图,基于OpenAI☆12Nov 13, 2023Updated 2 years ago
- Get the best daily repositories☆10Updated this week
- PyTorch implementation for PaLM: A Hybrid Parser and Language Model.☆10Jan 7, 2020Updated 6 years ago
- ☆13May 25, 2023Updated 2 years ago
- C++ library to parse WARC files☆11Jan 27, 2019Updated 7 years ago
- ☆11Mar 13, 2023Updated 2 years ago