AaronJi/RL

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/AaronJi/RL)

AaronJi / RL

A set of RL experiments. Currently including: (1) the MDP rank experiment, based on policy gradient algorithm

☆27

Alternatives and similar repositories for RL

Users that are interested in RL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

abhi1345 / deep-q-rank
View on GitHub
A deep reinforcement learning approach to search engine ranking (PyTorch). Final Project for UC Berkeley's CS 285: Deep Reinforcement Lea…
☆27May 5, 2024Updated 2 years ago
LihangLiu / Generator-Evaluator
View on GitHub
☆12Jun 17, 2019Updated 7 years ago
xeniaqian94 / RLeToR
View on GitHub
A PyTorch implementation of REINFORCE Learning To Rank on OSHUMED, MQ, etc. dataset. Basic idea also appears in SIGIR'17 Reinforcement Le…
☆18Dec 8, 2017Updated 8 years ago
XueyingBai / Model-Based-Reinforcement-Learning-for-Online-Recommendation
View on GitHub
A pytorch implementation of A Model-Based Reinforcement Learning with Adversarial Training for Online Recommendation.
☆40Nov 26, 2019Updated 6 years ago
appurwar / Contextual-Bandit-News-Article-Recommendation
View on GitHub
Predict and recommend the news articles, user is most likely to click in real time.
☆32Apr 3, 2018Updated 8 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
SunwardTree / TDDPG-Rec
View on GitHub
The code to reproduce the experimental results for "A Text-based Deep Reinforcement Learning Framework for Interactive Recommendation".
☆12Mar 18, 2021Updated 5 years ago
ieyjzhou / KmeansPlusPlus
View on GitHub
k-means++: a C++ version implement
☆19Dec 26, 2017Updated 8 years ago
nuster1128 / RecBole-MetaRec
View on GitHub
RecBole-MetaRec is an extended module for RecBole, which aims to help researchers to compare and develop their own models in the meta lea…
☆26Feb 8, 2023Updated 3 years ago
antoine-hochart / bandit_algo_evaluation
View on GitHub
Offline evaluation of multi-armed bandit algorithms
☆23Dec 1, 2020Updated 5 years ago
vaimee / desmo
View on GitHub
A distributed Oracle system for IoT data
☆11Apr 12, 2023Updated 3 years ago
boschresearch / DD_OPG
View on GitHub
Implementation prototype of the Deep Deterministic Off-Policy Gradient (DD-OPG) method.
☆11Jun 12, 2019Updated 7 years ago
jvanz / libwarc
View on GitHub
C++ library to parse WARC files
☆11Jan 27, 2019Updated 7 years ago
manxing-du / cmdp-rtb
View on GitHub
☆10Apr 18, 2017Updated 9 years ago
hpclab / efficient-query-expansion
View on GitHub
Official repository of "Efficient and Effective Query Expansion for Web Search", Short Paper @ CIKM 2018
☆15Nov 17, 2019Updated 6 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
sjubertie / teaching-SIMD
View on GitHub
Lecture on SIMD units
☆11Feb 28, 2017Updated 9 years ago
Applied-Machine-Learning-Lab / Diff-MSR
View on GitHub
Code for 'Diff-MSR: A Diffusion Model Enhanced Paradigm for Cold-Start Multi-Scenario Recommendation' accepted to WSDM 2024
☆15Aug 1, 2025Updated 11 months ago
gy910210 / exact-k-recommendation
View on GitHub
Code for paper 'Exact-K Recommendation via Maximal Clique Optimization'
☆79Feb 26, 2020Updated 6 years ago
BorgwardtLab / Kernelized-Rank-Learning
View on GitHub
Kernelized rank learning for personalized drug recommendation
☆16Oct 8, 2018Updated 7 years ago
greensky00 / latency-collector
View on GitHub
Latency collector as an embedded library for C++
☆13May 26, 2019Updated 7 years ago
praekeltfoundation / docker-ci-deploy
View on GitHub
Python script to help push Docker images to a registry using CI services
☆20Dec 18, 2018Updated 7 years ago
bigganbing / Fairseq_MorphTE
View on GitHub
[NeurIPS 2022]MorphTE: Injecting Morphology in Tensorized Embeddings
☆17Oct 29, 2022Updated 3 years ago
amallia / gpu-integers-compression
View on GitHub
GPU-Accelerated Faster Decoding of Integer Lists
☆13Aug 20, 2019Updated 6 years ago
weicy15 / GCS
View on GitHub
☆14Nov 21, 2023Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
lemon234071 / TransformerBaselines
View on GitHub
☆23Dec 31, 2020Updated 5 years ago
rmit-ir / joint-cascade-ranking
View on GitHub
Joint Optimization of Cascade Ranking Models (WSDM 19)
☆13Jun 21, 2022Updated 4 years ago
pfsir / StockAssistant
View on GitHub
股票/基金/债券的相关信息的协助应用。开发原因主要是不想装太多app，比如集思录，蛋卷之类的，把他们部分数据集合到这个app上
☆10Sep 15, 2021Updated 4 years ago
oservo / aiFi
View on GitHub
A Multi Layer Perceptron (MLP) Artificial Neural Network (ANN) Framework Developed in C for Machine Learning (ML) and Deep Learning (DL)
☆11May 4, 2025Updated last year
nitingupta910 / TripleBit
View on GitHub
RDF Graph Database (http://grid.hust.edu.cn/triplebit/)
☆11Sep 19, 2014Updated 11 years ago
bhavik08 / Group-movie-recommender-system
View on GitHub
Matrix Factorization based Movie Recommender System for group of users.
☆14May 4, 2017Updated 9 years ago
egipcy / LIRD
View on GitHub
Deep Reinforcement Learning for Movies Recommendation System
☆83Jan 5, 2020Updated 6 years ago
miniHuiHui / SimpleRL-reason-GRPO
View on GitHub
☆12Feb 27, 2025Updated last year
pirate / OS-X-Security-and-Privacy-Guide
View on GitHub
A practical guide to securing OS X
☆10Apr 25, 2016Updated 10 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
Kingsford-Group / splitsbt
View on GitHub
☆18Jul 9, 2018Updated 8 years ago
bwanglzu / Maximal-Marginal-Relevance
View on GitHub
MMR for information retrieval
☆18Sep 22, 2017Updated 8 years ago
hhhwmws0117 / GLM-VITS-SadTalker
View on GitHub
浅尝LLM
☆33Jun 14, 2023Updated 3 years ago
xinshi-chen / GenerativeAdversarialUserModel
View on GitHub
Tensorflow implementation for "Generative Adversarial User Model forReinforcement Learning Based Recommendation System"
☆131Sep 10, 2019Updated 6 years ago
dodoyang0929 / DGG
View on GitHub
☆12Jun 18, 2020Updated 6 years ago
WeiyuCheng / AFN-AAAI-20
View on GitHub
Source codes for our AAAI'20 paper: Adaptive Factorization Network: Learning Adaptive-Order Feature Interactions
☆38Sep 22, 2020Updated 5 years ago
burmanm / compression-int
View on GitHub
64-bit integer compression algorithms in Java
☆15Nov 11, 2018Updated 7 years ago