Courses on Deep Reinforcement Learning (DRL) and DRL papers for recommender systems
☆13 · Jul 7, 2022 · Updated 3 years ago
Alternatives and similar repositories for DRL4Recsys
Users that are interested in DRL4Recsys are comparing it to the libraries listed below.
- An agent that performs user actions on a workstation ☆13 · Jan 22, 2018 · Updated 8 years ago
- Course project for https://deeppavlov.ai/rl_course_2020 ☆39 · Jan 2, 2023 · Updated 3 years ago
- ☆40 · Nov 16, 2022 · Updated 3 years ago
- ☆10 · Feb 18, 2020 · Updated 6 years ago
- GRPO to train long-form QA and instructions with a long-form reward model ☆17 · Jul 17, 2025 · Updated 10 months ago
- Test-time Fourier Style Calibration for Domain Generalization - IJCAI 2022 ☆16 · Jul 21, 2022 · Updated 3 years ago
- ☆28 · Jul 18, 2025 · Updated 10 months ago
- Source code for our paper "Joint Policy-Value Learning for Recommendation" published at KDD 2020. ☆23 · Jul 6, 2023 · Updated 2 years ago
- This repository is for the "LLM-Aligned Geographic Item Tokenization for Local-Life Recommendation". ☆18 · Nov 18, 2025 · Updated 6 months ago
- KERL: A Knowledge-Guided Reinforcement Learning Model for Sequential Recommendation ☆51 · Jun 25, 2020 · Updated 5 years ago
- Courses on Deep Reinforcement Learning (DRL) and DRL papers for recommender systems ☆306 · May 4, 2023 · Updated 3 years ago
- ☆14 · Aug 28, 2024 · Updated last year
- RL Recommendation System ☆13 · Aug 30, 2019 · Updated 6 years ago
- Nanjing University undergraduate thesis template ☆13 · Jun 1, 2016 · Updated 10 years ago
- ☆14 · Jun 6, 2020 · Updated 6 years ago
- Official PyTorch implementation for the ICML 2023 paper "Out-of-Distribution Generalization of Federated Learning via Implicit Invariant … ☆14 · Oct 31, 2023 · Updated 2 years ago
- [ICCVW 2021] Rethinking Content and Style: Exploring Bias for Unsupervised Disentanglement ☆20 · Aug 18, 2021 · Updated 4 years ago
- Source code for the paper "Controlling the Risk of Conversational Search via Reinforcement Learning" and "Simulating and Modeling the Ris… ☆12 · Aug 11, 2023 · Updated 2 years ago
- Source code of the paper "[CIKM 2023] Task-Difficulty-Aware Meta-Learning with Adaptive Update Strategies for User Cold-Start Recommendat… ☆10 · Oct 27, 2023 · Updated 2 years ago
- An open source video conferencing tool for the XO laptop ☆16 · Sep 20, 2013 · Updated 12 years ago
- Dataset Batch (offline) Reinforcement Learning for recommender system ☆154 · Nov 1, 2020 · Updated 5 years ago
- A curated list of disentanglement in NLP. :-) ☆17 · Oct 31, 2021 · Updated 4 years ago
- Paper list in the area of reinforcement learning for recommendation systems ☆25 · Aug 4, 2020 · Updated 5 years ago
- Code repo for the NeurIPS 2021 paper "Online Adaption to Label Distribution Shift". ☆16 · Feb 15, 2023 · Updated 3 years ago
- ☆54 · Aug 26, 2018 · Updated 7 years ago
- Implementation of 6 DQN extension methods using Pytorch. (RAINBOW) ☆16 · Dec 7, 2020 · Updated 5 years ago
- Code for paper "Adversarial Support Alignment" ☆23 · Apr 22, 2022 · Updated 4 years ago
- Recommendation system with actor and critic ☆18 · Aug 10, 2022 · Updated 3 years ago
- The final code submission of Meta_Learners, which won the second place in NIPS 2018 AutoML Challenge ☆17 · Dec 8, 2018 · Updated 7 years ago
- Deep Reinforcement Learning for Movies Recommendation System ☆83 · Jan 5, 2020 · Updated 6 years ago
- Code for Multi-Aspect Cross-modal Quantization for Generative Recommendation. (AAAI 2026 Oral) ☆41 · Dec 9, 2025 · Updated 6 months ago
- ☆47 · Jan 8, 2021 · Updated 5 years ago
- Compiler principles: a lexical analyzer ☆14 · Oct 30, 2017 · Updated 8 years ago
- This is the source code for HDNO: a hierarchical model for task-oriented dialogue system. ☆18 · Dec 7, 2022 · Updated 3 years ago
- Dataset of conversations, generated by prompting Gemini Ultra. These are conversations between a teacher and a student, where the teacher… ☆36 · Oct 29, 2024 · Updated last year
- This repo contains code for paper: "Uncertainty Estimation and Quantification for LLMs: A Simple Supervised Approach". ☆25 · Oct 21, 2024 · Updated last year
- Survey for Distribution Shift ☆19 · Jun 1, 2021 · Updated 5 years ago
- Collaborative Translational Metric Learning (ICDM 2018) ☆18 · Sep 15, 2021 · Updated 4 years ago
- KuaiSearch PERKS ☆12 · Nov 16, 2021 · Updated 4 years ago