☆19Oct 21, 2021Updated 4 years ago
Alternatives and similar repositories for CrossDQN
Users that are interested in CrossDQN are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆22Mar 30, 2026Updated last month
- This project contains several Deep Reinforcement Learning method and some experiments basd on OpenAi gym.☆19Jan 28, 2018Updated 8 years ago
- A Python and Jsonnet framework for handling espanso configurations☆11Oct 6, 2025Updated 7 months ago
- SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning☆131Mar 21, 2021Updated 5 years ago
- ☆44Sep 19, 2021Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆53Aug 11, 2021Updated 4 years ago
- 跑得快AI 采用蒙特卡洛算法☆13Oct 20, 2017Updated 8 years ago
- Offline RL algoritms implemented in Stable Baselines3 (pytorch)☆10Dec 7, 2021Updated 4 years ago
- Controllable Multi-Objective Re-ranking with Policy Hypernetworks (KDD 2023)☆38Oct 6, 2024Updated last year
- personal web,个人网站开发,使用vue3+vite+element-plus技术。包括登录、大模型对话、博客记录、项目仓库、付费使用等功能☆10Sep 24, 2024Updated last year
- ☆16Feb 15, 2018Updated 8 years ago
- Code for Mildly Conservative Q-learning for Offline Reinforcement Learning (NeurIPS 2022)☆62Apr 29, 2024Updated 2 years ago
- Source code for paper "Generative Flow Network for Listwise Recommendation"☆17Nov 8, 2024Updated last year
- source code for AAMAS 2023 Imperfect-information Card Game Competition☆13Mar 21, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- code for the paper "Personalized Re-ranking for E-commerce Recommender Systems"☆30Feb 8, 2022Updated 4 years ago
- Code for 'Diff-MSR: A Diffusion Model Enhanced Paradigm for Cold-Start Multi-Scenario Recommendation' accepted to WSDM 2024☆13Aug 1, 2025Updated 9 months ago
- ☆18Apr 11, 2024Updated 2 years ago
- Top 1 - code for rong360 data mining competition☆49May 26, 2017Updated 8 years ago
- ☆31Jan 16, 2023Updated 3 years ago
- Implementation and experimental comparison of ES-DFM (Yang et al. 2021), Delayed feedback model(DFM, Chapelle 2014), Feedback Shift Impor…☆84Aug 31, 2023Updated 2 years ago
- ☆16Oct 5, 2021Updated 4 years ago
- LibRerank is a toolkit for re-ranking algorithms. There are a number of re-ranking algorithms, such as PRM, DLCM, GSF, miDNN, SetRank, EG…☆269Feb 21, 2022Updated 4 years ago
- Deep Reinforcement Learning for Movies Recommendation System☆82Jan 5, 2020Updated 6 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- 用强化学习来玩微信跳一跳☆12Jul 10, 2022Updated 3 years ago
- Variational Information Bottleneck☆16Nov 26, 2018Updated 7 years ago
- Python Package for EIT(Electric Impedance Tomography)-like problems using Gauss-Newton method.☆16Nov 5, 2025Updated 6 months ago
- Machine Learning with Graphs (CS224W) project.☆13Dec 16, 2021Updated 4 years ago
- RL Recommendation System☆13Aug 30, 2019Updated 6 years ago
- 2021 QQ浏览器ai算法大赛 赛道一 决赛第17名☆17Oct 25, 2022Updated 3 years ago
- ☆12Jan 5, 2023Updated 3 years ago
- 2019年腾讯广告算法大赛rank68☆14Jun 14, 2019Updated 6 years ago
- A RAG system is just the beginning of harnessing the power of LLM. The next step is creating an intelligent Agent. In Agentic RAG the Ag…☆14May 31, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆59Dec 18, 2024Updated last year
- Benchmark data for d3rlpy☆21Nov 28, 2023Updated 2 years ago
- Explore the potential of recommendation system using reinforcement learning☆15Apr 23, 2020Updated 6 years ago
- Neural Fuzzy Repair (NFR) is a data augmentation pipeline, which integrates fuzzy matches (i.e. similar translations) into neural machine…☆12Aug 14, 2024Updated last year
- ☆22Dec 18, 2023Updated 2 years ago
- 2020腾讯广告算法大赛方案分享及代码(冠军)☆14May 1, 2023Updated 3 years ago
- Recommender system based on Flink and Reinforcement Learning☆110Nov 1, 2020Updated 5 years ago