A deep reinforcement learning approach to search engine ranking (PyTorch). Final Project for UC Berkeley's CS 285: Deep Reinforcement Learning, Decision Making, and Control
☆27 · May 5, 2024 · Updated last year
Alternatives and similar repositories for deep-q-rank
Users interested in deep-q-rank are comparing it to the libraries listed below.
- A set of RL experiments. Currently including: (1) the MDP rank experiment, based on a policy gradient algorithm ☆27 · Feb 7, 2022 · Updated 4 years ago
- A PyTorch implementation of REINFORCE Learning To Rank on OHSUMED, MQ, etc. datasets. Basic idea also appears in SIGIR'17 Reinforcement Le… ☆18 · Dec 8, 2017 · Updated 8 years ago
- ☆12 · Jun 17, 2019 · Updated 6 years ago
- Python interface for the Berkeley Parser using JPype ☆12 · Dec 18, 2015 · Updated 10 years ago
- Project for Berkeley Deep RL course: using deep reinforcement learning for segmentation of medical images ☆20 · Dec 13, 2018 · Updated 7 years ago
- Implementation of Deep RL Algorithms for UC Berkeley's CS 294-112 (Fall 2018): Deep Reinforcement Learning ☆24 · Feb 19, 2019 · Updated 7 years ago
- Reference implementation of algorithms for reinforcement learning and Markov decision processes. ☆12 · Jan 28, 2021 · Updated 5 years ago
- Offline evaluation of multi-armed bandit algorithms ☆23 · Dec 1, 2020 · Updated 5 years ago
- Data as Demonstrator (DaD) is a meta learning algorithm to improve the multi-step predictive capabilities of a learned time series (e.g. … ☆35 · Aug 25, 2016 · Updated 9 years ago
- Variable-order CRFs with structure learning ☆17 · Aug 1, 2024 · Updated last year
- Off-policy Learning in Two-stage Recommender Systems. https://dl.acm.org/doi/pdf/10.1145/3366423.3380130 ☆30 · Jun 11, 2020 · Updated 5 years ago
- Learning to Recommend using a Deep Reinforcement Agent ☆23 · Apr 2, 2017 · Updated 8 years ago
- ☆18 · Apr 25, 2023 · Updated 2 years ago
- PyTorch implementation of the AAMAS 2021 paper "Energy-Based Imitation Learning" ☆11 · Oct 8, 2021 · Updated 4 years ago
- Density Constrained Reinforcement Learning ☆12 · Mar 24, 2023 · Updated 3 years ago
- Causal Fairness Analysis ☆20 · Apr 16, 2025 · Updated 11 months ago
- Repository for SIGIR'18 paper: "Ranking for Relevance and Display Preferences in Complex Presentation Layouts" ☆16 · Aug 28, 2018 · Updated 7 years ago
- Second project for UW LING 572. Automatic text summarization system. ☆13 · Mar 21, 2013 · Updated 13 years ago
- My PhD thesis, titled "Reasonably Programmable Syntax" ☆15 · Aug 28, 2018 · Updated 7 years ago
- CADRE: Contextual Attention-based Drug REsponse ☆12 · Nov 23, 2020 · Updated 5 years ago
- Kernelized rank learning for personalized drug recommendation ☆16 · Oct 8, 2018 · Updated 7 years ago
- Deep Continuous Quantile Regression and other experiments. ☆13 · Feb 24, 2020 · Updated 6 years ago
- MLE-Guided Parameter Search (AAAI 2021) ☆12 · Sep 16, 2021 · Updated 4 years ago
- Implementation/experiments for L4DC 2020 submission "Optimal Cost Design for Model Predictive Control" ☆12 · Apr 23, 2021 · Updated 4 years ago
- My solutions to Berkeley's CS294 (Deep Reinforcement Learning) Homework ☆36 · Jun 27, 2018 · Updated 7 years ago
- Code for "Learning Deep Features in Instrumental Variable Regression" (https://arxiv.org/abs/2010.07154) ☆16 · Sep 16, 2024 · Updated last year
- Reimplementation of "An Object-Oriented Representation for Efficient RL" ☆16 · Sep 12, 2024 · Updated last year
- A Django app for handling session exchange for WebRTC peer connections ☆13 · Mar 7, 2015 · Updated 11 years ago
- A game search and evaluation parameter tuner using the Optuna framework ☆14 · Jan 20, 2023 · Updated 3 years ago
- Hands-on code for the Kaggle competition Jane Street Market Prediction ☆13 · Feb 4, 2021 · Updated 5 years ago
- Learning to play Atari games with reinforcement learning ☆10 · Jan 4, 2016 · Updated 10 years ago
- [ICML 2022] Robust Deep Reinforcement Learning through Bootstrapped Opportunistic Curriculum ☆11 · Jul 15, 2022 · Updated 3 years ago
- Matrix Factorization based Movie Recommender System for a group of users. ☆14 · May 4, 2017 · Updated 8 years ago
- Implicit Distributional Actor Critic ☆11 · Dec 8, 2021 · Updated 4 years ago
- Ranking Policy Gradient ☆23 · Nov 27, 2019 · Updated 6 years ago
- Official implementation for the paper: "Shallow Updates for Deep Reinforcement Learning" ☆18 · Nov 2, 2017 · Updated 8 years ago
- Contains an implementation of "Imitation Learning via Kernel Mean Embedding (2018, AAAI)" ☆11 · Oct 2, 2018 · Updated 7 years ago
- Model-Free-Episodic-Control implementation. ☆17 · Jun 3, 2019 · Updated 6 years ago
- R/txshift: Efficient Estimation of the Causal Effects of Stochastic Interventions, with Corrections for Outcome-Dependent Sampling ☆13 · Sep 21, 2024 · Updated last year