A deep reinforcement learning approach to search engine ranking (PyTorch). Final Project for UC Berkeley's CS 285: Deep Reinforcement Learning, Decision Making, and Control
☆27May 5, 2024Updated 2 years ago
Alternatives and similar repositories for deep-q-rank
Users that are interested in deep-q-rank are comparing it to the libraries listed below.
- A set of RL experiments. Currently including: (1) the MDP rank experiment, based on a policy gradient algorithm☆27Feb 7, 2022Updated 4 years ago
- A PyTorch implementation of REINFORCE Learning To Rank on the OHSUMED, MQ, etc. datasets. Basic idea also appears in SIGIR'17 Reinforcement Le…☆18Dec 8, 2017Updated 8 years ago
- ☆12Jun 17, 2019Updated 6 years ago
- Python interface for the Berkeley Parser using JPype☆12Dec 18, 2015Updated 10 years ago
- Multi-armed bandits for dynamic movie recommendations☆14Nov 20, 2019Updated 6 years ago
- Reference implementation of algorithms for reinforcement learning and Markov decision processes.☆12Jan 28, 2021Updated 5 years ago
- Offline evaluation of multi-armed bandit algorithms☆23Dec 1, 2020Updated 5 years ago
- Implementation prototype of the Deep Deterministic Off-Policy Gradient (DD-OPG) method.☆11Jun 12, 2019Updated 6 years ago
- Variable-order CRFs with structure learning☆17Aug 1, 2024Updated last year
- Off-policy Learning in Two-stage Recommender Systems. https://dl.acm.org/doi/pdf/10.1145/3366423.3380130☆30Jun 11, 2020Updated 5 years ago
- ☆10Apr 18, 2017Updated 9 years ago
- Learning to Recommend using a Deep Reinforcement Agent☆23Apr 2, 2017Updated 9 years ago
- PyTorch implementation of the AAMAS 2021 paper "Energy-Based Imitation Learning"☆11Oct 8, 2021Updated 4 years ago
- Density Constrained Reinforcement Learning☆12Mar 24, 2023Updated 3 years ago
- Rank-DistiLLM: Closing the Effectiveness Gap Between Cross-Encoders and LLMs for Passage Re-Ranking☆25Apr 4, 2025Updated last year
- Making high-accuracy and visually-interpretable decision tree-based models for semantic segmentation http://segnbdt.aaalv.in☆11Oct 12, 2021Updated 4 years ago
- My PhD thesis, titled "Reasonably Programmable Syntax"☆15Aug 28, 2018Updated 7 years ago
- CADRE: Contextual Attention-based Drug REsponse☆12Nov 23, 2020Updated 5 years ago
- Deep Continuous Quantile Regression and other experiments.☆13Feb 24, 2020Updated 6 years ago
- MLE-Guided Parameter Search (AAAI 2021)☆12Sep 16, 2021Updated 4 years ago
- The Trade Desk Api☆10Jul 8, 2020Updated 5 years ago
- Implementation/experiments for L4DC 2020 submission "Optimal Cost Design for Model Predictive Control"☆12Apr 23, 2021Updated 5 years ago
- My solutions to Berkeley's CS294 (Deep Reinforcement Learning) Homework☆36Jun 27, 2018Updated 7 years ago
- Official web interface for pyLoad☆15Aug 23, 2020Updated 5 years ago
- Code to reproduce experiments appearing in the academic paper Lost Relatives of the Gumbel Trick☆17Jun 14, 2017Updated 8 years ago
- Transfer Learning of Graph Neural Networks with Ego-graph Information Maximization (NeurIPS '21)☆23Dec 9, 2021Updated 4 years ago
- Simulation and Visualization tool for the Robot Raconteur robotics middleware☆11May 6, 2020Updated 6 years ago
- Benchmark Python and Cython code☆13Jun 13, 2014Updated 11 years ago
- A game search and evaluation parameter tuner using optuna framework☆14Jan 20, 2023Updated 3 years ago
- Hands-on code for the Kaggle competition Jane Street Market Prediction☆13Feb 4, 2021Updated 5 years ago
- Learning to play Atari games with reinforcement learning☆10Jan 4, 2016Updated 10 years ago
- NeurIPS 2020 Spotlight Paper☆13Dec 20, 2021Updated 4 years ago
- [ICML 2022] Robust Deep Reinforcement Learning through Bootstrapped Opportunistic Curriculum☆11Jul 15, 2022Updated 3 years ago
- Matrix Factorization based Movie Recommender System for group of users.☆14May 4, 2017Updated 9 years ago
- Source Code of IJCAI 2022 paper "Fine-Tuning Graph Neural Networks via Graph Topology induced Optimal Transport"☆23Aug 22, 2022Updated 3 years ago
- Ranking Policy Gradient☆23Nov 27, 2019Updated 6 years ago
- Library for fast text representation and classification.☆10Apr 17, 2022Updated 4 years ago
- Implicit Distributional Actor Critic☆11Dec 8, 2021Updated 4 years ago
- Blog post: how to do deterministic policy gradient with gumbel softmax and why you should do it.☆12Jun 20, 2017Updated 8 years ago