A set of RL experiments. Currently including: (1) the MDP rank experiment, based on policy gradient algorithm
☆27Feb 7, 2022Updated 4 years ago
Alternatives and similar repositories for RL
Users that are interested in RL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A deep reinforcement learning approach to search engine ranking (PyTorch). Final Project for UC Berkeley's CS 285: Deep Reinforcement Lea…☆27May 5, 2024Updated 2 years ago
- ☆12Jun 17, 2019Updated 7 years ago
- A PyTorch implementation of REINFORCE Learning To Rank on OSHUMED, MQ, etc. dataset. Basic idea also appears in SIGIR'17 Reinforcement Le…☆18Dec 8, 2017Updated 8 years ago
- A pytorch implementation of A Model-Based Reinforcement Learning with Adversarial Training for Online Recommendation.☆40Nov 26, 2019Updated 6 years ago
- Off-policy Learning in Two-stage Recommender Systems. https://dl.acm.org/doi/pdf/10.1145/3366423.3380130☆30Jun 11, 2020Updated 6 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- The code to reproduce the experimental results for "A Text-based Deep Reinforcement Learning Framework for Interactive Recommendation".☆12Mar 18, 2021Updated 5 years ago
- Offline evaluation of multi-armed bandit algorithms☆23Dec 1, 2020Updated 5 years ago
- [CVPR 2025 Highlight] FIMA-Q: Post-Training Quantization for Vision Transformers by Fisher Information Matrix Approximation☆29Jun 16, 2025Updated last year
- C++ library to parse WARC files☆11Jan 27, 2019Updated 7 years ago
- ☆10Apr 18, 2017Updated 9 years ago
- Learning to Recommend using a Deep Reinforcement Agent☆23Apr 2, 2017Updated 9 years ago
- Official repository of "Efficient and Effective Query Expansion for Web Search", Short Paper @ CIKM 2018☆15Nov 17, 2019Updated 6 years ago
- C++ heterogeneous and lock-free containers☆13Sep 5, 2018Updated 7 years ago
- ☆13Nov 15, 2017Updated 8 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Code for 'Diff-MSR: A Diffusion Model Enhanced Paradigm for Cold-Start Multi-Scenario Recommendation' accepted to WSDM 2024☆14Aug 1, 2025Updated 11 months ago
- Lecture on SIMD units☆11Feb 28, 2017Updated 9 years ago
- A dynamic version of std::bitset☆17Aug 25, 2013Updated 12 years ago
- ☆10May 22, 2023Updated 3 years ago
- Latency collector as an embedded library for C++☆13May 26, 2019Updated 7 years ago
- 用强化学习来玩微信跳一跳☆12Jul 10, 2022Updated 3 years ago
- Pretraining summarization models using a corpus of nonsense☆13Sep 28, 2021Updated 4 years ago
- An extensive and commented list of resources on Learned Sparse Retrieval.☆61Jun 12, 2026Updated 2 weeks ago
- TrOCR but 2 to 3 times faster☆11Oct 22, 2022Updated 3 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆23Dec 31, 2020Updated 5 years ago
- Code for "Learning Deep Features in Instrumental Variable Regression" (https://arxiv.org/abs/2010.07154)☆16Sep 16, 2024Updated last year
- Compressed Bitmap in C++ for bitmap Indexes.☆11Dec 17, 2019Updated 6 years ago
- GPU-Accelerated Faster Decoding of Integer Lists☆13Aug 20, 2019Updated 6 years ago
- Joint Optimization of Cascade Ranking Models (WSDM 19)☆13Jun 21, 2022Updated 4 years ago
- Sequential recommendation algorithm☆28Dec 28, 2018Updated 7 years ago
- Self-Supervised Alignment with Mutual Information☆20May 24, 2024Updated 2 years ago
- ☆12Feb 27, 2025Updated last year
- A RAG system is just the beginning of harnessing the power of LLM. The next step is creating an intelligent Agent. In Agentic RAG the Ag…☆14May 31, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A Multi Layer Perceptron (MLP) Artificial Neural Network (ANN) Framework Developed in C for Machine Learning (ML) and Deep Learning (DL)☆11May 4, 2025Updated last year
- A game search and evaluation parameter tuner using optuna framework☆14Jun 20, 2026Updated last week
- A fast, compact trigram library for Icelandic.☆12Jun 11, 2026Updated 3 weeks ago
- RDF Graph Database (http://grid.hust.edu.cn/triplebit/)☆11Sep 19, 2014Updated 11 years ago
- RLlib tutorials☆66Jan 2, 2022Updated 4 years ago
- A reference implementation of std::simd, providing data parallel types in the C++ standard☆14Mar 9, 2020Updated 6 years ago
- DocId set compression and set operation library☆27Apr 16, 2014Updated 12 years ago