ustcljb/topK-off-policy-correction-REINFORCE

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ustcljb/topK-off-policy-correction-REINFORCE)

ustcljb / topK-off-policy-correction-REINFORCE

☆19

Alternatives and similar repositories for topK-off-policy-correction-REINFORCE

Users that are interested in topK-off-policy-correction-REINFORCE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

mercurialgh / Reproduce-of-Top-K-Off-Policy-Correction-for-a-REINFORCE-Recommender-System
View on GitHub
Reproduce of Top-K Off-Policy Correction for a REINFORCE Recommender System
☆26Jul 15, 2020Updated 6 years ago
BestActionNow / Slate_Aware_Ranking
View on GitHub
The implementation for our paper "Slate-Aware Ranking for Recommendation" accepted by WSDM.23
☆16Dec 13, 2022Updated 3 years ago
CW-Huang / BayesianHypernet
View on GitHub
☆17Jan 5, 2018Updated 8 years ago
jim-meyer / lottery_ticket_pruner
View on GitHub
(Personal project) Pruning algorithm for DNNs using "lottery ticket" pruning
☆10Dec 8, 2022Updated 3 years ago
zhangsi / CisRec
View on GitHub
Cis Recommender
☆16May 1, 2012Updated 14 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
machenslab / elife2016dpca
View on GitHub
Preprocessing and analysis scripts for the dPCA paper (eLife 2016)
☆10Jul 15, 2016Updated 10 years ago
awarebayes / RecNN
View on GitHub
Reinforced Recommendation toolkit built around pytorch 1.7
☆589Dec 8, 2020Updated 5 years ago
franrruiz / uivi
View on GitHub
Code for Unbiased Implicit Variational Inference (UIVI)
☆15Jan 18, 2019Updated 7 years ago
astirn / MV-Kumaraswamy
View on GitHub
☆12Dec 8, 2022Updated 3 years ago
zhijie-ai / Top-K-Off-Policy-Correction-REINFORCE
View on GitHub
☆25Dec 7, 2020Updated 5 years ago
jiaqima / Off-Policy-2-Stage
View on GitHub
Off-policy Learning in Two-stage Recommender Systems. https://dl.acm.org/doi/pdf/10.1145/3366423.3380130
☆30Jun 11, 2020Updated 6 years ago
xuyuandong / simple-ddpg
View on GitHub
☆23Dec 5, 2018Updated 7 years ago
glajoie / MAT6115_Dynamical_Systems
View on GitHub
Support material for MAT6115, Université de Montréal, Fall 2018
☆28Jan 2, 2024Updated 2 years ago
modriczhang / HRL-Rec
View on GitHub
"Hierarchical Reinforcement Learning for Integrated Recommendation" (AAAI 2021) https://ojs.aaai.org/index.php/AAAI/article/view/16580
☆58Sep 12, 2021Updated 4 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
SamuelGabriel / LotteryTicketHypothesis-TensorFlow
View on GitHub
Implementation of the most important parts of the Lottery Ticket Hypothesis Paper
☆12Jul 2, 2018Updated 8 years ago
AiHubCN / Awesome-Sequence-Modeling-for-Recommendation
View on GitHub
An Awesome Collection for Sequential Recommendation and Sequence Modeling in Recommend System
☆39Nov 19, 2023Updated 2 years ago
mxu34 / mbrl-gpmm
View on GitHub
☆28Jun 23, 2020Updated 6 years ago
langholz / owlqn
View on GitHub
Orthant-Wise Limited-memory Quasi-Newton Optimizer for L1-regularized Objectives
☆10Mar 9, 2014Updated 12 years ago
mazefeng / lightmf
View on GitHub
A light-weight matrix factorization tool
☆39Nov 17, 2017Updated 8 years ago
michaelnny / InstructLLaMA
View on GitHub
Implements pre-training, supervised fine-tuning (SFT), and reinforcement learning from human feedback (RLHF), to train and fine-tune the …
☆57Mar 9, 2024Updated 2 years ago
llq20133100095 / deep-tiaotiao
View on GitHub
用强化学习来玩微信跳一跳
☆12Jul 10, 2022Updated 4 years ago
zepingyu0512 / sli_rec
View on GitHub
☆75Dec 27, 2019Updated 6 years ago
aasish / userIntentDataset
View on GitHub
☆14Dec 27, 2016Updated 9 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
anirudh9119 / walkback_nips17
View on GitHub
Variational Walkback, NIPS'17
☆28Oct 18, 2017Updated 8 years ago
swastikmaiti / LlamaIndex-Agent
View on GitHub
A RAG system is just the beginning of harnessing the power of LLM. The next step is creating an intelligent Agent. In Agentic RAG the Ag…
☆14May 31, 2024Updated 2 years ago
AngusMonroe / Active-NER
View on GitHub
Bayesian Deep Active Learning for Named entity recognition (NER)
☆19Jan 17, 2020Updated 6 years ago
andrew-zzz / tree-based-deep-model
View on GitHub
it's the realization of Tree-based Deep Model with tensorflow
☆33Feb 24, 2020Updated 6 years ago
JeremyAlain / imitation_learning_from_language_feedback
View on GitHub
This repository contains some of the code used in the paper "Training Language Models with Langauge Feedback at Scale"
☆26Mar 30, 2023Updated 3 years ago
hoyeoplee / MeLU
View on GitHub
☆184Oct 1, 2019Updated 6 years ago
yosinski / GitResultsManager
View on GitHub
A simple snapshotting system for managing research results using Git.
☆30Mar 27, 2025Updated last year
LoryPack / LLM-LieDetector
View on GitHub
Code for the ICLR 2024 paper "How to catch an AI liar: Lie detection in black-box LLMs by asking unrelated questions"
☆74Jun 19, 2024Updated 2 years ago
search-opensource-space / FashionBERT
View on GitHub
☆11Sep 18, 2020Updated 5 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
easezyc / MetaHeac
View on GitHub
This is an official implementation for "Learning to Expand Audience via Meta Hybrid Experts and Critics for Recommendation and Advertisin…
☆59Feb 22, 2022Updated 4 years ago
crystalajj / word2vec
View on GitHub
Python implementation of Word2Vec using skip-gram and negative sampling
☆12Dec 8, 2015Updated 10 years ago
KID-22 / PCIC2021-Baselines
View on GitHub
Some baselines for PCIC2021 Track 2: Causal Inference and Recommendation
☆17May 29, 2023Updated 3 years ago
Ethan00Si / KuaiSAR
View on GitHub
This repository has been redirected into https://kuaisar.github.io/.
☆11Oct 12, 2023Updated 2 years ago
waterhorse1 / MELU_pytorch
View on GitHub
An unofficial pytorch implementation of MELU
☆46Aug 7, 2024Updated last year
HansiZeng / scaling-retriever
View on GitHub
[SIGIR 2025] The official repo for "Scaling Sparse and Dense Retrieval in Decoder-Only LLMs"
☆22Mar 31, 2025Updated last year
jeffhj / DEER
View on GitHub
The implementation for "DEER: Descriptive Knowledge Graph for Explaining Entity Relationships" (EMNLP '22)
☆11Oct 31, 2022Updated 3 years ago