shuishida / Multi-Armed-Bandit
☆11Updated 4 years ago
Related projects ⓘ
Alternatives and complementary repositories for Multi-Armed-Bandit
- Code for ACL 2022 Paper: Active Evaluation: Efficient NLG Evaluation with Few Pairwise Comparisons☆14Updated last year
- some tutorials for blog: simonjisu.github.io☆23Updated 3 years ago
- Transformer based Trigram Blocking implementation in Tensorflow☆11Updated 4 years ago
- ☆16Updated last year
- Analogous Safe-state Exploration (ASE) is an algorithm for provably safe and optimal exploration in MDPs with unknown, stochastic dynamic…☆11Updated 3 years ago
- Bi-Directional Attention Flow for Machine Comprehensions☆10Updated 6 years ago
- Experiments on GPT-3's ability to fit numerical models in-context.☆14Updated 2 years ago
- ☆10Updated 4 years ago
- This repository contains some of the code used in the paper "Training Language Models with Langauge Feedback at Scale"☆26Updated last year
- This repo contains datasets and code for Assessing Phrasal Representation and Composition in Transformers, by Lang Yu and Allyson Ettinge…☆11Updated 3 years ago
- Code for our paper Resources and Evaluations for Multi-Distribution Dense Information Retrieval☆14Updated 9 months ago
- Embedding Recycling for Language models☆38Updated last year
- Detecting topic clusters in arXiv ML papers.☆12Updated 4 years ago
- Code for paper "When Can Models Learn From Explanations? A Formal Framework for Understanding the Roles of Explanation Data"☆14Updated 3 years ago
- Official codebase accompanying our ACL 2022 paper "RELiC: Retrieving Evidence for Literary Claims" (https://relic.cs.umass.edu).☆20Updated 2 years ago
- Statistics and Accepted paper list of ACL 2020 with arXiv link☆23Updated 4 years ago
- Counterfactual Evaluation and Learning for Interactive Systems: Foundations, Implementations, and Recent Advances☆12Updated 2 years ago
- ☆66Updated 2 years ago
- Finding Generalizable Evidence by Learning to Convince Q&A Models☆25Updated last year
- Pretraining summarization models using a corpus of nonsense☆13Updated 3 years ago
- Tensorflow port implementation of Single Headed Attention RNN☆16Updated 4 years ago
- Personalized and Interactive Music Recommendation with Bandit approach☆10Updated 5 years ago
- ☆11Updated 3 years ago
- python project template for personal projects! 🙋♀️☆10Updated 3 years ago
- ☆16Updated 4 years ago
- Generate and train embeddings with a graph neural network and deploy as an API in a few lines of code☆9Updated 4 years ago
- simple reinforcement learning example for the minecraft☆9Updated 6 years ago
- Code for the RecSys20 paper -- Unbiased Implicit Recommendation and Propensity Estimation via Combinational Joint Learning☆10Updated 4 years ago
- EMNLP 2021 Adapting Language Models for Zero-shot Learning by Meta-tuning on Dataset and Prompt Collections☆49Updated 3 years ago
- Few-shot Learning with Auxiliary Data☆26Updated 11 months ago