sgraaf / Replicate-Toronto-BookCorpus
This repository contains code to replicate the no-longer publicly available Toronto BookCorpus dataset
☆49Updated 2 years ago
Alternatives and similar repositories for Replicate-Toronto-BookCorpus:
Users that are interested in Replicate-Toronto-BookCorpus are comparing it to the libraries listed below
- Hyperparameter Search for AllenNLP☆135Updated last month
- Code for the paper "Latent Relation Language Models" at AAAI-20.☆41Updated 4 years ago
- ☆74Updated 3 years ago
- Boolean Question Answering with multi-task learning and uses large LM embeddings like BERT, RoBERTa☆18Updated 5 years ago
- Assessing syntactic abilities of BERT☆148Updated 5 years ago
- Code for the paper "Improving Robustness of Machine Translation with Synthetic Noise"☆21Updated 5 years ago
- Factorization of the neural parameter space for zero-shot multi-lingual and multi-task transfer☆39Updated 4 years ago
- LM Pretraining with PyTorch/TPU☆134Updated 5 years ago
- Code for EMNLP 2018 paper "Auto-Encoding Dictionary Definitions into Consistent Word Embeddings"☆36Updated 6 years ago
- ☆46Updated 5 years ago
- A python tool for building large scale Wikipedia-based Information Retrieval datasets☆46Updated 3 years ago
- Assessing syntactic abilities of BERT☆39Updated 5 years ago
- EMNLP 2021 Tutorial: Multi-Domain Multilingual Question Answering☆38Updated 3 years ago
- Companion site for "Analysis Methods in Neural Language Processing: A Survey"☆66Updated 4 years ago
- Zero-Shot Open Entity Typing as Type-Compatible Grounding, EMNLP'18.☆42Updated 5 years ago
- This repository contains the code for running the character-level Sandwich Transformers from our ACL 2020 paper on Improving Transformer …☆55Updated 4 years ago
- pair2vec: Compositional Word-Pair Embeddings for Cross-Sentence Inference☆62Updated 2 years ago
- ☆33Updated 5 years ago
- ☆33Updated 3 years ago
- Implementation of Marge, Pre-training via Paraphrasing, in Pytorch☆75Updated 4 years ago
- Pre-trained models and code and data to train and use models from "Pushing the Limits of Paraphrastic Sentence Embeddings with Millions o…☆101Updated last year
- numeric fused-head identification and resolution☆33Updated 5 years ago
- A template for starting an allennlp project using a python script instead of config files☆27Updated 11 months ago
- Code and datasets of "Multilingual Extractive Reading Comprehension by Runtime Machine Translation"☆40Updated 6 years ago
- Frame-Semantic and PropBank Semantic Role Labeling with Syntactic Scaffolding.☆50Updated 3 years ago
- Data and code for Kang et al., EMNLP 2019's paper titled "(Male, Bachelor) and (Female, Ph.D) have different connotations: Parallelly Ann…☆29Updated 4 years ago
- A program to choose transfer languages for cross-lingual learning☆72Updated last year
- Code for bidirectional sequence generation (BiSon) for generating from BERT pre-trained models.☆51Updated 4 years ago
- ☆32Updated 3 years ago
- The bAbI question-answering dataset ported into T2T.☆32Updated 6 years ago