rdangovs / rotational-unit-of-memory
RUM
☆76Updated 4 years ago
Alternatives and similar repositories for rotational-unit-of-memory
Users who are interested in rotational-unit-of-memory are comparing it to the repositories listed below
- Cooperative Learning of Disjoint Syntax and Semantics☆50Updated 6 years ago
- Factorization of the neural parameter space for zero-shot multi-lingual and multi-task transfer☆39Updated 4 years ago
- ☆64Updated 5 years ago
- Code for EMNLP 2018 paper "Auto-Encoding Dictionary Definitions into Consistent Word Embeddings"☆36Updated 7 years ago
- Code for the paper "Do Massively Pretrained Language Models Make Better Storytellers?"☆76Updated 3 years ago
- Code for reproducing experiments in our ACL 2019 paper "Probing Neural Network Comprehension of Natural Language Arguments"☆53Updated 3 years ago
- ☆53Updated 3 years ago
- Agents that build knowledge graphs and explore textual worlds by asking questions☆79Updated 2 years ago
- Datasets I have created for scientific summarization, and a trained BertSum model☆114Updated 5 years ago
- Text classification code described in "SoPa: Bridging CNNs, RNNs, and Weighted Finite-State Machines" by Roy Schwartz, Sam Thomson and No…☆54Updated 3 years ago
- NanigoNet — Language detector for code-mixed input supporting 150+19 human+programming languages using deep neural networks☆72Updated 2 years ago
- Repository for the ACL 2020 virtual conference website (work in progress)☆39Updated 3 years ago
- A PyTorch Implementation of FastFusionNet on SQuAD 1.1☆39Updated 6 years ago
- Hybrid Approaches to Detect Comments Violating Macro Norms on Reddit☆28Updated 6 years ago
- pair2vec: Compositional Word-Pair Embeddings for Cross-Sentence Inference☆62Updated 2 years ago
- Relevant code for the "Show Your Work" paper, EMNLP 2019.☆18Updated 5 years ago
- A 🤗-style implementation of BERT using lambda layers instead of self-attention☆69Updated 4 years ago
- ☆60Updated 6 years ago
- Code and data for the WSDM '19 paper "Crosslingual Document Embedding as Reduced-Rank Ridge Regression (Cr5)"☆30Updated 6 years ago
- Pip-installable differentiable stacks in PyTorch!☆65Updated 4 years ago
- Code and datasets of "Multilingual Extractive Reading Comprehension by Runtime Machine Translation"☆40Updated 6 years ago
- ARC Question Solvers☆82Updated 4 years ago
- Code and data for paper Colorless Green Recurrent Networks Dream Hierarchically☆93Updated 3 years ago
- ☆178Updated 5 years ago
- Code for the paper: Don't Settle for Average, Go for the Max: Fuzzy Sets and Max-Pooled Word Vectors, ICLR 2019.☆43Updated 3 years ago
- ☆26Updated 6 years ago
- Code for bidirectional sequence generation (BiSon) for generating from BERT pre-trained models.☆51Updated 5 years ago
- The repository for the paper "When Do You Need Billions of Words of Pretraining Data?"☆21Updated 4 years ago
- [ACL 2018] Conditional Generators of Words Definitions☆33Updated 7 years ago
- ☆103Updated 6 years ago