castorini / transformers-arithmetic
☆38Updated 3 years ago
Alternatives and similar repositories for transformers-arithmetic:
Users that are interested in transformers-arithmetic are comparing it to the libraries listed below
- Exploring Few-Shot Adaptation of Language Models with Tables☆23Updated 2 years ago
- ☆22Updated 3 years ago
- ☆45Updated 3 years ago
- ☆48Updated last year
- The official repository for our paper "The Devil is in the Detail: Simple Tricks Improve Systematic Generalization of Transformers". We s…☆66Updated 2 years ago
- Evaluating Machines by their Real-World Language Use☆33Updated last year
- ☆67Updated 2 years ago
- Suite of 500 procedurally-generated NLP tasks to study language model adaptability☆21Updated 2 years ago
- ☆35Updated 8 months ago
- Code for the paper "Implicit Representations of Meaning in Neural Language Models"☆50Updated last year
- ☆19Updated 2 years ago
- Diagnostic benchmark suite to explicitly test logical relational reasoning on natural language☆92Updated 8 months ago
- ☆50Updated 3 years ago
- The official repository for our paper "The Neural Data Router: Adaptive Control Flow in Transformers Improves Systematic Generalization".☆32Updated 3 years ago
- Finding Generalizable Evidence by Learning to Convince Q&A Models☆25Updated 2 years ago
- Code for paper "Leakage-Adjusted Simulatability: Can Models Generate Non-Trivial Explanations of Their Behavior in Natural Language?"☆20Updated 4 years ago
- Neural Unification for Logic Reasoning over Language☆22Updated 3 years ago
- Paper: Lexicon Learning for Few-Shot Neural Sequence Modeling☆15Updated 3 years ago
- [EMNLP 2020] PyTorch code of PRover: Proof Generation for Interpretable Reasoning over Rules☆19Updated last year
- Official code for the ICLR 2020 paper 'ARE PPE-TRAINED LANGUAGE MODELS AWARE OF PHRASES? SIMPLE BUT STRONG BASELINES FOR GRAMMAR INDCUTIO…☆30Updated last year
- ☆42Updated 4 years ago
- Compositional generalization through meta sequence-to-sequence learning☆84Updated 5 years ago
- This repository contains the code for running the character-level Sandwich Transformers from our ACL 2020 paper on Improving Transformer …☆55Updated 4 years ago
- Code accompanying our papers on the "Generative Distributional Control" framework☆117Updated 2 years ago
- Pretraining summarization models using a corpus of nonsense☆13Updated 3 years ago
- ☆22Updated 3 years ago
- Source Code for paper "Learning from Explanations with Neural Execution Tree", ICLR 2020☆18Updated 3 years ago
- Code and pre-trained models for "ReasonBert: Pre-trained to Reason with Distant Supervision", EMNLP'2021☆29Updated last year
- ☆49Updated last year
- ☆63Updated 2 years ago