neubig / minnn-assignment

An assignment on creating a minimalist neural network toolkit for CS11-747

☆64

Alternatives and similar repositories for minnn-assignment

Users who are interested in minnn-assignment are comparing it to the repositories listed below

ofirpress / sandwich_transformer
This repository contains the code for running the character-level Sandwich Transformers from our ACL 2020 paper on Improving Transformer …
☆55 Updated 4 years ago
TimDettmers / transformer-xl
☆65 Updated 5 years ago
lyeoni / pretraining-for-language-understanding
Pre-training of Language Models for Language Understanding
☆83 Updated 6 years ago
allenai / sledgehammer
☆48 Updated 5 years ago
neubig / mtandseq2seq-code
Code examples for CMU CS11-731, Machine Translation and Sequence-to-sequence Models
☆35 Updated 6 years ago
IKMLab / arct2
Code for reproducing experiments in our ACL 2019 paper "Probing Neural Network Comprehension of Natural Language Arguments"
☆54 Updated 3 years ago
allenai / allentune
Hyperparameter Search for AllenNLP
☆140 Updated 8 months ago
serrano-s / attn-tests
Checking the interpretability of attention on text classification models
☆49 Updated 6 years ago
brendenlake / meta_seq2seq
PyTorch code for meta seq2seq learning
☆43 Updated 5 years ago
cgraywang / transformer-on-diet
Code repo for "Transformer on a Diet" paper
☆31 Updated 5 years ago
uds-lsv / bert-stable-fine-tuning
On the Stability of Fine-tuning BERT: Misconceptions, Explanations, and Strong Baselines
☆137 Updated 2 years ago
allenai / tpu_pretrain
LM Pretraining with PyTorch/TPU
☆136 Updated 6 years ago
sarahwie / attention
Code for EMNLP 2019 paper "Attention is not not Explanation"
☆58 Updated 4 years ago
pmichel31415 / jsalt-2019-mt-tutorial
MT Tutorial for the JSALT 2019 Summer School
☆48 Updated 6 years ago
fdalvi / NeuroX-demo
☆66 Updated 2 years ago
cambridgeltl / parameter-factorization
Factorization of the neural parameter space for zero-shot multi-lingual and multi-task transfer
☆39 Updated 5 years ago
cybertronai / transformer-xl
Training Transformer-XL on 128 GPUs
☆141 Updated 5 years ago
reubenharry / Recurrent-RSA
Code for NAACL paper
☆21 Updated 7 years ago
srush / awesome-ml-tracking
☆104 Updated 4 years ago
epfml / collaborative-attention
Code for Multi-Head Attention: Collaborate Instead of Concatenate
☆152 Updated 2 years ago
allenai / ARC-Solvers
ARC Question Solvers
☆82 Updated 4 years ago
lena-voita / description-length-probing
This is a repository with the code for the EMNLP 2020 paper "Information-Theoretic Probing with Minimum Description Length"
☆71 Updated last year
ethanjperez / convince
Finding Generalizable Evidence by Learning to Convince Q&A Models
☆25 Updated 2 years ago
boknilev / nlp-analysis-methods
Companion site for "Analysis Methods in Neural Language Processing: A Survey"
☆66 Updated 5 years ago
acmi-lab / counterfactually-augmented-data
Learning the Difference that Makes a Difference with Counterfactually-Augmented Data
☆170 Updated 4 years ago
tnq177 / transformers_without_tears
Transformers without Tears: Improving the Normalization of Self-Attention
☆134 Updated last year
nttcslab-nlp / doc_lm
☆12 Updated 6 years ago
sgraaf / Replicate-Toronto-BookCorpus
This repository contains code to replicate the no-longer publicly available Toronto BookCorpus dataset
☆49 Updated 3 years ago
jayded / eraserbenchmark
A benchmark for understanding and evaluating rationales: http://www.eraserbenchmark.com/
☆99 Updated 3 years ago
harvardnlp / encoder-agnostic-adaptation
Encoder-Agnostic Adaptation for Conditional Language Generation
☆80 Updated last year