tberg12 / cse291spr21
☆10Updated 3 years ago
Alternatives and similar repositories for cse291spr21:
Users that are interested in cse291spr21 are comparing it to the libraries listed below
- This is a repository with the code for the ACL 2019 paper "Analyzing Multi-Head Self-Attention: Specialized Heads Do the Heavy Lifting, t…☆306Updated 3 years ago
- An assignment on creating a minimalist neural network toolkit for CS11-747☆64Updated last year
- Optimus: the first large-scale pre-trained VAE language model☆381Updated last year
- Adversarial Natural Language Inference Benchmark☆393Updated 2 years ago
- Repository containing code for "How to Train BERT with an Academic Budget" paper☆310Updated last year
- Understanding the Difficulty of Training Transformers☆327Updated 2 years ago
- Course webpage for COMP 790, (Deep) Learning from Limited Labeled Data☆303Updated 4 years ago
- Important paper implementations for Question Answering using PyTorch☆273Updated 4 years ago
- A list of publications on NLP interpretability (Welcome PR)☆167Updated 4 years ago
- A curated list of awesome advice for computer science Ph.D. applicants.☆284Updated 3 years ago
- Solutions to CS224n: Natural Language Processing with Deep Learning assignments.☆71Updated 8 months ago
- Official codebase for Pretrained Transformers as Universal Computation Engines.☆245Updated 3 years ago
- A new lightweight auto-differentation library that directly builds on numpy. Used as a homework for CMU 11785/11685/11485.☆35Updated 2 years ago
- Transformer with Untied Positional Encoding (TUPE). Code of paper "Rethinking Positional Encoding in Language Pre-training". Improve exis…☆250Updated 3 years ago
- Code for the paper "Are Sixteen Heads Really Better than One?"☆171Updated 4 years ago
- A Natural Language Inference (NLI) model based on Transformers (BERT and ALBERT)☆133Updated 11 months ago
- AI residency programs information☆442Updated last year
- Code and data to support the paper "PAQ 65 Million Probably-Asked Questions andWhat You Can Do With Them"☆202Updated 3 years ago
- Fully featured implementation of Routing Transformer☆288Updated 3 years ago
- Fast Block Sparse Matrices for Pytorch☆547Updated 3 years ago
- Pretrain and finetune ELECTRA with fastai and huggingface. (Results of the paper replicated !)☆325Updated last year
- Materials for the EMNLP 2020 Tutorial on "Interpreting Predictions of NLP Models"☆198Updated 4 years ago
- Pytorch implementation of Compressive Transformers, from Deepmind☆155Updated 3 years ago
- [NeurIPS 2020] "The Lottery Ticket Hypothesis for Pre-trained BERT Networks", Tianlong Chen, Jonathan Frankle, Shiyu Chang, Sijia Liu, Ya…☆139Updated 3 years ago
- Minimal tutorial on packing and unpacking sequences in pytorch☆210Updated 5 years ago
- ☆460Updated 3 years ago
- Annotations of the interesting ML papers I read☆224Updated last week
- SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in c…☆362Updated 2 years ago
- Python code for various NLP metrics☆166Updated 5 years ago
- List of AI Residency & Research programs, Ph.D Fellowships, Research Internships☆156Updated 4 years ago