seanswyi / transformer-implementation
Personal implementation of the Transformer paper.
☆22Updated last year
Alternatives and similar repositories for transformer-implementation:
Users that are interested in transformer-implementation are comparing it to the libraries listed below
- NLP Examples using the 🤗 libraries☆42Updated 3 years ago
- A 🤗-style implementation of BERT using lambda layers instead of self-attention☆69Updated 4 years ago
- Yet another mini autodiff system for educational purposes☆28Updated 2 months ago
- Implementation, trained models and result data for the paper "Aspect-based Document Similarity for Research Papers" #COLING2020☆62Updated 8 months ago
- code for the paper "Cluster & Tune: Boost Cold Start Performance in Text Classification" for ACL2022☆28Updated 2 years ago
- Google's BigBird (Jax/Flax & PyTorch) @ 🤗Transformers☆48Updated last year
- Code for the Shortformer model, from the ACL 2021 paper by Ofir Press, Noah A. Smith and Mike Lewis.☆145Updated 3 years ago
- TorchServe+Streamlit for easily serving your HuggingFace NER models☆31Updated 2 years ago
- Lightning template for easy prototyping⚡️☆13Updated 2 years ago
- Multitask Learning with Pretrained Transformers☆39Updated 3 years ago
- ☆74Updated 3 years ago
- No Teacher BART distillation experiment for NLI tasks☆26Updated 4 years ago
- Distillation of BERT model with catalyst framework☆75Updated last year
- A deep learning library based on Pytorch focussed on low resource language research and robustness☆69Updated 3 years ago
- Sequence models in Numpy☆25Updated 4 years ago
- This is the second part of the Deep Learning Course for the Master in High-Performance Computing (SISSA/ICTP).)☆33Updated 4 years ago
- Code repository for the NAACL 2022 paper "ExSum: From Local Explanations to Model Understanding"☆63Updated 2 years ago
- Easy-to-use text representations extraction library based on the Transformers library.☆32Updated 2 years ago
- ☆42Updated last year
- Hierarchical Attention Transformers (HAT)☆46Updated last year
- ☆21Updated 3 years ago
- Code for the paper: Saying No is An Art: Contextualized Fallback Responses for Unanswerable Dialogue Queries☆19Updated 3 years ago
- ML Reproducibility Challenge 2020: Electra reimplementation using PyTorch and Transformers☆12Updated 3 years ago
- On Generating Extended Summaries of Long Documents☆78Updated 3 years ago
- ☆46Updated 4 years ago
- Official codebase accompanying our ACL 2022 paper "RELiC: Retrieving Evidence for Literary Claims" (https://relic.cs.umass.edu).☆20Updated 2 years ago
- Code associated with the "Data Augmentation using Pre-trained Transformer Models" paper☆52Updated last year
- KitanaQA: Adversarial training and data augmentation for neural question-answering models☆57Updated last year
- A package for fine-tuning Transformers with TPUs, written in Tensorflow2.0+☆37Updated 3 years ago