seanswyi / transformer-implementation
Personal implementation of the Transformer paper.
☆22 Updated last year
Alternatives and similar repositories for transformer-implementation:
Users who are interested in transformer-implementation are comparing it to the repositories listed below.
- A 🤗-style implementation of BERT using lambda layers instead of self-attention ☆69 Updated 4 years ago
- ☆46 Updated 4 years ago
- Code for the paper: Saying No is An Art: Contextualized Fallback Responses for Unanswerable Dialogue Queries ☆19 Updated 3 years ago
- NLP Examples using the 🤗 libraries ☆41 Updated 3 years ago
- On Generating Extended Summaries of Long Documents ☆78 Updated 4 years ago
- Code for the Shortformer model, from the ACL 2021 paper by Ofir Press, Noah A. Smith and Mike Lewis. ☆145 Updated 3 years ago
- Multitask Learning with Pretrained Transformers ☆39 Updated 3 years ago
- A package for fine-tuning Transformers with TPUs, written in Tensorflow2.0+ ☆37 Updated 3 years ago
- Google's BigBird (Jax/Flax & PyTorch) @ 🤗Transformers ☆48 Updated last year
- Implementation, trained models and result data for the paper "Aspect-based Document Similarity for Research Papers" #COLING2020 ☆62 Updated 9 months ago
- ☆15 Updated 4 years ago
- TorchServe+Streamlit for easily serving your HuggingFace NER models ☆32 Updated 2 years ago
- What are the best Systems? New Perspectives on NLP Benchmarking ☆13 Updated last year
- A tutorial on how to implement models for natural language inference using PyTorch and TorchText. [IN PROGRESS] ☆25 Updated 4 years ago
- GrammarTagger: A Neural Multilingual Grammar Profiler for Language Learning ☆27 Updated 3 years ago
- ☆74 Updated 3 years ago
- Code associated with the "Data Augmentation using Pre-trained Transformer Models" paper ☆52 Updated last year
- code for the paper "Cluster & Tune: Boost Cold Start Performance in Text Classification" for ACL2022 ☆28 Updated 2 years ago
- Official codebase accompanying our ACL 2022 paper "RELiC: Retrieving Evidence for Literary Claims" (https://relic.cs.umass.edu). ☆20 Updated 2 years ago
- Dynamic ensemble decoding with transformer-based models ☆29 Updated last year
- Low-code pre-built pipelines for experiments with huggingface/transformers for Data Scientists in a rush. ☆16 Updated 4 years ago
- Tooling to play around with multilingual machine translation for Indian Languages. ☆21 Updated 2 years ago
- Embedding Recycling for Language models ☆38 Updated last year
- QED: A Framework and Dataset for Explanations in Question Answering ☆115 Updated 3 years ago
- Fine-tune transformers with pytorch-lightning ☆44 Updated 2 years ago
- PyTorch implementation of GLOM ☆21 Updated 2 years ago
- This project shows how to derive the total number of training tokens from a large text dataset from 🤗 datasets with Apache Beam and Data… ☆24 Updated 2 years ago
- Dense Passage Retrieval using tensorflow-keras on TPU ☆15 Updated 3 years ago
- Text summarization with python and transformer ☆13 Updated last year
- Sequence models in Numpy ☆25 Updated 4 years ago