coastalcph / hierarchical-transformers
Hierarchical Attention Transformers (HAT)
☆56 Updated last year
Alternatives and similar repositories for hierarchical-transformers
Users who are interested in hierarchical-transformers compare it to the libraries listed below.
- Google's BigBird (Jax/Flax & PyTorch) @ 🤗Transformers ☆49 Updated 2 years ago
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset. ☆93 Updated 2 years ago
- ☆44 Updated 2 years ago
- Pytorch implementation of “Recursive Non-Autoregressive Graph-to-Graph Transformer for Dependency Parsing with Iterative Refinement” ☆62 Updated 4 years ago
- BioELECTRA ☆50 Updated 3 years ago
- EMNLP 2021 - Frustratingly Simple Pretraining Alternatives to Masked Language Modeling ☆31 Updated 3 years ago
- ☆21 Updated 3 years ago
- Code associated with the "Data Augmentation using Pre-trained Transformer Models" paper ☆52 Updated 2 years ago
- Official code and model checkpoints for our EMNLP 2022 paper "RankGen - Improving Text Generation with Large Ranking Models" (https://arx… ☆136 Updated last year
- Embedding Recycling for Language models ☆38 Updated last year
- ☆18 Updated 4 years ago
- ☆100 Updated 2 years ago
- The official implementation of "Distilling Relation Embeddings from Pre-trained Language Models, EMNLP 2021 main conference", a high-qual… ☆47 Updated 6 months ago
- Using business-level retrieval system (BM25) with Python in just a few lines. ☆31 Updated 2 years ago
- BioM-Transformers: Building Large Biomedical Language Models with BERT, ALBERT and ELECTRA ☆37 Updated last year
- The official repository for Efficient Long-Text Understanding Using Short-Text Models (Ivgi et al., 2022) paper ☆69 Updated 2 years ago
- Code repository for BEEP (Biomedical Evidence Enhanced Predictions) clinical outcome prediction system ☆26 Updated last year
- Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP ☆58 Updated 2 years ago
- Implementation of Marge, Pre-training via Paraphrasing, in Pytorch ☆76 Updated 4 years ago
- LTG-Bert ☆33 Updated last year
- ☆49 Updated 3 years ago
- Multi^2OIE: Multilingual Open Information Extraction Based on Multi-Head Attention with BERT (Findings of ACL: EMNLP 2020) ☆56 Updated 2 years ago
- Long-context pretrained encoder-decoder models ☆95 Updated 2 years ago
- SeqScore: Scoring for named entity recognition and other sequence labeling tasks ☆23 Updated 3 months ago
- Multitask Learning with Pretrained Transformers ☆40 Updated 4 years ago
- Repository collecting resources and best practices to improve experimental rigour in deep learning research. ☆27 Updated 2 years ago
- A simple and working implementation of Electra, the fastest way to pretrain language models from scratch, in Pytorch ☆227 Updated 2 years ago
- PropSegmEnt is an annotated dataset for segmenting English text into propositions, and recognizing proposition-level entailment relations… ☆19 Updated 2 years ago
- ACL22 paper: Imputing Out-of-Vocabulary Embeddings with LOVE Makes Language Models Robust with Little Cost ☆41 Updated last year
- State of the art Semantic Sentence Embeddings ☆99 Updated 3 years ago