coastalcph / hierarchical-transformersLinks

Hierarchical Attention Transformers (HAT)

☆59

Alternatives and similar repositories for hierarchical-transformers

Users that are interested in hierarchical-transformers are comparing it to the libraries listed below

Sorting:

allenai / EmbeddingRecycling
Embedding Recycling for Language models
☆38Updated 2 years ago
huggingface / olm-training
Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.
☆96Updated 2 years ago
thevasudevgupta / bigbird
Google's BigBird (Jax/Flax & PyTorch) @ 🤗Transformers
☆49Updated 2 years ago
IBM / model-recycling
Ranking of fine-tuned HF models as base models.
☆36Updated 2 months ago
google-research-datasets / swim-ir
SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…
☆49Updated 2 years ago
idiap / g2g-transformer
Pytorch implementation of “Recursive Non-Autoregressive Graph-to-Graph Transformer for Dependency Parsing with Iterative Refinement”
☆62Updated 4 years ago
cimeister / typical-sampling
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
☆81Updated 3 years ago
stanfordnlp / ColBERT-QA
Code for Relevance-guided Supervision for OpenQA with ColBERT (TACL'21)
☆41Updated 4 years ago
jaketae / ensemble-transformers
Ensembling Hugging Face transformers made easy
☆62Updated 2 years ago
ltgoslo / ltg-bert
LTG-Bert
☆34Updated last year
oriram / spider
☆54Updated 2 years ago
facebookresearch / ELECTRA-Fewshot-Learning
This repository contains the code for paper Prompting ELECTRA Few-Shot Learning with Discriminative Pre-Trained Models.
☆48Updated 3 years ago
lucidrains / electra-pytorch
A simple and working implementation of Electra, the fastest way to pretrain language models from scratch, in Pytorch
☆235Updated 2 years ago
martiansideofthemoon / rankgen
Official code and model checkpoints for our EMNLP 2022 paper "RankGen - Improving Text Generation with Large Ranking Models" (https://arx…
☆138Updated 2 years ago
joeljang / ELM
[ICML 2023] Exploring the Benefits of Training Expert Language Models over Instruction Tuning
☆98Updated 2 years ago
Mivg / SLED
The official repository for Efficient Long-Text Understanding Using Short-Text Models (Ivgi et al., 2022) paper
☆70Updated 2 years ago
amazon-science / efficient-longdoc-classification
☆47Updated 3 years ago
facebookresearch / bart_ls
Long-context pretrained encoder-decoder models
☆96Updated 3 years ago
philschmid / deep-learning-habana-huggingface
☆32Updated 2 years ago
ThomasScialom / T0_continual_learning
Adding new tasks to T0 without catastrophic forgetting
☆33Updated 3 years ago
ShiZhengyan / PowerfulPromptFT
[NeurIPS 2023 Main Track] This is the repository for the paper titled "Don’t Stop Pretraining? Make Prompt-based Fine-tuning Powerful Lea…
☆76Updated last year
gucci-j / light-transformer-emnlp2021
EMNLP 2021 - Frustratingly Simple Pretraining Alternatives to Masked Language Modeling
☆34Updated 4 years ago
g8a9 / ear
Code associated with the paper "Entropy-based Attention Regularization Frees Unintended Bias Mitigation from Lists"
☆50Updated 3 years ago
nreimers / se-pytorch-xla
☆21Updated 4 years ago
asahi417 / relbert
The official implementation of "Distilling Relation Embeddings from Pre-trained Language Models, EMNLP 2021 main conference", a high-qual…
☆47Updated 11 months ago
martiansideofthemoon / hurdles-longform-qa
Official repository with code and data accompanying the NAACL 2021 paper "Hurdles to Progress in Long-form Question Answering" (https://a…
☆46Updated 3 years ago
facebookresearch / BELA
Bi-encoder entity linking architecture
☆51Updated last year
google-research / t5x_retrieval
☆101Updated 2 years ago
Mihir3009 / In-BoXBART
In-BoXBART: Get Instructions into Biomedical Multi-task Learning
☆14Updated 3 years ago
facebookresearch / EditEval
An instruction-based benchmark for text improvements.
☆143Updated 3 years ago