stelladk / PretrainingBERTLinks
Pre-training BERT masked language models with custom vocabulary
β32Updated 3 years ago
Alternatives and similar repositories for PretrainingBERT
Users that are interested in PretrainingBERT are comparing it to the libraries listed below
Sorting:
- [ACL 2022] LinkBERT: A Knowledgeable Language Model π Pretrained with Document Linksβ449Updated 3 years ago
- Long Document Summarization Papersβ154Updated 2 years ago
- [ICLR 2022 spotlight]GreaseLM: Graph REASoning Enhanced Language Models for Question Answeringβ240Updated 9 months ago
- [WWW 2022] Topic Discovery via Latent Space Clustering of Pretrained Language Model Representationsβ91Updated 4 years ago
- β42Updated 4 years ago
- [NAACL'21 & ACL'21] SapBERT: Self-alignment pretraining for BERT & XL-BEL: Cross-Lingual Biomedical Entity Linking.β213Updated 2 years ago
- [NeurIPS 2022] DRAGON π²: Deep Bidirectional Language-Knowledge Graph Pretrainingβ331Updated 2 years ago
- Neighborhood Contrastive Learning for Scientific Document Representations with Citation Embeddings (EMNLP 2022 paper)β75Updated last month
- The autoregressive information extraction system GenIE (Generative Information Extraction) implemented in PyTorch.β105Updated 2 years ago
- Data and models for the SciFact verification task.β248Updated 2 years ago
- Code for ACL 2022 paper on the topic of long document summarization: MemSum: Extractive Summarization of Long Documents Using Multi-Step β¦β49Updated last year
- Search Engines with Autoregressive Language modelsβ295Updated 2 years ago
- Pre-training Multi-task Contrastive Learning Models for Scientific Literature Understanding (Findings of EMNLP'23)β11Updated last year
- β59Updated 4 years ago
- A Simple but Powerful SOTA NER Model | Official Code For Label Supervised LLaMA Finetuningβ152Updated last year
- A simple recipe for training and inferencing Transformer architecture for Multi-Task Learning on custom datasets. You can find two approaβ¦β99Updated 3 years ago
- The corresponding code for our paper: A sequence-to-sequence approach for document-level relation extraction.β64Updated last year
- We evaluate many models used for biomedical and clinical nlp tasks, and train new models that perform much better.β163Updated 4 years ago
- β48Updated 3 years ago
- A Label Attention Model for ICD Coding from Clinical Textβ71Updated 3 years ago
- β92Updated last year
- Long-context pretrained encoder-decoder modelsβ96Updated 3 years ago
- β60Updated 3 years ago
- BioCreative-V CDR Corpusβ30Updated 7 years ago
- A repo to explore different NLP tasks which can be solved using T5β173Updated 5 years ago
- Repo for Aspire - A scientific document similarity model based on matching fine-grained aspects of scientific papers.β54Updated 2 years ago
- SciFive: a text-text transformer model for biomedical literatureβ98Updated last year
- Interpreting Language Models with Contrastive Explanations (EMNLP 2022 Best Paper Honorable Mention)β62Updated 3 years ago
- Source code for paper "Learning from Noisy Labels for Entity-Centric Information Extraction", EMNLP 2021β55Updated 4 years ago
- pyTorch implementation of Recurrence over BERT (RoBERT) based on this paper https://arxiv.org/abs/1910.10781 and comparison with pyTorch β¦β82Updated 3 years ago