stelladk / PretrainingBERT
Pre-training BERT masked language models with custom vocabulary
☆32Updated 3 years ago
Alternatives and similar repositories for PretrainingBERT:
Users that are interested in PretrainingBERT are comparing it to the libraries listed below
- cRocoDiLe is a dataset extraction tool for Relation Extraction using Wikipedia and Wikidata presented in REBEL (EMNLP 2021).☆66Updated last year
- [WWW 2022] Topic Discovery via Latent Space Clustering of Pretrained Language Model Representations☆87Updated 3 years ago
- EMNLP'2021: Can Language Models be Biomedical Knowledge Bases?☆56Updated 2 years ago
- ☆44Updated 2 years ago
- Improving Biomedical Pretrained Language Models with Knowledge [BioNLP 2021]☆65Updated 2 years ago
- Multi^2OIE: Multilingual Open Information Extraction Based on Multi-Head Attention with BERT (Findings of ACL: EMNLP 2020)☆56Updated 2 years ago
- Code repository for BEEP (Biomedical Evidence Enhanced Predictions) clinical outcome prediction system☆26Updated last year
- Repo for Aspire - A scientific document similarity model based on matching fine-grained aspects of scientific papers.☆52Updated last year
- Code repo for EMNLP21 paper "Zero-Shot Information Extraction as a Unified Text-to-Triple Translation"☆108Updated 11 months ago
- BioM-Transformers: Building Large Biomedical Language Models with BERT, ALBERT and ELECTRA☆37Updated last year
- Data and models for the SciFact verification task.☆230Updated last year
- MedDistant19: Towards an Accurate Benchmark for Broad-Coverage Biomedical Relation Extraction (COLING 2022)☆17Updated 2 years ago
- Hierarchical Attention Transformers (HAT)☆54Updated last year
- ☆71Updated 7 months ago
- The source code of the Sudowoodo paper in ICDE 2023☆15Updated last year
- BioCreative-V CDR Corpus☆27Updated 6 years ago
- Neighborhood Contrastive Learning for Scientific Document Representations with Citation Embeddings (EMNLP 2022 paper)☆67Updated 2 years ago
- Collection of NLP model explanations and accompanying analysis tools☆145Updated last year
- ☆58Updated 2 years ago
- Code and model checkpoints for the MultiVerS model for scientific claim verification.☆45Updated last year
- A Python Commonsense Knowledge Inference Toolkit☆64Updated last year
- Dataset, models, and code for paper "CiteSum: Citation Text-guided Scientific Extreme Summarization and Low-resource Domain Adaptation", …☆33Updated 2 years ago
- [ACL 2022] LinkBERT: A Knowledgeable Language Model 😎 Pretrained with Document Links☆437Updated 3 years ago
- Simple Questions Generate Named Entity Recognition Datasets (EMNLP 2022)☆76Updated 2 years ago
- The autoregressive information extraction system GenIE (Generative Information Extraction) implemented in PyTorch.☆102Updated 2 years ago
- Pytorch Implementation of EncT5: Fine-tuning T5 Encoder for Non-autoregressive Tasks☆63Updated 3 years ago
- The official code for PRIMERA: Pyramid-based Masked Sentence Pre-training for Multi-document Summarization☆156Updated 2 years ago
- ☆41Updated 3 years ago
- Data and code for the SciFact-Open task☆25Updated last year
- ☆61Updated 2 years ago