LydiaXiaohongLi / Albert_Finetune_with_Pretrain_on_Custom_Corpus
1. Pretrain Albert on custom corpus 2. Finetune the pretrained Albert model on downstream task
☆33Updated 4 years ago
Related projects ⓘ
Alternatives and complementary repositories for Albert_Finetune_with_Pretrain_on_Custom_Corpus
- Self-supervised NER prototype - updated version (69 entity types - 17 broad entity groups). Uses pretrained BERT models with no fine tuni…☆80Updated 2 years ago
- Scalable Attentive Sentence-Pair Modeling via Distilled Sentence Embedding (AAAI 2020) - PyTorch Implementation☆31Updated last year
- ☆29Updated 4 years ago
- ☆42Updated 4 years ago
- BERT which stands for Bidirectional Encoder Representations from Transformations is the SOTA in Transfer Learning in NLP.☆56Updated 4 years ago
- [NAACL 2019] code for "Pragmatically Informative Text Generation" https://arxiv.org/abs/1904.01301☆47Updated 5 years ago
- Named Entity Recognition as Dependency Parsing☆39Updated 4 years ago
- Fine-tuned Transformers compatible BERT models for Sequence Tagging☆40Updated 4 years ago
- Annotated corpus and code for "Extracting COVID-19 Events from Twitter".☆46Updated 2 years ago
- Code for ACL '19 paper: Towards Improving Neural Named Entity Recognition with Gazetteers☆32Updated 3 years ago
- Implementation of paper "Learning to Encode Text as Human-Readable Summaries using GAN"☆65Updated 5 years ago
- ☆34Updated 2 years ago
- ☆36Updated 3 years ago
- EmbedRank implemented in Python.☆15Updated 5 months ago
- Evidence-based QA system for community question answering.☆103Updated 3 years ago
- Source code for paper Neural Architectures for Nested NER through Linearization☆91Updated 5 years ago
- a Fairseq fork for sequence tagging/labeling tasks☆31Updated 4 years ago
- Source code for our AAAI 2020 paper P-SIF: Document Embeddings using Partition Averaging☆35Updated 4 years ago
- ☆31Updated last year
- Research code for ACL 2020 paper: "Distilling Knowledge Learned in BERT for Text Generation".☆130Updated 3 years ago
- A repository for our AAAI-2020 Cross-lingual-NER paper. Code will be updated shortly.☆46Updated last year
- ☆40Updated 3 years ago
- NoiseMix - data generation for natural language☆41Updated 6 years ago
- Dataset and baseline for ACL 2019 paper "XQA: A Cross-lingual Open-domain Question Answering Dataset"☆89Updated 3 years ago
- Joint Extraction & Compression text Summarization☆41Updated 5 years ago
- CharBERT: Character-aware Pre-trained Language Model (COLING2020)☆117Updated 3 years ago
- Formate converter from one type of qa task datasets to another type☆39Updated 5 years ago
- CIKM 2020: Speaker-Aware BERT for Multi-Turn Response Selection in Retrieval-Based Chatbots☆75Updated 4 years ago
- Selections from EMNLP 2020☆59Updated 3 years ago