jouniluoma / bert-ner-cmv
☆12Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for bert-ner-cmv
- Research code for the paper "How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models"☆26Updated 3 years ago
- Simple Questions Generate Named Entity Recognition Datasets (EMNLP 2022)☆76Updated last year
- EMNLP 2021 Tutorial: Multi-Domain Multilingual Question Answering☆38Updated 3 years ago
- As good as new. How to successfully recycle English GPT-2 to make models for other languages (ACL Findings 2021)☆46Updated 3 years ago
- Efficient-Sentence-Embedding-using-Discrete-Cosine-Transform☆17Updated 4 years ago
- ☆67Updated 3 years ago
- Implementation of Nested Named Entity Recognition using Flair☆24Updated 3 years ago
- BERT models for many languages created from Wikipedia texts☆34Updated 4 years ago
- A text augmentation tool for named entity recognition.☆53Updated 3 years ago
- reference pytorch code for intent classification☆45Updated last month
- SeqScore: Scoring for named entity recognition and other sequence labeling tasks☆21Updated last month
- ☆73Updated 3 years ago
- ☆33Updated last year
- ☆16Updated last year
- ☆29Updated 2 years ago
- ☆36Updated 2 years ago
- Code for our TSD paper "TOKEN is a MASK: Few-shot Named Entity Recognition with Pre-trained Language Models"☆14Updated 2 years ago
- Official codebase accompanying our ACL 2022 paper "RELiC: Retrieving Evidence for Literary Claims" (https://relic.cs.umass.edu).☆20Updated 2 years ago
- 🐸 KERMIT - A lightweight library to encode and interpret Universal Syntactic Embeddings☆58Updated last year
- ACL22 paper: Imputing Out-of-Vocabulary Embeddings with LOVE Makes Language Models Robust with Little Cost☆39Updated last year
- The dataset and code for ACL 2022 paper "SciNLI: A Corpus for Natural Language Inference on Scientific Text" are released here.☆25Updated last year
- ☆56Updated 3 years ago
- GC4LM: A Colossal (Biased) language model for German☆13Updated 3 years ago
- This repository contains the code for "BERTRAM: Improved Word Embeddings Have Big Impact on Contextualized Representations".☆63Updated 4 years ago
- A repository for our AAAI-2020 Cross-lingual-NER paper. Code will be updated shortly.☆46Updated last year
- Efficient Sentence Embedding via Semantic Subspace Analysis☆14Updated 4 years ago
- Data programming by demonstration for information extraction and span annotation☆35Updated 3 years ago
- This repository contains the code for applying One-Token Approximation to a pretrained language model using subword-level tokenization.☆11Updated 4 years ago
- Pre-training BART in Flax on The Pile dataset☆20Updated 3 years ago
- Repository with code for MaChAmp: https://aclanthology.org/2021.eacl-demos.22/☆82Updated last month