ltgoslo / simple_elmo_training
Minimal code to train ELMo models in recent versions of TensorFlow
☆14Updated last year
Alternatives and similar repositories for simple_elmo_training:
Users that are interested in simple_elmo_training are comparing it to the libraries listed below
- numeric fused-head identification and resolution☆33Updated 5 years ago
- ☆17Updated last year
- Converter from UD-trees to BART representation☆36Updated last year
- An implementation of GrASP (Shnarch et. al., 2017)☆21Updated 2 years ago
- Learning BPE embeddings by first learning a segmentation model and then training word2vec☆19Updated 2 years ago
- Efficient Sentence Embedding via Semantic Subspace Analysis☆14Updated 5 years ago
- A simple neural truecaser written in pytorch and allennlp.☆33Updated 8 months ago
- GC4LM: A Colossal (Biased) language model for German☆13Updated 3 years ago
- Fast IdEntification of State-of-The-Art models using adaptive bandit algorithms☆14Updated 2 years ago
- An example of how to use spaCy for extremely large files without running into memory issues☆36Updated 2 years ago
- Easy-to-use text representations extraction library based on the Transformers library.☆32Updated 2 years ago
- ☆17Updated 2 years ago
- ☆22Updated 2 years ago
- Generate BERT vocabularies and pretraining examples from Wikipedias☆18Updated 4 years ago
- ☆24Updated 5 years ago
- Featurize words into orthographic and phonological vectors.☆40Updated last year
- Statistics on multilingual datasets☆17Updated 2 years ago
- Combining encoder-based language models☆11Updated 3 years ago
- Implementation of Nested Named Entity Recognition using Flair☆24Updated 3 years ago
- A small repository to test Captum Explainable AI with a trained Flair transformers-based text classifier.☆26Updated 3 years ago
- SCoPE: Sentence Content Paragraph Embeddings☆18Updated 5 years ago
- Resources related to EMNLP 2021 paper "FAME: Feature-Based Adversarial Meta-Embeddings for Robust Input Representations"☆13Updated 3 years ago
- Getting interpretable dimensions in word embedding spaces.☆14Updated last year
- TopicScan: Visualization and validation interface for NMF Topic Modeling☆23Updated 4 years ago
- The repository for the paper "When Do You Need Billions of Words of Pretraining Data?"☆21Updated 4 years ago
- A framework to identify relations between ideas in temporal text corpora.☆28Updated 6 years ago
- ARCADE198 Dataset from the ACL 2018 MRQA Workshop☆15Updated 6 years ago
- A library for data streaming and augmentation☆20Updated 11 months ago
- ☆15Updated 4 years ago
- Code for reproducing our paper: LMSOC: An Approach for Socially Sensitive Pretraining☆12Updated 3 years ago