gooofy / transformer-lm
Transformer language model (GPT-2) with sentencepiece tokenizer
☆10Updated 5 years ago
Alternatives and similar repositories for transformer-lm:
Users that are interested in transformer-lm are comparing it to the libraries listed below
- BERT and ELECTRA models trained on Europeana Newspapers☆38Updated 3 years ago
- Coreference resolution for German☆16Updated 7 years ago
- Named Entity Recognition (LSTM + CRF + FastText) with models for [historic] German☆26Updated 4 years ago
- Repository for rstWeb, a browser based annotation interface for Rhetorical Structure Theory☆43Updated 6 months ago
- ☆16Updated 5 years ago
- Plan and train German transformer models.☆23Updated 4 years ago
- A python wrapper for the multilingual temporal tagger HeidelTime.☆26Updated 3 years ago
- Mining Discourse Markers for Unsupervised Sentence Representation Learning☆60Updated last year
- Training Temporal Word Embeddings with a Compass☆64Updated 2 years ago
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Do…☆80Updated 10 months ago
- An annotated corpus of argumentative microtexts☆39Updated 2 years ago
- Multi-Annotator Competence Estimation tool☆63Updated 5 years ago
- ☆15Updated 6 years ago
- Repository for the Georgetown University Multilayer Corpus (GUM)☆94Updated last week
- Alignment and annotation for comparable documents.☆22Updated 6 years ago
- German GPT-2 model☆32Updated 3 years ago
- Code and data for: Low Resource Grammatical Error Correction Using Wikipedia Edits (WNUT 2018)☆16Updated 9 months ago
- Repository for code and metadata to support work described in "Authorless Topic Models: Biasing Models Away from Known Structure"☆28Updated 4 years ago
- CONLL-U to Pandas DataFrame☆31Updated 7 years ago
- DBMDZ BERT, DistilBERT, ELECTRA, GPT-2 and ConvBERT models☆157Updated 2 years ago
- Corpus and annotations for the CL-Aff Shared Task from the University of Pennsylvania☆19Updated 3 years ago
- ☆44Updated 2 years ago
- Linguistic and stylistic complexity measures for (literary) texts☆81Updated last year
- The Universal Decompositional Semantics (UDS) dataset and the Decomp toolkit☆57Updated last year
- Diagnostic tests for linguistic capacities in language models☆66Updated 3 years ago
- GC4LM: A Colossal (Biased) language model for German☆13Updated 4 years ago
- A package for handy processing of semantic graphs such as AMR, with a special focus on standardized evaluation☆23Updated last week
- This is a german ELMo deep contextualized word representation. It is trained on a special German Wikipedia Text Corpus.☆28Updated 5 years ago
- ☆54Updated 3 years ago
- Code for the paper "Simple, Interpretable and Stable Method for Detecting Words with Usage Change across Corpora", ACL 2020.☆18Updated 4 years ago