stefan-it / gc4lm
GC4LM: A Colossal (Biased) language model for German
☆13Updated 3 years ago
Alternatives and similar repositories for gc4lm:
Users that are interested in gc4lm are comparing it to the libraries listed below
- Identifying Historical People, Places and other Entities: Shared Task on Named Entity Recognition and Linking on Historical Newspapers at…☆22Updated 6 months ago
- UFSAC is a resource containing all WordNet Sense Annotated Corpora, and a Java library for manipulating them☆37Updated 2 years ago
- Analyze Argumentation and Rhetorical Aspects in Scientific Writing.☆19Updated 2 years ago
- Converter from UD-trees to BART representation☆36Updated 11 months ago
- A survey of corpora for Germanic low-resource languages and dialects☆24Updated 2 months ago
- BERT models for many languages created from Wikipedia texts☆33Updated 4 years ago
- Compiled tools, datasets, and other resources for historical text normalization.☆17Updated 5 years ago
- Tool for parsing and converting various span encoding schemes.☆22Updated last year
- BERT and ELECTRA models trained on Europeana Newspapers☆37Updated 3 years ago
- Statistics on multilingual datasets☆17Updated 2 years ago
- Multilingual Open Text☆25Updated 3 months ago
- Efficient Sentence Embedding via Semantic Subspace Analysis☆14Updated 4 years ago
- MultiLexNorm 2021 competition system from ÚFAL☆15Updated 3 years ago
- List of corpora annotated for coreference for different languages☆17Updated 6 months ago
- ☆13Updated 3 years ago
- A python module for evaluating NERC and NEL system performances as defined in the HIPE shared tasks (formerly CLEF-HIPE-2020-scorer).☆13Updated 8 months ago
- several algorithms for converting dependency structures into constituency structures.☆10Updated 3 years ago
- UniParse: A universal graph-based parsing toolkit☆10Updated 5 years ago
- ☆24Updated 5 years ago
- numeric fused-head identification and resolution☆33Updated 5 years ago
- ☆17Updated 2 years ago
- An easy-to-use API for analyzing INCEpTION annotation projects.☆16Updated last year
- ParaNames: A multilingual resource for parallel names☆30Updated 9 months ago
- ☆64Updated 2 years ago
- An implementation of GrASP (Shnarch et. al., 2017)☆21Updated 2 years ago
- Code and data related to "Efficient, Compositional, Order-Sensitive n-gram Embeddings" (EACL 2017)☆14Updated 7 years ago
- A Word Sense Disambiguation system integrating implicit and explicit external knowledge.☆68Updated 3 years ago
- Wikipedia EXhaustive Entity Annotator (LREC 2020)☆15Updated 9 months ago
- codebase for the Text-based NP Enrichment (TNE) paper☆20Updated 11 months ago
- Alignment and annotation for comparable documents.☆22Updated 6 years ago