stefan-it / gc4lm
GC4LM: A Colossal (Biased) language model for German
☆13Updated 3 years ago
Alternatives and similar repositories for gc4lm:
Users that are interested in gc4lm are comparing it to the libraries listed below
- Identifying Historical People, Places and other Entities: Shared Task on Named Entity Recognition and Linking on Historical Newspapers at…☆22Updated 7 months ago
- A survey of corpora for Germanic low-resource languages and dialects☆25Updated 3 months ago
- UFSAC is a resource containing all WordNet Sense Annotated Corpora, and a Java library for manipulating them☆37Updated 2 years ago
- Converter from UD-trees to BART representation☆36Updated last year
- Compiled tools, datasets, and other resources for historical text normalization.☆18Updated 5 years ago
- Analyze Argumentation and Rhetorical Aspects in Scientific Writing.☆19Updated 2 years ago
- BERT and ELECTRA models trained on Europeana Newspapers☆37Updated 3 years ago
- Neural models for detecting and masking personal information from texts☆15Updated 2 years ago
- ParaNames: A multilingual resource for parallel names☆31Updated 10 months ago
- ☆25Updated last year
- An easy-to-use API for analyzing INCEpTION annotation projects.☆16Updated last year
- A python module for evaluating NERC and NEL system performances as defined in the HIPE shared tasks (formerly CLEF-HIPE-2020-scorer).☆13Updated 9 months ago
- Repository for "Towards Robust Named Entity Recognition for Historic German"☆18Updated 4 years ago
- Multilingual Open Text☆25Updated 4 months ago
- Parsing only with Pretraining Networks☆16Updated 7 months ago
- Repository for rstWeb, a browser based annotation interface for Rhetorical Structure Theory☆44Updated 4 months ago
- Repository with code for MaChAmp: https://aclanthology.org/2021.eacl-demos.22/☆84Updated 2 weeks ago
- numeric fused-head identification and resolution☆33Updated 5 years ago
- Tower Parse: Low-Resource Dependency Parsing via Hierarchical Source Selection☆15Updated 3 years ago
- Python source code for EMNLP 2021 Findings paper: "Subword Mapping and Anchoring Across Languages".☆13Updated 3 years ago
- As good as new. How to successfully recycle English GPT-2 to make models for other languages (ACL Findings 2021)☆48Updated 3 years ago
- An implementation of GrASP (Shnarch et. al., 2017)☆21Updated 2 years ago
- ☆24Updated 5 years ago
- The Universal Decompositional Semantics (UDS) dataset and the Decomp toolkit☆57Updated last year
- Alignment and annotation for comparable documents.☆22Updated 6 years ago
- Statistics on multilingual datasets☆17Updated 2 years ago
- Frame-Semantic and PropBank Semantic Role Labeling with Syntactic Scaffolding.☆50Updated 3 years ago
- Automatically detect errors in annotated corpora.