buschmo / Simple-German-CorpusLinks
Code to create the dataset from "A New Aligned Simple German Corpus
☆10Updated last year
Alternatives and similar repositories for Simple-German-Corpus
Users that are interested in Simple-German-Corpus are comparing it to the libraries listed below
Sorting:
- TextComplexityDE dataset consists of 1000 sentences in the German language with subjective complexity rating, collected from German learn…☆13Updated 3 years ago
- A spaCy custom component that extracts and normalizes temporal expressions☆54Updated 2 years ago
- Sentiment Corpus for Swedish 🇸🇪 Norwegian 🇳🇴 Danish 🇩🇰 Finnish 🇫🇮 (and English 🏴)☆15Updated 4 years ago
- 🖋 Resource and Tool for Writing System Identification -- LREC 2024☆14Updated last year
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Do…☆80Updated 11 months ago
- BERT and ELECTRA models trained on Europeana Newspapers☆38Updated 3 years ago
- A python package to run inference with HuggingFace language and vision-language checkpoints wrapping many convenient features.☆27Updated 8 months ago
- A scikit-learn compliant implementation of Monroe et al.'s Fightin' Words analysis method.☆11Updated 6 years ago
- Deutsches Lyrik Korpus (DLK) / German Poetry Corpus☆18Updated last year
- Annotation Tool for Text Simplification Corpora☆17Updated last year
- A module to compute textual lexical richness (aka lexical diversity).☆108Updated last year
- Natural language understanding benchmarks for Norwegian☆14Updated last year
- spaCy-wrap is a wrapper library for spaCy for including fine-tuned transformers from Huggingface in your spaCy pipeline allowing you to i…☆46Updated last year
- 🧪 Cutting-edge experimental spaCy components and features☆99Updated last year
- SIGTYP 2024 Shared Task on Word Embedding Evaluation for Ancient and Historical Languages☆9Updated last year
- Information extraction from English and German texts based on predicate logic☆137Updated 2 years ago
- Dutch coreference resolution & dialogue analysis using deterministic rules☆21Updated 2 years ago
- Compiled tools, datasets, and other resources for historical text normalization.