leondz / dagw_pageLinks
The Danish Gigaword project
☆16Updated 4 years ago
Alternatives and similar repositories for dagw_page
Users that are interested in dagw_page are comparing it to the libraries listed below
Sorting:
- Compass-aligned Distributional Embeddings. Align embeddings from different corpora☆41Updated 2 years ago
- DaCy: The State of the Art Danish NLP pipeline using SpaCy☆98Updated 9 months ago
- DaNLP is a repository for Natural Language Processing resources for the Danish Language.☆207Updated 8 months ago
- A scikit-learn compliant implementation of Monroe et al.'s Fightin' Words analysis method.☆11Updated 6 years ago
- ☆40Updated 4 years ago
- A python package to enrich Twitter Data☆75Updated 2 years ago
- Package to extract connotation frames☆90Updated last year
- ☆36Updated 9 months ago
- Repository for code and metadata to support work described in "Authorless Topic Models: Biasing Models Away from Known Structure"☆29Updated 5 years ago
- spaCy-wrap is a wrapper library for spaCy for including fine-tuned transformers from Huggingface in your spaCy pipeline allowing you to i…☆46Updated last year
- A collection of Danish Transformers☆30Updated 4 years ago
- Quick implementation of Monroe et al.'s algorithm for comparing languages☆53Updated 5 years ago
- Dataframe Integration with spaCy.☆103Updated 4 years ago
- 🧪 Cutting-edge experimental spaCy components and features☆101Updated last year
- Tools to train and explore diachronic word embeddings from Big Historical Data☆28Updated 8 months ago
- Bag of, not words, but tricks!☆68Updated last year
- Sentiment Corpus for Swedish 🇸🇪 Norwegian 🇳🇴 Danish 🇩🇰 Finnish 🇫🇮 (and English 🏴)☆15Updated 4 years ago
- BERT and ELECTRA models trained on Europeana Newspapers☆38Updated 3 years ago
- 🤘Lemmy is a lemmatizer for Danish 🇩🇰 and Swedish 🇸🇪☆77Updated 4 years ago
- Simple customizable pipeline tool for anonymizing Danish text.☆11Updated last year
- ☆23Updated 4 years ago
- Pre-trained Nordic models for BERT☆174Updated 3 years ago
- Augmenty is an augmentation library based on spaCy for augmenting texts.☆156Updated last year
- KIND: an Italian Multi-Domain Dataset for Named Entity Recognition☆15Updated 2 years ago
- Repository for the paper Us vs. Them: A Dataset of Populist Attitudes, News Bias and Emotions☆17Updated last year
- Interpretable data visualizations for understanding how texts differ at the word level☆281Updated 8 months ago
- Code for the CUP Elements on text analysis in Python for social scientists☆137Updated 3 years ago
- The robust European language model benchmark.☆129Updated this week
- Code and data for "Superbizarre Is Not Superb: Derivational Morphology Improves BERT's Interpretation of Complex Words"☆16Updated 4 years ago
- German Parliamentary Corpus (GerParCor)☆27Updated this week