ltgoslo / norbench
Natural language understanding benchmarks for Norwegian
☆13Updated 8 months ago
Related projects: ⓘ
- Sentiment Corpus for Swedish 🇸🇪 Norwegian 🇳🇴 Danish 🇩🇰 Finnish 🇫🇮 (and English 🏴)☆15Updated 3 years ago
- Norwegian Speech Transformer Models☆17Updated 5 months ago
- Dutch coreference resolution & dialogue analysis using deterministic rules☆21Updated last year
- Data for the HIPE 2022 shared task.☆14Updated 9 months ago
- Python Multilingual Ucrel Semantic Analysis System☆29Updated last month
- BERT and ELECTRA models trained on Europeana Newspapers☆35Updated 2 years ago
- Contextualised Word Representations for Lexical Semantic Change Analysis☆31Updated 4 years ago
- Data and evaluation code for the paper WikiNEuRal: Combined Neural and Knowledge-based Silver Data Creation for Multilingual NER (EMNLP 2…☆65Updated last year
- Repository for DISRPT2023 shared task☆16Updated last month
- An initiative to collect and distribute resources for co-reference resolution in a unified standard.☆23Updated 4 months ago
- T-Projection is a method to perform high-quality Annotation Projection of Sequence Labeling datasets.☆11Updated 9 months ago
- CD20200004 from 01/01/2021 to 31/12/2023 - LIG UGA - Python Notebook and Models for the MT Lab @ ALPS 2022☆14Updated 5 months ago
- ☆49Updated 6 months ago
- Repository with code for MaChAmp: https://aclanthology.org/2021.eacl-demos.22/☆80Updated 3 weeks ago
- Noise-robust de-duplication at scale☆15Updated last year
- This repository provides the source code used to automatically generate the book summarization datasets described in the paper titled "Ec…☆11Updated last year
- AfroLID, a powerful neural toolkit for African languages identification which covers 517 African languages.☆27Updated last year
- Python version for Doug Biber's Multidimensional Analysis (MDA)☆27Updated 3 months ago
- Train, evaluate, and use different unsupervised topic modelling algorithms using a RESTful API.☆36Updated 11 months ago
- Code for the paper "Simple, Interpretable and Stable Method for Detecting Words with Usage Change across Corpora", ACL 2020.☆18Updated 4 years ago
- Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages -- ACL 2023☆96Updated 5 months ago
- A survey of corpora for Germanic low-resource languages and dialects☆24Updated last month
- Word embeddings from PPMI-weighted and dirichlet-smoothed co-occurrence matrices☆10Updated 4 years ago
- A repository with several curated datasets of counter-narratives to fight online hate speech.☆82Updated last year
- FrameBERT: Conceptual Metaphor Detection with Frame Embedding Learning. Presented at EACL 2023.☆23Updated 9 months ago
- A python module for evaluating NERC and NEL system performances as defined in the HIPE shared tasks (formerly CLEF-HIPE-2020-scorer).