☆25Apr 28, 2020Updated 5 years ago
Alternatives and similar repositories for finer-data
Users that are interested in finer-data are comparing it to the libraries listed below.
- Open broad-coverage corpus for Finnish named entity recognition.☆12Aug 22, 2020Updated 5 years ago
- Latin texts annotated for named entities and NER tagger used for the Herodotos Project (Ohio State University / Ghent University)☆11Sep 26, 2022Updated 3 years ago
- Research into identifying and correcting incorrect labels in the CoNLL-2003 corpus.☆12May 11, 2021Updated 4 years ago
- Finnish data☆11Nov 12, 2025Updated 4 months ago
- Named entity recognition built on top of BERT and keras-bert.☆14Aug 20, 2020Updated 5 years ago
- XED multilingual emotion datasets☆64May 3, 2023Updated 2 years ago
- Multilingual Meta-Embeddings for Named Entity Recognition (RepL4NLP & EMNLP 2019)☆33Oct 11, 2022Updated 3 years ago
- ☆13Dec 17, 2021Updated 4 years ago
- German Language Understanding Evaluation Benchmark @NAACL24☆22Dec 11, 2025Updated 3 months ago
- Morphological analysis of Finnish language for Java☆13Feb 20, 2024Updated 2 years ago
- REMERGE - Multi-Word Expression discovery algorithm☆14Feb 21, 2026Updated last month
- Neural network based lemmatizer for Finnish language☆11Sep 10, 2020Updated 5 years ago
- CRF-based Morphological Tagging and Lemmatization☆38Oct 16, 2019Updated 6 years ago
- Sentiment Corpus for Swedish 🇸🇪 Norwegian 🇳🇴 Danish 🇩🇰 Finnish 🇫🇮 (and English 🏴)☆15May 3, 2021Updated 4 years ago
- [EMNLP 2021] MuVER: Improving First-Stage Entity Retrieval with Multi-View Entity Representations☆31May 23, 2022Updated 3 years ago
- Norwegian Named Entities annotations on top of NDT (Norwegian Dependency Treebank)☆71Sep 10, 2024Updated last year
- The SETimes.HR+ Croatian dependency treebank☆16Dec 27, 2016Updated 9 years ago
- Code for the paper Data-to-Text Generation with Iterative Text Editing☆14Mar 23, 2021Updated 5 years ago
- System for automatic pronominal resolution for Russian☆14Apr 3, 2020Updated 5 years ago
- Fact Enhanced News Generation☆12Jul 18, 2023Updated 2 years ago
- A neural network that jointly part-of-speech tags and lemmatizes sentences, boosting accuracy for morphologically-rich languages (Czech, …☆34Apr 5, 2019Updated 6 years ago
- CoNLL 2018 Shared Task Team UDPipe-Future☆39Sep 29, 2020Updated 5 years ago
- ☆31Dec 13, 2023Updated 2 years ago
- Chatbot Question Dataset Of Questions about the Covid-19 crisis☆11May 7, 2020Updated 5 years ago
- Dockerized yle-dl☆15Oct 28, 2025Updated 4 months ago
- Tools and dataset for participating in SemEval 2018 Task 2 "Multilingual Emoji Detection"☆17Apr 9, 2018Updated 7 years ago
- Scripts for compatibilitising between VISL-CG3, Apertium, CoNLL-X and Universal Dependencies☆17Mar 4, 2020Updated 6 years ago
- ☆21Oct 19, 2020Updated 5 years ago
- Code for ACL 2022 paper "Expanding Pretrained Models to Thousands More Languages via Lexicon-based Adaptation"☆30Apr 2, 2022Updated 3 years ago
- Use built-in macOS optical character recognition (OCR) via the command line☆18Nov 17, 2025Updated 4 months ago
- ☆21Sep 10, 2025Updated 6 months ago
- The Mingled Structured Predictor☆29Mar 28, 2024Updated last year
- Unsupervised method for extracting quotation-speaker pairs from large news corpora.☆29Jul 4, 2018Updated 7 years ago
- Morphosyntactic tagger for Norwegian bokmål and nynorsk☆29Jun 20, 2023Updated 2 years ago
- ☆17Jan 18, 2026Updated 2 months ago
- ☆10Aug 1, 2018Updated 7 years ago
- The code to conduct Bayesian geographically weighted regression☆11Feb 20, 2022Updated 4 years ago
- Code used to produce experimental results for the paper "Deep Structured Prediction with Nonlinear Output Activations"☆11May 6, 2019Updated 6 years ago
- ☆12Mar 4, 2022Updated 4 years ago