☆25Apr 28, 2020Updated 5 years ago
Alternatives and similar repositories for finer-data
Users who are interested in finer-data are comparing it to the libraries listed below.
- Open broad-coverage corpus for Finnish named entity recognition.☆11Aug 22, 2020Updated 5 years ago
- Latin texts annotated for named entities and NER tagger used for the Herodotos Project (Ohio State University / Ghent University)☆11Sep 26, 2022Updated 3 years ago
- Unsupervised Cross-lingual Sentiment Analysis (CoNLL 2019)☆10Nov 4, 2019Updated 6 years ago
- Research into identifying and correcting incorrect labels in the CoNLL-2003 corpus.☆12May 11, 2021Updated 4 years ago
- Use spaCy for NLP and output to the FoLiA XML format.☆12Feb 27, 2024Updated 2 years ago
- ☆13Dec 17, 2021Updated 4 years ago
- German Language Understanding Evaluation Benchmark @NAACL24☆22Dec 11, 2025Updated 2 months ago
- A lexicon compiler for non-suffixational morphologies☆13Jan 29, 2026Updated last month
- ☆16Jan 20, 2022Updated 4 years ago
- A neural network that jointly part-of-speech tags and lemmatizes sentences, boosting accuracy for morphologically-rich languages (Czech, …☆34Apr 5, 2019Updated 6 years ago
- Neural network based lemmatizer for Finnish language☆11Sep 10, 2020Updated 5 years ago
- Scripts for compatibilitising between VISL-CG3, Apertium, CoNLL-X and Universal Dependencies☆17Mar 4, 2020Updated 5 years ago
- 🕸 GlotCC Dataset and Pipeline -- NeurIPS 2024☆20Apr 6, 2025Updated 10 months ago
- Finite state and Constraint Grammar based analysers and proofing tools, and language resources for the Faroese language☆18Updated this week
- Corrects the active web page from Nynorsk to Norwegian (Bokmål), for greater reading enjoyment.☆21Sep 16, 2025Updated 5 months ago
- Norwegian Named Entities annotations on top of NDT (Norwegian Dependency Treebank)☆71Sep 10, 2024Updated last year
- Named entity recognition built on top of BERT and keras-bert.☆14Aug 20, 2020Updated 5 years ago
- Experimental Finnish language model for SpaCy☆43Nov 14, 2024Updated last year
- CoNLL 2018 Shared Task Team UDPipe-Future☆39Sep 29, 2020Updated 5 years ago
- ☆21Sep 10, 2025Updated 5 months ago
- REMERGE - Multi-Word Expression discovery algorithm☆14Feb 21, 2026Updated last week
- Tools and dataset to participate in SemEval 2018 Task 2 "Multilingual Emoji Detection"☆17Apr 9, 2018Updated 7 years ago
- A neural parsing pipeline for segmentation, morphological tagging, dependency parsing and lemmatization with pre-trained models for more …☆115May 7, 2024Updated last year
- The Hazy Haskell Compiler☆55Updated this week
- Overview of corpora/datasets for Germanic low-resource languages and dialects. Accompanies "A Survey of Corpora for Germanic Low-Resource…☆26Feb 16, 2026Updated 2 weeks ago
- ☆19Sep 29, 2019Updated 6 years ago
- ☆21Oct 19, 2020Updated 5 years ago
- Text processing library for sentiment analysis and related tasks☆27Oct 25, 2018Updated 7 years ago
- Processing the MPQA Corpus☆27Sep 22, 2018Updated 7 years ago
- Morphosyntactic tagger for Norwegian bokmål and nynorsk☆29Jun 20, 2023Updated 2 years ago
- LambdaBuffers toolkit for sharing types and their semantics between different languages☆32Feb 22, 2026Updated last week
- ☆29Dec 23, 2019Updated 6 years ago
- XED multilingual emotion datasets☆64May 3, 2023Updated 2 years ago
- Ubiflux Vigor ventilation system RS485 Modbus communications with Python☆11Feb 20, 2026Updated last week
- Implementation of Nested Named Entity Recognition using Flair☆24Oct 29, 2021Updated 4 years ago
- ParlaMint: Comparable Parliamentary Corpora☆74Nov 2, 2025Updated 4 months ago
- Full named-entity (i.e., not tag/token) evaluation metrics based on SemEval’13☆206Updated this week
- Simple Go wrapper for Smart-ID API by SK ID Solutions☆10Mar 30, 2023Updated 2 years ago
- Super Mario is a legendary game we all cherish! In this project, we will deploy Super Mario on Amazon EKS (Elastic Kubernetes Service) us…☆11Feb 3, 2026Updated last month