☆25Apr 28, 2020Updated 6 years ago
Alternatives and similar repositories for finer-data
Users that are interested in finer-data are comparing it to the libraries listed below.
- Open broad-coverage corpus for Finnish named entity recognition.☆12Aug 22, 2020Updated 5 years ago
- Experimental Finnish language model for SpaCy☆43Apr 13, 2026Updated last month
- A neural parsing pipeline for segmentation, morphological tagging, dependency parsing and lemmatization with pre-trained models for more …☆116May 7, 2024Updated 2 years ago
- Latin texts annotated for named entities and NER tagger used for the Herodotos Project (Ohio State University / Ghent University)☆12Sep 26, 2022Updated 3 years ago
- HFST spell checker library and command line tool☆15Feb 20, 2024Updated 2 years ago
- Research into identifying and correcting incorrect labels in the CoNLL-2003 corpus.☆12May 11, 2021Updated 5 years ago
- HFST optimized-lookup standalone library and command line tool☆13Feb 27, 2018Updated 8 years ago
- XED multilingual emotion datasets☆64May 3, 2023Updated 3 years ago
- Multilingual Meta-Embeddings for Named Entity Recognition (RepL4NLP & EMNLP 2019)☆33Oct 11, 2022Updated 3 years ago
- German Language Understanding Evaluation Benchmark @NAACL24☆23Dec 11, 2025Updated 5 months ago
- [NeurIPS 2024] 🕸 GlotCC Dataset and Pipeline☆20Apr 6, 2025Updated last year
- Morphological analysis of Finnish language for Java☆13Feb 20, 2024Updated 2 years ago
- REMERGE - Multi-Word Expression discovery algorithm☆14Feb 21, 2026Updated 3 months ago
- Neural network based lemmatizer for Finnish language☆11Sep 10, 2020Updated 5 years ago
- CRF-based Morphological Tagging and Lemmatization☆38Oct 16, 2019Updated 6 years ago
- Sentiment Corpus for Swedish 🇸🇪 Norwegian 🇳🇴 Danish 🇩🇰 Finnish 🇫🇮 (and English 🏴)☆15May 3, 2021Updated 5 years ago
- ☆19Sep 29, 2019Updated 6 years ago
- [EMNLP 2021] MuVER: Improving First-Stage Entity Retrieval with Multi-View Entity Representations☆32May 23, 2022Updated 4 years ago
- Byte-level byte pair encoding (BPE) in Haskell☆17May 27, 2024Updated last year
- Natural language processing in examples and games☆25Mar 11, 2026Updated 2 months ago
- Norwegian Named Entities annotations on top of NDT (Norwegian Dependency Treebank)☆71Sep 10, 2024Updated last year
- The SETimes.HR+ Croatian dependency treebank☆16Dec 27, 2016Updated 9 years ago
- A lexicon compiler for non-suffixational morphologies☆14Jan 29, 2026Updated 3 months ago
- Fact Enhanced News Generation☆12Jul 18, 2023Updated 2 years ago
- A neural network that jointly part-of-speech tags and lemmatizes sentences, boosting accuracy for morphologically-rich languages (Czech, …☆34Apr 5, 2019Updated 7 years ago
- A Java library for manipulating JSGF Grammars.☆12Dec 30, 2021Updated 4 years ago
- CoNLL 2018 Shared Task Team UDPipe-Future☆39Sep 29, 2020Updated 5 years ago
- Finite state and Constraint Grammar based analysers and proofing tools, and language resources for the Faroese language☆18Updated this week
- Swedish translated, AFINN-based sentiment analysis for Node.js.☆13Jun 1, 2017Updated 8 years ago
- ☆31Dec 13, 2023Updated 2 years ago
- ☆32Aug 4, 2021Updated 4 years ago
- Overview of corpora/datasets for Germanic low-resource languages and dialects. Accompanies "A Survey of Corpora for Germanic Low-Resource…☆27Feb 16, 2026Updated 3 months ago
- Converts the active web page from Nynorsk to Norwegian (Bokmål), for greater reading pleasure.☆20Sep 16, 2025Updated 8 months ago
- The home repository of the NerKor corpus, a Hungarian gold standard named entity annotated corpus containing 1 million tokens.☆16Sep 20, 2023Updated 2 years ago
- ☆13May 4, 2017Updated 9 years ago
- Dockerized yle-dl☆15Oct 28, 2025Updated 6 months ago
- Tools and Dataset to participate in SemEval 2018 Task 2 "Multilingual Emoji Detection"☆17Apr 9, 2018Updated 8 years ago
- Use spaCy for NLP and output to the FoLiA XML format.☆12Feb 27, 2024Updated 2 years ago
- A pure-python implementation of BK-Trees☆17Feb 19, 2023Updated 3 years ago