TextoKit is a set of components for Natural Language Processing based on the Apache UIMA platform.
☆16Jul 6, 2016Updated 9 years ago
Alternatives and similar repositories for textokit-core
Users that are interested in textokit-core are comparing it to the libraries listed below.
- Distributed implementation of Robust PLSA using Spark☆12Apr 29, 2021Updated 4 years ago
- A set of Apache UIMA addons & utilities. Some of them are language-independent; the others may be Russian language-specific.☆28Oct 8, 2021Updated 4 years ago
- Extended Wikilinks dataset description☆15Apr 1, 2018Updated 7 years ago
- MapReduce performance testing using teragen and terasort☆18Aug 26, 2021Updated 4 years ago
- Word Embeddings for Low Resource Languages: The Case of Buryat☆10Mar 12, 2025Updated last year
- My best solution for mlbootcamp4 competition☆11Jun 11, 2017Updated 8 years ago
- ☆13Nov 20, 2016Updated 9 years ago
- Online service for speech and text markup☆29Nov 5, 2014Updated 11 years ago
- Weka package for parameter optimization, similar to GridSearch, but with arbitrary number of parameters.☆10Feb 17, 2021Updated 5 years ago
- Samsung Natural Language Processing Pipeline (basically for Russian language): morphology, dependency parser and much more☆59Oct 3, 2020Updated 5 years ago
- "Rossiya Segodnya" news dataset☆46Sep 25, 2019Updated 6 years ago
- Habrahabr API Client Library for Python☆37Jun 20, 2014Updated 11 years ago
- http server to take screenshots of websites☆15Sep 13, 2012Updated 13 years ago
- ☆24Aug 15, 2017Updated 8 years ago
- Run a local PostgreSQL☆16Oct 3, 2018Updated 7 years ago
- stripe demo using golang ;)☆12Apr 6, 2015Updated 10 years ago
- Russian data from the SynTagRus corpus.☆86Nov 12, 2025Updated 4 months ago
- A simple Go app for Deis, the open source PaaS☆22May 19, 2017Updated 8 years ago
- Python integration for the GATE framework☆21Nov 6, 2024Updated last year
- Using embedding-based loss functions for phonetics/speech recognition.☆17Nov 24, 2014Updated 11 years ago
- Collection with descriptions of super-resolution related papers, repositories, datasets, loss functions, etc.☆11Dec 12, 2023Updated 2 years ago
- A set of methods for finding an appropriate number of topics in a text collection☆16Apr 14, 2025Updated 11 months ago
- LUNA: a Framework for Language Understanding and Naturalness Assessment.☆12Sep 9, 2023Updated 2 years ago
- Analysis of Russian mass media articles about internet regulation with structural topic modeling☆11May 15, 2018Updated 7 years ago
- Library of text structuration components☆31Dec 4, 2021Updated 4 years ago
- Evaluation tools for the RUSSE evaluation campaign.☆37Jun 11, 2017Updated 8 years ago
- JSON-RPC 2.0 Implementation in Rust☆13Jul 13, 2017Updated 8 years ago
- [experiment] CRF-based disambiguation engine for pymorphy2☆10May 9, 2016Updated 9 years ago
- Morphological analyzer `mystem` (Russian language) wrapper for JVM languages☆24Aug 29, 2024Updated last year
- PANiC - PAraphrasing Noun-Compounds☆15Apr 6, 2018Updated 7 years ago
- v4l2 implementation for erlang. Simple and working.☆11Jun 2, 2020Updated 5 years ago
- An Easy Annotation Tool for Natural Language Processing☆11May 17, 2024Updated last year
- Toxic Comments Detection in Russian.☆29Feb 19, 2021Updated 5 years ago
- RUSSE: Russian Semantic Evaluation.☆16Mar 1, 2022Updated 4 years ago
- Narwhal is a keyword and KEY NARRATIVE manager that creates language-aware classes. Because Narwhal does not use NLP it avoids complexity…☆12Oct 16, 2018Updated 7 years ago
- Data and code for the experiments in the Outlier Detection task proposed by Camacho-Collados et al.☆13Aug 28, 2018Updated 7 years ago
- QuickCheck implementation for Crystal Language☆12Mar 29, 2016Updated 9 years ago
- Rust client library for ClickHouse☆17Jan 4, 2026Updated 2 months ago
- Watset: Automatic Induction of Synsets from a Graph of Synonyms☆16Jul 7, 2019Updated 6 years ago