machine-intelligence-laboratory / TopicNet
Interface for easier topic modelling.
☆139Updated 3 months ago
Related projects ⓘ
Alternatives and complementary repositories for TopicNet
- Tools for shrinking fastText models (in gensim format)☆173Updated 6 months ago
- Russian RoBERTa☆29Updated 4 years ago
- NER, syntax markup visualizations☆134Updated last year
- Probing suite for evaluation of Russian embedding and language models☆32Updated last month
- "Rossiya Segodnya" news dataset☆45Updated 5 years ago
- ☆36Updated last year
- Models for automatic abstractive summarization☆171Updated 2 years ago
- DEREK (Domain Entities and Relations Extraction Kit)☆10Updated last year
- RuREBus shared task repo☆30Updated 3 years ago
- nlp workshop at datafest siberia 2019☆22Updated last year
- Named entity recognizer based on ELMo or BERT as feature extractor and CRF as final classifier☆81Updated last year
- Gazeta: Dataset for automatic summarization of Russian news / Газета: набор данных для автоматического реферирования на русском языке☆32Updated 3 years ago
- A list of pretrained Transformer models for the Russian language.☆174Updated 4 years ago
- Topic modeling with BigARTM: an interactive book☆59Updated 5 years ago
- Large silver standart Russian corpus with NER, morphology and syntax markup☆61Updated last year
- Simple library to work with pre-trained ELMo models in TensorFlow☆52Updated last year
- Mini-library for producing graph visualizations from embedding models☆28Updated 4 years ago
- Code and data of "Methods for Detoxification of Texts for the Russian Language" paper☆46Updated 2 months ago
- BSNLP 2021☆32Updated 2 weeks ago
- Pytorch library for end-to-end transformer models training, inference and serving☆70Updated 2 years ago
- Project on text topics evolution over time analysis☆82Updated 2 years ago
- http://nlp.seas.harvard.edu/2018/04/03/attention.html☆63Updated 3 years ago
- A simple and fast rule-based sentence segmentation. Tested on OpenCorpora and SynTagRus datasets.☆53Updated 6 years ago
- Russian SuperGLUE benchmark☆108Updated last year
- Библиотека для извлечения статистик из текстов на русском языке.☆103Updated last year
- REST API for sentence tokenization and embedding using Multilingual Universal Sentence Encoder.☆52Updated 3 years ago
- Russian paraphrasers. Generate paraphrases with mt5, gpt2, etc.☆52Updated last year
- NLP course @ CS Faculty, HSE☆15Updated 4 years ago
- RUSSE 2022: Russian Text Detoxification Based on Parallel Corpora☆20Updated 2 years ago