youscan / language-models
☆26Updated last year
Related projects
Alternatives and complementary repositories for language-models
- Dictionary of obscene words for Ukrainian language☆17Updated 3 years ago
- Russian RoBERTa☆29Updated 4 years ago
- ☆20Updated 7 years ago
- Probing suite for evaluation of Russian embedding and language models☆32Updated last month
- A corpus of Ukrainian Twitter texts + instructions for downloading and filtering texts.☆15Updated 5 years ago
- ☆36Updated last year
- NLP workshop at DataFest Siberia 2019☆22Updated last year
- Train punctuation and capitalization models for different languages☆24Updated 2 years ago
- PyTorch library for end-to-end transformer model training, inference and serving☆70Updated 2 years ago
- "Rossiya Segodnya" news dataset☆45Updated 5 years ago
- http://www.dialog-21.ru/evaluation/2016/letter/☆56Updated 7 years ago
- Russian SuperGLUE benchmark☆108Updated last year
- RuSimpleSentEval (RSSE) shared task repo☆21Updated 3 years ago
- ☆54Updated 6 years ago
- Simple Python lib to tokenize texts into sentences and sentences into words. Small, fast and robust. Comes with a Ukrainian flavour☆60Updated last year
- AWD-LSTM language model trained on newspaper corpora with fast.ai☆27Updated 4 years ago
- Code for AINL2018 paper Deep Convolutional Networks for Supervised Morpheme Segmentation of Russian Language☆18Updated 5 years ago
- Fine-tuned Multilingual BERT and Multilingual USE for sentiment analysis in Russian. RuReviews, RuSentiment, Kaggle Russian News Dataset,…☆48Updated 3 years ago
- A collection of datasets for Ukrainian language☆55Updated 3 months ago
- Library for extracting statistics from Russian-language texts.☆103Updated last year
- ☆30Updated 5 years ago
- BSNLP 2021☆32Updated last week
- Tools for shrinking fastText models (in gensim format)☆171Updated 6 months ago
- ☆78Updated 2 years ago
- (re)Implementation of Learning Multi-level Dependencies for Robust Word Recognition☆17Updated 3 months ago
- Code and data of "Methods for Detoxification of Texts for the Russian Language" paper☆46Updated 2 months ago
- Shared BERT model for four languages: Bulgarian, Czech, Polish and Russian. Slavic NER model.☆73Updated 2 years ago
- Ukrainian instruction-tuned language models and datasets☆84Updated 3 months ago
- A small library with distillation, quantization and pruning pipelines☆26Updated 3 years ago
- ☆23Updated 2 years ago