dbklim / Russian_subtitles_dataset

Preprocessing of the dataset of 347 subtitles for the TV series (thanks to Taiga Corpus) to build a word2vec model, JamSpell model, neural network training, chat bot training or in any other NLP task.
23Updated 5 years ago

Related projects

Alternatives and complementary repositories for Russian_subtitles_dataset