dbklim/Russian_subtitles_dataset

Preprocessing of a dataset of 347 TV series subtitles (thanks to Taiga Corpus) for building a word2vec model or a JamSpell model, training a neural network or a chat bot, or any other NLP task.

☆26

Alternatives and similar repositories for Russian_subtitles_dataset

Users who are interested in Russian_subtitles_dataset are comparing it to the repositories listed below.

FastTrackiverse / fasttrackpy
A fasttrack implementation in Python
☆13 · Feb 10, 2026 · Updated 5 months ago
kuk / crawl-vk-catalog
☆16 · May 19, 2016 · Updated 10 years ago
ainagari / monopoly
☆14 · Nov 22, 2024 · Updated last year
ai-forever / fbc3_aij2023
☆22 · Oct 4, 2023 · Updated 2 years ago
CoEDL / elan-helpers
Tools and scripts for working with ELAN
☆10 · Aug 4, 2022 · Updated 3 years ago
agricolamz / lingglosses
R package that helps to render interlinear glossed linguistic examples in html rmarkdown documents and then semi-automatically compiles t…
☆17 · Nov 18, 2025 · Updated 8 months ago
beta-decay / Sumerian
A programming language written in the ancient language Sumerian (𒅴 𒆰)
☆14 · Aug 8, 2018 · Updated 7 years ago
dbklim / StressRNN
Modified version of RusStress (https://github.com/MashaPo/russtress), a Python package for placing stress in Russian text using RNN (BiLST…
☆45 · Aug 7, 2024 · Updated last year
david-ryan-snyder / kaldi
This is now the official location of the Kaldi project.
☆10 · Aug 22, 2019 · Updated 6 years ago
abuccts / wikt2pron
A Python toolkit converting pronunciation in enwiktionary XML dump to cmudict format
☆34 · Jul 5, 2019 · Updated 7 years ago
zhepeiw / cssl_sound
☆14 · Jan 17, 2023 · Updated 3 years ago
StanislavPetrovV / Python-Maze-Generator
Based on the Recursive Backtracker algorithm
☆11 · Oct 25, 2020 · Updated 5 years ago
valiotti / leftjoin
LEFTJOIN.ru public repository
☆23 · Dec 8, 2022 · Updated 3 years ago
superscriptjs / sfacts
Scripted fact system for SuperScript
☆11 · Sep 15, 2017 · Updated 8 years ago
sgherbst / pyhigh
Python library for accessing elevation data
☆24 · Sep 23, 2024 · Updated last year
aghasemi / ChronologicalPersianPoetryDataset
A chronological dataset (up to the century in which the poet lived) of Persian poetry, extracted from the brilliant Ganjoor database
☆18 · Jan 31, 2021 · Updated 5 years ago
eatsleepraverepeat / reMUDE
(re)Implementation of Learning Multi-level Dependencies for Robust Word Recognition
☆17 · Jul 25, 2024 · Updated 2 years ago
pythontoday / vkBot
☆11 · Dec 14, 2020 · Updated 5 years ago
Agisight / rf-keyboard-corpora
☆20 · Jun 28, 2026 · Updated 3 weeks ago
adamtuliper / VampKid3D
☆13 · Jun 30, 2015 · Updated 11 years ago
cmusphinx / pocketsphinx-ruby
Ruby speech recognition with Pocketsphinx
☆13 · May 14, 2015 · Updated 11 years ago
rakoo / goax
A pure-Go implementation of the Axolotl Ratchet, extracted from pond
☆21 · Feb 8, 2017 · Updated 9 years ago
TatianaShavrina / taiga_site
☆88 · Oct 19, 2022 · Updated 3 years ago
gzm55 / docker-vpn-client
VPN clients in Docker based on Alpine.
☆13 · Sep 18, 2019 · Updated 6 years ago
concordant / c-markdown-editor
A CRDT-based collaborative markdown editor.
☆16 · Sep 17, 2022 · Updated 3 years ago
ArtemF42 / let-it-go
☆19 · May 3, 2026 · Updated 2 months ago
jonsafari / perstem
Persian stemmer and morphological analyzer
☆19 · Mar 30, 2016 · Updated 10 years ago
locuslab / llava-token-compression
☆47 · Nov 8, 2024 · Updated last year
AhlemGit / Arabic-WordNet-To-SQLite
This repository is about how to build an SQLite version of the Arabic WordNet database.
☆11 · Mar 19, 2019 · Updated 7 years ago
kubowania / burger-app
For a Document API and ExpressJS Demo
☆11 · Sep 20, 2021 · Updated 4 years ago
Ubenwa / cryceleb2023
☆12 · Mar 18, 2024 · Updated 2 years ago
jonsafari / tok-tok
A fast, simple, multilingual tokenizer
☆29 · May 24, 2017 · Updated 9 years ago
yosssi / rendergold
Martini middleware/handler for parsing Gold templates and rendering HTML
☆15 · May 21, 2014 · Updated 12 years ago
ryanphung / chinese-hanviet-cognates
A Python notebook that outputs common Han Viet cognates for Chinese words.
☆27 · Oct 3, 2021 · Updated 4 years ago
libp2p / go-libp2p-record
Signed records for use with routing systems
☆26 · Nov 20, 2025 · Updated 8 months ago
NeilAlishev / TelegramBot
☆14 · Apr 21, 2021 · Updated 5 years ago
k2-fsa / kaldifst
Python wrapper for OpenFST and its extensions from Kaldi. Also supports reading/writing ark/scp files
☆56 · Apr 9, 2026 · Updated 3 months ago
fursovia / geometric_embedding
"Zero-Training Sentence Embedding via Orthogonal Basis" paper implementation
☆19 · Dec 23, 2018 · Updated 7 years ago
wynand1004 / 6502_Assembly_Simulator
A simple 6502 CPU simulator for students to use to learn assembly language
☆12 · Oct 17, 2017 · Updated 8 years ago