orgtre/top-open-subtitles-sentences

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/orgtre/top-open-subtitles-sentences)

orgtre / top-open-subtitles-sentences

Most common sentences and words for all languages in the OpenSubtitles2018 corpus with Python code

☆63

Alternatives and similar repositories for top-open-subtitles-sentences

Users that are interested in top-open-subtitles-sentences are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

olastor / german-word-frequencies
View on GitHub
Simple word to frequency mappings for the german language based on text corpora and using CISTEM stemmer.
☆14Apr 3, 2021Updated 5 years ago
frekwencja / most-common-words-multilingual
View on GitHub
🏆 • 5050 most frequent words in 109 languages
☆57Dec 8, 2022Updated 3 years ago
KendoClaw1 / Open-Redirection-Scanner
View on GitHub
a python tool used to scan for Open redirection vulnerability
☆20Dec 29, 2017Updated 8 years ago
Loubaris / scrapeaw
View on GitHub
ScrapeAW is a framework that without API scrape IPs across the world using Shodan
☆11May 16, 2024Updated 2 years ago
milahu / opensubtitles-scraper-new-subs
View on GitHub
temporary files created by opensubtitles-scraper
☆18Feb 3, 2026Updated 5 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
jruipinto / ImageMagick-action
View on GitHub
A GitHub action to auto optimize uploaded images using ImageMagick
☆10Apr 28, 2024Updated 2 years ago
jxnl / resume-nextjs
View on GitHub
☆10Jan 16, 2024Updated 2 years ago
bnosac / nametagger
View on GitHub
Named Entity Recognition with the Nametag Maximum Entropy Markov model
☆12Feb 9, 2026Updated 5 months ago
zphw / dns-cache-poisoning-demo
View on GitHub
An isolated environment for DNS cache poisoning attack investigation and demonstration.
☆10Nov 22, 2020Updated 5 years ago
DHRI-Curriculum / text-analysis
View on GitHub
@DHRI-Curriculum Session on text analysis with NLTK, including discussion of cleaning data, creating text corpora, and analyzing texts pr…
☆11May 13, 2021Updated 5 years ago
avalanchesiqi / networked-popularity
View on GitHub
Code and Data for paper: Estimating Attention Flow in Online Video Networks (CSCW '19)
☆12Nov 19, 2019Updated 6 years ago
maimemo / MaiMemoSimulator
View on GitHub
☆11Apr 20, 2023Updated 3 years ago
Alamantus / fun-word-list
View on GitHub
A collection of fun and interesting words in English used in the Insanity Jam's Game Idea Generator
☆13Sep 8, 2022Updated 3 years ago
franklindyer / BWA
View on GitHub
☆14Oct 23, 2025Updated 8 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
parolteknologio / stt-esperanto
View on GitHub
Deepspeech/Coqui AI speech to text systems in Esperanto. - Parolrekoniloj en Esperanto uzante Deepspeech/Coqui Ai.
☆10Jan 11, 2022Updated 4 years ago
fbennett / legal-resource-registry
View on GitHub
The Legal Resource Registry has moved!
☆18Aug 21, 2019Updated 6 years ago
trevorld / Hanzi_Stats
View on GitHub
Anki plugin which calculates number of Hanzi you have learned so far.
☆18May 9, 2026Updated 2 months ago
dynamotn / stardict-vi
View on GitHub
Some Vietnamese dictionaries for StarDict, GoldenDict... from OVDP (Open Vietnamese Dictionary Project)
☆49Oct 26, 2022Updated 3 years ago
Zhiyu-Lei / Traffic-Sign-Detection-and-Information-Extraction
View on GitHub
Train YOLO object detection model to find traffic signs in the images. Use OCR pipeline to extract the information from the signs with te…
☆13Dec 26, 2020Updated 5 years ago
elazarg / nakdimon
View on GitHub
Hebrew Diacritizer
☆48Jul 10, 2026Updated last week
mush42 / mantoq
View on GitHub
Arabic Grapheme-to-Phoneme (G2P) Conversion
☆16Mar 15, 2025Updated last year
sillsdev / khmer-character-specification
View on GitHub
Khmer Character Specification
☆27Mar 14, 2025Updated last year
mateusz1913 / tauri-and-expo-research
View on GitHub
☆21Aug 3, 2022Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Madoshakalaka / English-IPA
View on GitHub
English to IPA with syllable correspondence
☆13Aug 23, 2022Updated 3 years ago
c3l3si4n / revwhois
View on GitHub
CLI tool for discovering related base domains using WhoisXMLAPI's reverse Whois endpoints
☆12Jun 15, 2024Updated 2 years ago
adhadse / Deepdubpy
View on GitHub
A complete end-to-end Deep Learning system to generate high quality human like speech in English for Korean Drama (WIP)
☆13Sep 17, 2022Updated 3 years ago
verenablaschke / kindle-dict
View on GitHub
Create a Kindle dictionary from dict.cc data (specifically Norwegian/Bokmål 🇳🇴 -> German 🇩🇪).
☆11Oct 31, 2018Updated 7 years ago
woobe / rApps
View on GitHub
Repository for my R (Shiny) web applications.
☆22Aug 29, 2014Updated 11 years ago
byuflowlab / SixDOF.jl
View on GitHub
6-DOF nonlinear dynamic model (primarily for aircraft)
☆10Nov 16, 2021Updated 4 years ago
CAMeL-Lab / WIDH_2020_Arabic_Text_Analysis
View on GitHub
Material for the Text Analysis of Arabic course taught at the NYU Abu Dhabi Winter Institute in Digital Humanities 2020.
☆16Jan 30, 2020Updated 6 years ago
moblig / Trickest
View on GitHub
Custom Trickest Workflows
☆12Oct 26, 2023Updated 2 years ago
QCBSRworkshops / workshop08
View on GitHub
Workshop 8 - Generalized additive models (GAMs)
☆14Sep 3, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
kbatsuren / wiktra
View on GitHub
Wiktra - Python tool of Wiktionary Transliteration modules for 514 languages and its 102 different scripts (orthographies)
☆37Jun 29, 2025Updated last year
GoFigure-LANL / VisHash
View on GitHub
Visual Hash for matching copies of visually similar images.
☆16Mar 17, 2025Updated last year
Ajatt-Tools / sub-transition
View on GitHub
🍩 Speed up the video if no subtitles are visible.
☆29Jan 4, 2024Updated 2 years ago
ropensci-archive / wellknown
View on GitHub
ARCHIVED WKT <-> GeoJSON
☆17Mar 30, 2023Updated 3 years ago
laurentlb / lingostories
View on GitHub
Free interactive stories for language learners
☆24May 1, 2026Updated 2 months ago
hms-dbmi / UpSetR-shiny
View on GitHub
A Shiny wrapper for the UpSetR R package (https://github.com/hms-dbmi/UpSetR).
☆21Aug 28, 2017Updated 8 years ago
urlquery / urlquery-cli
View on GitHub
A simple tool for interacting with urlquery.net from the command line
☆15Jul 5, 2025Updated last year