Most common sentences and words for all languages in the OpenSubtitles2018 corpus with Python code
☆63 · Feb 8, 2025 · Updated last year
Alternatives and similar repositories for top-open-subtitles-sentences
Users that are interested in top-open-subtitles-sentences are comparing it to the libraries listed below.
- temporary files created by opensubtitles-scraper ☆17 · Feb 3, 2026 · Updated 4 months ago
- Automatically load (Japanese) subtitles in MPV ☆13 · Jan 13, 2026 · Updated 4 months ago
- Word/n-gram frequency lists for the Google Books Ngram Corpus (v3, all languages) with Python code ☆108 · Aug 14, 2023 · Updated 2 years ago
- ☆13 · May 27, 2026 · Updated 2 weeks ago
- An Anki plugin to sort your new cards. ☆25 · Dec 16, 2025 · Updated 5 months ago
- ☆17 · Oct 27, 2025 · Updated 7 months ago
- A collection of fun and interesting words in English used in the Insanity Jam's Game Idea Generator ☆13 · Sep 8, 2022 · Updated 3 years ago
- NILC-Metrix gathers the metrics developed over more than a decade in NILC Lab. ☆18 · May 23, 2026 · Updated 2 weeks ago
- An implementation of figlet written in Python ☆14 · Sep 20, 2019 · Updated 6 years ago
- @DHRI-Curriculum Session on text analysis with NLTK, including discussion of cleaning data, creating text corpora, and analyzing texts pr… ☆11 · May 13, 2021 · Updated 5 years ago
- Material for the Text Analysis of Arabic course taught at the NYU Abu Dhabi Winter Institute in Digital Humanities 2020. ☆16 · Jan 30, 2020 · Updated 6 years ago
- Extract plain text from Arabic Wikipedia dumps. ☆13 · Jun 15, 2014 · Updated 11 years ago
- ☆14 · Oct 23, 2025 · Updated 7 months ago
- Masking Strategies for Background Bias Removal in Computer Vision Models (ICCVW OODCV 2023 paper) ☆16 · Jul 3, 2025 · Updated 11 months ago
- ☆19 · Jun 3, 2026 · Updated last week
- World CIDR IP lists ☆10 · Jan 28, 2026 · Updated 4 months ago
- The Arabic Error Type Annotation tool aims to annotate Arabic error types following the ALC tagset annotation. ☆12 · Oct 28, 2022 · Updated 3 years ago
- Annotation layer for PDF.js. Forked and modified from Submitty's branch. ☆18 · Nov 9, 2022 · Updated 3 years ago
- This script lets you scrape the shodan.io IoT search engine and get device IPs without using your search or download credits! ☆12 · May 26, 2021 · Updated 5 years ago
- Official Code for NAACL 2022 paper: "Persona-Guided Planning for Controlling the Protagonist's Persona in Story Generation" ☆16 · Sep 1, 2022 · Updated 3 years ago
- Hebrew Diacritizer ☆49 · May 20, 2026 · Updated 3 weeks ago
- An SIA industry-collaboration project that uses deep learning (Semantic Segmentation, Instance Segmentation) to detect buildings and roads in satellite imagery. ☆10 · Jun 20, 2022 · Updated 3 years ago
- Khmer Character Specification ☆27 · Mar 14, 2025 · Updated last year
- CLI tool for discovering related base domains using WhoisXMLAPI's reverse Whois endpoints ☆12 · Jun 15, 2024 · Updated last year
- generates unique subdomain names and runs httpx on them ☆17 · Apr 8, 2024 · Updated 2 years ago
- 6-DOF nonlinear dynamic model (primarily for aircraft) ☆10 · Nov 16, 2021 · Updated 4 years ago
- Train YOLO object detection model to find traffic signs in the images. Use OCR pipeline to extract the information from the signs with te… ☆13 · Dec 26, 2020 · Updated 5 years ago
- Workshop 8 - Generalized additive models (GAMs) ☆14 · Sep 3, 2024 · Updated last year
- Tools for calculating psycholinguistically-relevant metrics of language statistics using transformer language models ☆13 · Nov 11, 2022 · Updated 3 years ago
- A bot that makes "bracket memes". ☆16 · Oct 29, 2023 · Updated 2 years ago
- Fire-AV is a collection of lists that you can use to block av providers and bad ips ☆24 · Updated this week
- Materials for "Prompting is not a substitute for probability measurements in large language models" (EMNLP 2023) ☆24 · Oct 24, 2023 · Updated 2 years ago
- Global ASP - African Storybook Project for the World ☆18 · Dec 1, 2025 · Updated 6 months ago
- ☆18 · Sep 10, 2021 · Updated 4 years ago
- A complete Dockerized Django project that is orchestrated by docker-compose that also includes some extra services such as Postgres and R… ☆16 · Mar 16, 2026 · Updated 2 months ago
- A simple bug bounty utility tool to remove uninteresting entries from a list of URLs. ☆13 · Jul 22, 2024 · Updated last year
- simple kv store for streams ☆36 · Mar 14, 2013 · Updated 13 years ago
- Needleman-Wunsch and Hirschberg algorithms ☆13 · Aug 26, 2020 · Updated 5 years ago
- Zurich Morphological Lexicon for German: a tool to extract a morphological lexicon from Wiktionary ☆12 · Aug 10, 2023 · Updated 2 years ago