downloads and parses subtitle dataset from opensubtitles.org
ā15Apr 19, 2024Updated last year
Alternatives and similar repositories for Opensubtitles_dataset
Users that are interested in Opensubtitles_dataset are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Download, parse, and filter data from Court Listener, part of the FreeLaw projects. Data-ready for The-Pile.ā15Jun 3, 2023Updated 2 years ago
- š Resource and Tool for Writing System Identification (Unicode 17.0) -- LREC 2024ā21Mar 29, 2026Updated 2 weeks ago
- Downloads 2020 English Wikipedia articles as plaintextā27Mar 25, 2023Updated 3 years ago
- URL downloader supporting checkpointing and continuous checksumming.ā19Nov 29, 2023Updated 2 years ago
- Haskell phonology library.ā10Jan 23, 2012Updated 14 years ago
- Wordpress hosting with auto-scaling - Free Trial ⢠AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Trigram files for 500+ languagesā25Mar 21, 2025Updated last year
- StyleGAN2 - Official TensorFlow Implementationā12Jul 15, 2020Updated 5 years ago
- phone inventory libraryā17May 15, 2023Updated 2 years ago
- A TinyStories LM with SAEs and transcodersā14Apr 3, 2025Updated last year
- Extensible DL-based automatic Arabic diacritization tool allowing the restoration of different types of diacritics.ā22Jul 25, 2023Updated 2 years ago
- ā95Jul 16, 2022Updated 3 years ago
- ā21Oct 20, 2022Updated 3 years ago
- An extension of thu-spmi/CAT which contains a full-fledged implementation of CTC-CRF for Tensorflow.ā12Jul 5, 2021Updated 4 years ago
- SemEval 2020 task 10 datasetsā17Feb 19, 2020Updated 6 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits ⢠AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Helper to use Plotly in SvelteKitā18Jul 12, 2022Updated 3 years ago
- The case study and multilingfual performance of ICASSP submissionā24Sep 24, 2022Updated 3 years ago
- ICU based universal language tokenizerā34Jan 19, 2022Updated 4 years ago
- StyleGAN2 - Official TensorFlow Implementationā25Sep 5, 2020Updated 5 years ago
- Wiktra - Python tool of Wiktionary Transliteration modules for 514 languages and its 102 different scripts (orthographies)ā36Jun 29, 2025Updated 9 months ago
- Visual Hash for matching copies of visually similar images.ā16Mar 17, 2025Updated last year
- A Chinese version of A Neural Parametric Singing Synthesizerā13Feb 12, 2022Updated 4 years ago
- MIDict (Multi-Index Dict) can be indexed by any "keys" or "values", suitable as a bidirectional/inverse dict or a multi-key/multi-value dā¦ā14May 19, 2016Updated 9 years ago
- Precise type-checker for JavaScriptā11Oct 23, 2025Updated 5 months ago
- Wordpress hosting with auto-scaling - Free Trial ⢠AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Myanmar and Thai Language Resourcesā10Jul 18, 2022Updated 3 years ago
- A nuxt module to expose Vuex state in the browser URL for easy sharingā12Aug 28, 2017Updated 8 years ago
- A stylesheet based on Richard Rutter's book Web Typography.ā10Dec 6, 2018Updated 7 years ago
- Learned string similarity for entity names using optimal transport.ā35Nov 17, 2020Updated 5 years ago
- A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories.ā29Mar 14, 2025Updated last year
- ā15Oct 4, 2024Updated last year
- Python wrapper for Google's syntaxnetā15Apr 8, 2019Updated 7 years ago
- The definitive collection of is* functions for runtime type checking. Lodash-compatible, tree-shakable, with types.ā17Jan 25, 2025Updated last year
- Dockerized version of Google's SyntaxNet Parser and POS tagger for Russian + standalone server.ā16May 4, 2017Updated 8 years ago
- Wordpress hosting with auto-scaling - Free Trial ⢠AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ā37Jun 28, 2021Updated 4 years ago
- Library for fast text representation and classification.ā10Apr 17, 2022Updated 4 years ago
- šø GlotWeb: Web Indexing for Minority Languages (WWW 2026)ā17Feb 27, 2026Updated last month
- A menu and CLI based console program to play and write songs for the PC Speakerā15Aug 1, 2019Updated 6 years ago
- Rababa, the diacritization library for Arabic and Hebrew (Abjad scripts in general)ā13May 1, 2025Updated 11 months ago
- Character-level conversion between Hebrew text and Latin transliteration using deep learning - a demonstration of seq2seq training.ā14Jun 27, 2023Updated 2 years ago
- ā10Nov 14, 2016Updated 9 years ago