downloads and parses subtitle dataset from opensubtitles.org
β15Apr 19, 2024Updated 2 years ago
Alternatives and similar repositories for Opensubtitles_dataset
Users that are interested in Opensubtitles_dataset are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [LREC 2024] π Resource and Tool for Writing System Identificationβ21Mar 29, 2026Updated 2 months ago
- Downloads 2020 English Wikipedia articles as plaintextβ27Mar 25, 2023Updated 3 years ago
- URL downloader supporting checkpointing and continuous checksumming.β19Nov 29, 2023Updated 2 years ago
- Haskell phonology library.β10Jan 23, 2012Updated 14 years ago
- Trigram files for 500+ languagesβ24Mar 21, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI β’ AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- phone inventory libraryβ17May 15, 2023Updated 3 years ago
- Dataset of Canada goose images with annotations of bounding boxes with object classes, suitable for testing object detection algorithms.β40Aug 2, 2018Updated 7 years ago
- Simple migration engine for Peeweeβ19Updated this week
- Extensible DL-based automatic Arabic diacritization tool allowing the restoration of different types of diacritics.β22Jul 25, 2023Updated 2 years ago
- β95Jul 16, 2022Updated 3 years ago
- Demo code for learning_text_transformerβ25Feb 22, 2015Updated 11 years ago
- SemEval 2020 task 10 datasetsβ17Feb 19, 2020Updated 6 years ago
- The case study and multilingfual performance of ICASSP submissionβ24Sep 24, 2022Updated 3 years ago
- ICU based universal language tokenizerβ34Jan 19, 2022Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Wiktra - Python tool of Wiktionary Transliteration modules for 514 languages and its 102 different scripts (orthographies)β36Jun 29, 2025Updated 11 months ago
- Python tools for processing the stackexchange data dumps into a text dataset for Language Modelsβ87Dec 6, 2023Updated 2 years ago
- simple kv store for streamsβ36Mar 14, 2013Updated 13 years ago
- A Chinese version of A Neural Parametric Singing Synthesizerβ13Feb 12, 2022Updated 4 years ago
- MIDict (Multi-Index Dict) can be indexed by any "keys" or "values", suitable as a bidirectional/inverse dict or a multi-key/multi-value dβ¦β14May 19, 2016Updated 10 years ago
- Precise type-checker for JavaScriptβ11Oct 23, 2025Updated 7 months ago
- A utility to read and write PDFs with Pythonβ12Apr 28, 2022Updated 4 years ago
- statically generated weekly digest of articles read in Pocketβ10May 14, 2019Updated 7 years ago
- A stylesheet based on Richard Rutter's book Web Typography.β10Dec 6, 2018Updated 7 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer β’ AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories.β30Mar 14, 2025Updated last year
- Creating super-parallel corpora of more than 1500+ unique languages for NLP researchβ34Dec 8, 2022Updated 3 years ago
- β15Oct 4, 2024Updated last year
- These are lists for a variety of languages containing words that are distinctive to each language.β42Apr 5, 2022Updated 4 years ago
- The definitive collection of is* functions for runtime type checking. Lodash-compatible, tree-shakable, with types.β17Jan 25, 2025Updated last year
- Dockerized version of Google's SyntaxNet Parser and POS tagger for Russian + standalone server.β16May 4, 2017Updated 9 years ago
- β37Jun 28, 2021Updated 4 years ago
- Library for fast text representation and classification.β10Apr 17, 2022Updated 4 years ago
- A menu and CLI based console program to play and write songs for the PC Speakerβ15Aug 1, 2019Updated 6 years ago
- GPUs on demand by Runpod - Special Offer Available β’ AdRun AI, ML, and HPC workloads on powerful cloud GPUsβwithout limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- β10Nov 14, 2016Updated 9 years ago
- JavaScript port of SymSpell for Node.jsβ13Sep 30, 2022Updated 3 years ago
- β12Mar 31, 2020Updated 6 years ago
- Doing style transfer with linguistic features using OpenAI's CLIP.β14May 4, 2021Updated 5 years ago
- Add screenshot button to youtube.comβ15Jun 22, 2018Updated 7 years ago
- β165Mar 5, 2021Updated 5 years ago
- A Font with extensive coverage of Unicode13 as of March 2020 (part of Unicode Fonts for Ancient Scripts)β18Mar 26, 2020Updated 6 years ago