downloads and parses subtitle dataset from opensubtitles.org
☆15Apr 19, 2024Updated last year
Alternatives and similar repositories for Opensubtitles_dataset
Users that are interested in Opensubtitles_dataset are comparing it to the libraries listed below
Sorting:
- 🖋 Resource and Tool for Writing System Identification (Unicode 17.0) -- LREC 2024☆21Feb 17, 2026Updated 2 weeks ago
- Extensible DL-based automatic Arabic diacritization tool allowing the restoration of different types of diacritics.☆21Jul 25, 2023Updated 2 years ago
- Trigram files for 500+ languages☆25Mar 21, 2025Updated 11 months ago
- Downloads 2020 English Wikipedia articles as plaintext☆27Mar 25, 2023Updated 2 years ago
- Wiktra - Python tool of Wiktionary Transliteration modules for 514 languages and its 102 different scripts (orthographies)☆34Jun 29, 2025Updated 8 months ago
- Creating super-parallel corpora of more than 1500+ unique languages for NLP research☆34Dec 8, 2022Updated 3 years ago
- With Slidable you can swipe any widget to perform an action on swipe in your Flutter app.☆11Nov 24, 2020Updated 5 years ago
- ☆94Jul 16, 2022Updated 3 years ago
- ☆13Jan 25, 2021Updated 5 years ago
- Search for AppImage releases over the web.☆12Oct 25, 2018Updated 7 years ago
- Precise type-checker for JavaScript☆11Oct 23, 2025Updated 4 months ago
- A stylesheet based on Richard Rutter's book Web Typography.☆10Dec 6, 2018Updated 7 years ago
- Library for fast text representation and classification.☆10Apr 17, 2022Updated 3 years ago
- Qubes OS + Borg + Rsync.net backup strategy scripts☆16Aug 3, 2017Updated 8 years ago
- A Font with extensive coverage of Unicode13 as of March 2020 (part of Unicode Fonts for Ancient Scripts)☆16Mar 26, 2020Updated 5 years ago
- Python asynchronous library for web scrapping☆10Aug 24, 2021Updated 4 years ago
- Simple CLI frontend for flashcards-core☆12Jul 30, 2021Updated 4 years ago
- Rababa, the diacritization library for Arabic and Hebrew (Abjad scripts in general)☆13May 1, 2025Updated 10 months ago
- Tunisian Arabish Corpus☆12Mar 12, 2024Updated last year
- Myanmar and Thai Language Resources☆10Jul 18, 2022Updated 3 years ago
- A parallel corpus of Sorani, Kurmanji and English☆15Oct 6, 2020Updated 5 years ago
- ☆10Mar 24, 2021Updated 4 years ago
- An abstract, safe, and concise color conversion library for rust nightly This requires the feature adt_const_params☆12Nov 18, 2022Updated 3 years ago
- No-nonsense simple transliteration between writing systems, mostly of Semitic origin☆13Jun 29, 2025Updated 8 months ago
- Download, parse, and filter data from Court Listener, part of the FreeLaw projects. Data-ready for The-Pile.☆15Jun 3, 2023Updated 2 years ago
- A text file containing English words, along with the definition, parts of speech (noun,verb,adjective,etc.), and a link to the url where …☆13Apr 27, 2024Updated last year
- Course Repository for Udemy Course: xxxx☆13Dec 7, 2024Updated last year
- An reddit bot that displays useful GPA/course information in response to reddit topics/comments☆11Oct 5, 2025Updated 5 months ago
- ☆12Oct 10, 2020Updated 5 years ago
- ☆13Jan 29, 2024Updated 2 years ago
- simple kv store for streams☆36Mar 14, 2013Updated 12 years ago
- Visual Hash for matching copies of visually similar images.☆16Mar 17, 2025Updated 11 months ago
- Collection of swadesh lists in CSV table format with possible connections to Indo European☆14Aug 31, 2025Updated 6 months ago
- A menu and CLI based console program to play and write songs for the PC Speaker☆15Aug 1, 2019Updated 6 years ago
- I Wish To ... a command line magic tool using LLM (via OpenAI API)☆12Jul 17, 2023Updated 2 years ago
- My own rendition of pihole on docker swarm☆11Aug 1, 2024Updated last year
- JavaScript port of SymSpell for Node.js☆13Sep 30, 2022Updated 3 years ago
- Opinionated boilerplate supported by Lambda Apollo Server, Mongoose, Next.js, React Apollo and emotion☆18Jan 3, 2023Updated 3 years ago
- ☆12Aug 24, 2021Updated 4 years ago