Most common sentences and words for all languages in the OpenSubtitles2018 corpus with Python code
☆61Feb 8, 2025Updated last year
Alternatives and similar repositories for top-open-subtitles-sentences
Users that are interested in top-open-subtitles-sentences are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- temporary files created by opensubtitles-scraper☆17Feb 3, 2026Updated 2 months ago
- NGRAMS is a search engine for the Google Books Ngram Dataset. This repository contains documentation, discussions, announcements, and iss…☆24Dec 31, 2025Updated 4 months ago
- Word/n-gram frequency lists for the Google Books Ngram Corpus (v3, all languages) with Python code☆105Aug 14, 2023Updated 2 years ago
- LibreTranslate in golang☆28May 13, 2021Updated 4 years ago
- ☆13Mar 10, 2026Updated last month
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Practical example from Human-in-the-Loop Machine Learning book☆11Oct 28, 2021Updated 4 years ago
- An isolated environment for DNS cache poisoning attack investigation and demonstration.☆10Nov 22, 2020Updated 5 years ago
- An advanced tracking plug-in for reveal.js for purposes like Learning Analytics☆11Jan 18, 2023Updated 3 years ago
- A Terminal.app for ravynOS☆20Oct 25, 2025Updated 6 months ago
- The largest open source arabic words list☆16Oct 18, 2021Updated 4 years ago
- A collection of fun and interesting words in English used in the Insanity Jam's Game Idea Generator☆13Sep 8, 2022Updated 3 years ago
- NILC-Metrix gathers the metrics developed over more than a decade in NILC Lab.☆15Feb 23, 2026Updated 2 months ago
- Material for the Text Analysis of Arabic course taught at the NYU Abu Dhabi Winter Institute in Digital Humanities 2020.☆15Jan 30, 2020Updated 6 years ago
- Library of Intuitive Ordinal Notations (IONs)☆11May 8, 2020Updated 5 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Creating super-parallel corpora of more than 1500+ unique languages for NLP research☆34Dec 8, 2022Updated 3 years ago
- Beautiful animated SVG or GIF kanji from KanjiVG data set.☆73Jul 16, 2016Updated 9 years ago
- ☆19Sep 4, 2025Updated 7 months ago
- Haskell interval collections☆17May 5, 2025Updated 11 months ago
- The Arabic Error Type Annotation tool aims to annotate Arabic error types following the ALC tagset annotation.☆12Oct 28, 2022Updated 3 years ago
- Annotation layer for PDF.js. Forked and modified from Submitty's branch.☆18Nov 9, 2022Updated 3 years ago
- Rust library for access to the JMdict☆16Mar 16, 2024Updated 2 years ago
- CLI tool for discovering related base domains using WhoisXMLAPI's reverse Whois endpoints☆12Jun 15, 2024Updated last year
- ☆25Aug 19, 2025Updated 8 months ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Create a Kindle dictionary from dict.cc data (specifically Norwegian/Bokmål 🇳🇴 -> German 🇩🇪).☆11Oct 31, 2018Updated 7 years ago
- Train YOLO object detection model to find traffic signs in the images. Use OCR pipeline to extract the information from the signs with te…☆12Dec 26, 2020Updated 5 years ago
- Workshop 8 - Generalized additive models (GAMs)☆14Sep 3, 2024Updated last year
- Wiktra - Python tool of Wiktionary Transliteration modules for 514 languages and its 102 different scripts (orthographies)☆36Jun 29, 2025Updated 10 months ago
- Visual Hash for matching copies of visually similar images.☆16Mar 17, 2025Updated last year
- Unveiling Cyber Threats: From assets to Vulnerability Insights☆17Oct 22, 2024Updated last year
- 间隔重复模拟器☆20May 22, 2025Updated 11 months ago
- Official repo for the NCR Crypto Meetup☆17Jun 1, 2022Updated 3 years ago
- Materials for "Prompting is not a substitute for probability measurements in large language models" (EMNLP 2023)☆24Oct 24, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Global ASP - African Storybook Project for the World☆18Dec 1, 2025Updated 4 months ago
- simple kv store for streams☆36Mar 14, 2013Updated 13 years ago
- Needleman-Wunsch and Hirschberg algorithms☆13Aug 26, 2020Updated 5 years ago
- A BugBounty playbook covering vulnerability bypasses, payloads, and quick checks for OWASP Top 10 + extras.☆23Sep 29, 2025Updated 7 months ago
- A Chrome extension that automatically scans web pages and internal links for user-defined keywords, storing results and sending notificat…☆25Sep 28, 2025Updated 7 months ago
- A LOGO Turtle library for Processing.☆13Apr 26, 2019Updated 7 years ago
- ☆15Aug 30, 2021Updated 4 years ago