Efficient Trie-based regex unions for blacklist/whitelist filtering and one-pass mapping-based string replacing
☆76Jan 22, 2026Updated last month
Alternatives and similar repositories for retrie
Users that are interested in retrie are comparing it to the libraries listed below
Sorting:
- Basis of FragDenStaat.de's „Koalitionstracker“☆15Jul 14, 2025Updated 7 months ago
- Next-generation Punkt sentence boundary detection with zero dependencies☆29Nov 18, 2025Updated 3 months ago
- MDLText☆12Jul 13, 2017Updated 8 years ago
- A python library for easily querying morphological inflection models trained on Unimorph☆13Oct 23, 2022Updated 3 years ago
- Google Tink's critical Ed25519 bug related to Java "final" keyword☆11Apr 5, 2020Updated 5 years ago
- A reddit bot that finds original publish dates on linked articles.☆10Nov 30, 2024Updated last year
- A companion repository to the "You Only Write Thrice: Creating Documents, Computational Notebooks and Presentations From a Single Source"…☆20Oct 14, 2022Updated 3 years ago
- หนังสือ "Interpretable Machine Learning" โดย Christoph Molnar ฉบับแปลภาษาไทย / Thai translation of "Interpretable Machine Learning" book…☆15Oct 15, 2021Updated 4 years ago
- Thai PDPA Website (Unofficial)☆11Jun 10, 2023Updated 2 years ago
- Tools for compiling corpora from Common Crawl☆14Nov 24, 2024Updated last year
- Read fixed width data files with Python 3☆14Nov 21, 2022Updated 3 years ago
- Standalone Dictionary-based, Maximum Matching + Thai Character Cluster (newmm) tokenizer extracted from PyThaiNLP☆13Jan 6, 2022Updated 4 years ago
- Slides for an opinionated talk about what it means to be a senior software engineer☆15Jun 17, 2023Updated 2 years ago
- Notes on papers in Natural Language Processing, Computational Linguistics, and the related sciences☆14Updated this week
- Plot charts from arbtt-stats to terminal☆17Jun 16, 2024Updated last year
- 📔️ Generate a text-based journal from a template file.☆21Mar 16, 2021Updated 4 years ago
- Searching in-memory corpus with Corpus Query Language (CQL)☆19Dec 2, 2024Updated last year
- Alternative robots parser module for Python☆21Updated this week
- A scikit-learn style implementation of NBSVM☆17Feb 12, 2016Updated 10 years ago
- Links to export personal data from popular internet services☆22Feb 4, 2024Updated 2 years ago
- Free Pull Request reminder for Github. Has configurations to post reminders to Slack and email along with jinja templating☆23Dec 8, 2022Updated 3 years ago
- Vinta's ESLint and Prettier shareable configs.☆23Feb 19, 2024Updated 2 years ago
- "Learning What is Essential in Questions", CoNLL, 2017☆26Aug 3, 2018Updated 7 years ago
- e-magyar text processing system -- inter-module communication via tsv + REST API☆31Aug 23, 2025Updated 6 months ago
- ChatGPT with access to the internet☆26Jun 16, 2023Updated 2 years ago
- Pure Rust port of CRFsuite: a fast implementation of Conditional Random Fields (CRFs)☆30Updated this week
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency☆186Jun 6, 2025Updated 9 months ago
- 🔢 Work with static vector models☆37Apr 21, 2025Updated 10 months ago
- Checklist para propostas de palestras para Python Brasil☆26Apr 1, 2019Updated 6 years ago
- A Directory of Online Newspaper Sources for 70+ Languages☆31Apr 15, 2021Updated 4 years ago
- Explainable AI for Software Engineering: A Hands-on Guide on How to Make Software Analytics More Practical, Explainable, and Actionable (…☆27Nov 14, 2021Updated 4 years ago
- Find your broken links, so users don't.☆66Dec 1, 2025Updated 3 months ago
- Small-vocabulary neural sequence-to-sequence generation with optional feature conditioning☆36Feb 28, 2026Updated last week
- ☆12Apr 23, 2018Updated 7 years ago
- Code base for the practitioner's guide to the ONC algorithm paper published with the Journal of Financial Data Science☆20Jun 8, 2023Updated 2 years ago
- 💫 A spaCy package for Yohei Tamura's Rust tokenizations library☆35Jun 3, 2025Updated 9 months ago
- calculate memory footprint of python objects☆29Aug 19, 2017Updated 8 years ago
- Joint sentence classification-Tensorflow☆33Sep 14, 2018Updated 7 years ago
- Wikidata Live Changes - Group Project - 2020☆10Apr 23, 2024Updated last year