ddelange / retrie
Efficient Trie-based regex unions for blacklist/whitelist filtering and one-pass mapping-based string replacing
☆76 · Jan 22, 2026 · Updated 3 weeks ago
Alternatives and similar repositories for retrie
Users who are interested in retrie are comparing it to the libraries listed below.
- A Python scraping module that extracts text from articles found in RSS feeds. Uses SQLite as its database. ☆20 · Jul 5, 2024 · Updated last year
- Basis of FragDenStaat.de's „Koalitionstracker“ ☆15 · Jul 14, 2025 · Updated 6 months ago
- Next-generation Punkt sentence boundary detection with zero dependencies ☆27 · Nov 18, 2025 · Updated 2 months ago
- A Python library for easily querying morphological inflection models trained on Unimorph ☆13 · Oct 23, 2022 · Updated 3 years ago
- หนังสือ "Interpretable Machine Learning" โดย Christoph Molnar ฉบับแปลภาษาไทย / Thai translation of "Interpretable Machine Learning" book… ☆15 · Oct 15, 2021 · Updated 4 years ago
- Thai PDPA Website (Unofficial) ☆11 · Jun 10, 2023 · Updated 2 years ago
- The home repository of the NerKor corpus, a Hungarian gold standard named entity annotated corpus containing 1 million tokens. ☆16 · Sep 20, 2023 · Updated 2 years ago
- Read fixed width data files with Python 3 ☆14 · Nov 21, 2022 · Updated 3 years ago
- Notes on papers in Natural Language Processing, Computational Linguistics, and the related sciences ☆14 · Feb 4, 2026 · Updated last week
- Standalone Dictionary-based, Maximum Matching + Thai Character Cluster (newmm) tokenizer extracted from PyThaiNLP ☆13 · Jan 6, 2022 · Updated 4 years ago
- ☯️ AllenNLP training configurations for promising models on Named Entity Recognition. (BiLSTM-CRF, BiLSTM-CNN-CRF, BERT, BERT-CRF) ☆15 · Nov 26, 2020 · Updated 5 years ago
- Plot charts from arbtt-stats to terminal ☆17 · Jun 16, 2024 · Updated last year
- 📔️ Generate a text-based journal from a template file. ☆21 · Mar 16, 2021 · Updated 4 years ago
- Searching in-memory corpus with Corpus Query Language (CQL) ☆19 · Dec 2, 2024 · Updated last year
- Alternative robots parser module for Python ☆20 · Jan 24, 2026 · Updated 3 weeks ago
- A scikit-learn style implementation of NBSVM ☆17 · Feb 12, 2016 · Updated 10 years ago
- Free Pull Request reminder for GitHub. Has configurations to post reminders to Slack and email, along with Jinja templating ☆22 · Dec 8, 2022 · Updated 3 years ago
- Links to export personal data from popular internet services ☆22 · Feb 4, 2024 · Updated 2 years ago
- A thin wrapper around the DBpedia Spotlight HTTP API ☆25 · Dec 2, 2017 · Updated 8 years ago
- Data anonymizer for Django ☆28 · Feb 7, 2026 · Updated last week
- e-magyar text processing system -- inter-module communication via tsv + REST API ☆31 · Aug 23, 2025 · Updated 5 months ago
- Processing the MPQA Corpus ☆27 · Sep 22, 2018 · Updated 7 years ago
- Checklist for talk proposals for Python Brasil ☆26 · Apr 1, 2019 · Updated 6 years ago
- A Directory of Online Newspaper Sources for 70+ Languages ☆31 · Apr 15, 2021 · Updated 4 years ago
- Explainable AI for Software Engineering: A Hands-on Guide on How to Make Software Analytics More Practical, Explainable, and Actionable (… ☆27 · Nov 14, 2021 · Updated 4 years ago
- Check what PyPI dependencies changed and when. ☆30 · Feb 2, 2026 · Updated last week
- 💫 A spaCy package for Yohei Tamura's Rust tokenizations library ☆34 · Jun 3, 2025 · Updated 8 months ago
- Calculate the memory footprint of Python objects ☆29 · Aug 19, 2017 · Updated 8 years ago
- Joint sentence classification in TensorFlow ☆33 · Sep 14, 2018 · Updated 7 years ago
- Wikidata Live Changes - Group Project - 2020 ☆10 · Apr 23, 2024 · Updated last year
- Southeast Asian layout task force ☆36 · May 31, 2025 · Updated 8 months ago
- Code accompanying the COLING 2020 publication on data augmentation for named entity recognition ☆34 · Aug 4, 2021 · Updated 4 years ago
- Shows top suspects for memory leaks in your Python program. ☆82 · Jul 13, 2022 · Updated 3 years ago
- An initiative for Bangkokians to develop contributable open-source projects to solve local problems! ☆38 · Feb 1, 2023 · Updated 3 years ago
- A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidata ☆170 · Nov 7, 2022 · Updated 3 years ago
- ☆40 · Feb 1, 2023 · Updated 3 years ago
- Aerial images sourced from the FIS-Broker, City of Berlin. ☆12 · Nov 10, 2025 · Updated 3 months ago
- Hungarian tokenizer. ☆14 · Mar 15, 2022 · Updated 3 years ago
- Notes and samples for Python performance talk ☆10 · Feb 17, 2022 · Updated 3 years ago