Efficient Trie-based regex unions for blacklist/whitelist filtering and one-pass mapping-based string replacing
☆76May 1, 2026Updated last month
Alternatives and similar repositories for retrie
Users that are interested in retrie are comparing it to the libraries listed below.
- A Python scraping module that extracts text from articles found in RSS feeds. Uses SQLite as its database.☆20Jul 5, 2024Updated last year
- A Python library for easily querying morphological inflection models trained on UniMorph☆13Oct 23, 2022Updated 3 years ago
- Basis of FragDenStaat.de's „Koalitionstracker“☆15Jul 14, 2025Updated 11 months ago
- Next-generation Punkt sentence boundary detection with zero dependencies☆31Nov 18, 2025Updated 7 months ago
- Thai translation of the book "Interpretable Machine Learning" by Christoph Molnar…☆15Oct 15, 2021Updated 4 years ago
- A public repository for corrupt0 datathon's court data☆11Jul 6, 2019Updated 6 years ago
- A Reddit bot that finds the original publication dates of linked articles.☆10Nov 30, 2024Updated last year
- ☯️ AllenNLP training configurations for promising models on Named Entity Recognition. (BiLSTM-CRF, BiLSTM-CNN-CRF, BERT, BERT-CRF)☆15Nov 26, 2020Updated 5 years ago
- Notes on papers in Natural Language Processing, Computational Linguistics, and the related sciences☆14Updated this week
- A companion repository to the "You Only Write Thrice: Creating Documents, Computational Notebooks and Presentations From a Single Source"…☆20Oct 14, 2022Updated 3 years ago
- The home repository of the NerKor corpus, a Hungarian gold standard named entity annotated corpus containing 1 million tokens.☆16Sep 20, 2023Updated 2 years ago
- Efficient string matching with regular expressions☆146Jun 18, 2026Updated last week
- Standalone Dictionary-based, Maximum Matching + Thai Character Cluster (newmm) tokenizer extracted from PyThaiNLP☆13Jan 6, 2022Updated 4 years ago
- Read fixed width data files with Python 3☆14Mar 20, 2026Updated 3 months ago
- Slides for an opinionated talk about what it means to be a senior software engineer☆15Jun 17, 2023Updated 3 years ago
- Automatically exported from code.google.com/p/hunpos☆12Apr 9, 2018Updated 8 years ago
- GEDCOM 7 parser for Python☆16Nov 29, 2025Updated 7 months ago
- ☆13Oct 20, 2022Updated 3 years ago
- Alternative robots parser module for Python☆22Jun 19, 2026Updated last week
- KL3M training data collection and preprocessing☆22Apr 14, 2025Updated last year
- Convert an imscc file to a folder with all the content with proper structure☆11Jul 4, 2016Updated 9 years ago
- A microservice for compiling *TeX files via HTTP☆13Mar 11, 2018Updated 8 years ago
- Flake8 checker for raw literals inside raises.☆17Jun 22, 2026Updated last week
- Lightweight pip dependency resolver with deptree preview functionality based on the PubGrub algorithm☆210May 1, 2026Updated 2 months ago
- Python bindings for the PCRE2 library created by Philip Hazel☆18Jun 17, 2026Updated 2 weeks ago
- ☆44Mar 9, 2026Updated 3 months ago
- Translation of query languages to serialized KoralQuery protocol☆15Jun 4, 2026Updated 3 weeks ago
- Starter repo for regl explorations☆10May 26, 2017Updated 9 years ago
- Project to enable keyword search in text files extracted by Querido Diário.☆14Jul 15, 2020Updated 5 years ago
- e-magyar text processing system -- inter-module communication via tsv + REST API☆31Aug 23, 2025Updated 10 months ago
- Allows manual adding and editing of time-tracking entries☆21May 18, 2021Updated 5 years ago
- The code used to create and update the Open Australian Legal Embeddings, the first open-source embeddings of Australian legislative and j…☆14Feb 17, 2024Updated 2 years ago
- Benchmark scripts for comparing different tokenizers and sentence segmenters of German☆12Feb 27, 2023Updated 3 years ago
- JSON encoding/decoding for Numpy arrays and scalars☆23Jun 30, 2025Updated last year
- Scan using a network scanner with eSCL protocol (e.g. Canon PIXMA)☆16Mar 26, 2020Updated 6 years ago
- MCP server for German legal texts☆46Dec 19, 2025Updated 6 months ago
- Hungarian morphological generator☆16Dec 12, 2025Updated 6 months ago
- A Directory of Online Newspaper Sources for 70+ Languages☆32Apr 15, 2021Updated 5 years ago
- A command line file encryption tool☆10Dec 1, 2019Updated 6 years ago