Efficient Trie-based regex unions for blacklist/whitelist filtering and one-pass mapping-based string replacing
☆76May 1, 2026Updated last month
Alternatives and similar repositories for retrie
Users that are interested in retrie are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A Python scraping module, that extracts text from articles found in RSS feeds. Uses SQLite as database.☆20Jul 5, 2024Updated last year
- A python library for easily querying morphological inflection models trained on Unimorph☆13Oct 23, 2022Updated 3 years ago
- Basis of FragDenStaat.de's „Koalitionstracker“☆15Jul 14, 2025Updated 10 months ago
- Next-generation Punkt sentence boundary detection with zero dependencies☆31Nov 18, 2025Updated 6 months ago
- Google Tink's critical Ed25519 bug related to Java "final" keyword☆11Apr 5, 2020Updated 6 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- หนังสือ "Interpretable Machine Learning" โดย Christoph Molnar ฉบับแปลภาษาไทย / Thai translation of "Interpretable Machine Learning" book…☆15Oct 15, 2021Updated 4 years ago
- A public repository for corrupt0 datathon's court data☆11Jul 6, 2019Updated 6 years ago
- A reddit bot that finds original publish dates on linked articles.☆10Nov 30, 2024Updated last year
- ☯️ AllenNLP training configurations for promising models on Named Entity Recognition. (BiLSTM-CRF, BiLSTM-CNN-CRF, BERT, BERT-CRF)☆15Nov 26, 2020Updated 5 years ago
- Notes on papers in Natural Language Processing, Computational Linguistics, and the related sciences☆14May 28, 2026Updated last week
- Tools for compiling corpora from Common Crawl☆14Nov 24, 2024Updated last year
- Thai PDPA Website (Unofficial)☆11Jun 10, 2023Updated 3 years ago
- MDLText☆12Jul 13, 2017Updated 8 years ago
- Efficient string matching with regular expressions☆146Updated this week
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Check to see if an SDist matches Git☆12Jun 1, 2026Updated last week
- Standalone Dictionary-based, Maximum Matching + Thai Character Cluster (newmm) tokenizer extracted from PyThaiNLP☆13Jan 6, 2022Updated 4 years ago
- Plot charts from arbtt-stats to terminal☆17Jun 16, 2024Updated last year
- Code and dataset for the EMNLP 2024 paper: GoldCoin: Grounding Large Language Models in Privacy Laws via Contextual Integrity Theory☆51Sep 26, 2024Updated last year
- 📔️ Generate a text-based journal from a template file.☆21Mar 16, 2021Updated 5 years ago
- Slides for an opinionated talk about what it means to be a senior software engineer☆15Jun 17, 2023Updated 2 years ago
- Automatically exported from code.google.com/p/hunpos☆12Apr 9, 2018Updated 8 years ago
- GEDCOM 7 parser for Python☆15Nov 29, 2025Updated 6 months ago
- Alternative robots parser module for Python☆22Apr 8, 2026Updated 2 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- KL3M training data collection and preprocessing☆22Apr 14, 2025Updated last year
- Convert an imscc file to a folder with all the content with proper structure☆11Jul 4, 2016Updated 9 years ago
- Source code accompanying the ICLR2020 publication 'Massively Multilingual Sparse Word Representations' https://openreview.net/forum?id=Hy…☆12Aug 15, 2023Updated 2 years ago
- A micro service that allows to compile *Tex-files via HTTP☆13Mar 11, 2018Updated 8 years ago
- Flake8 checker for raw literals inside raises.☆17Jun 1, 2026Updated last week
- Legal Code for the State of Utah☆44Apr 8, 2014Updated 12 years ago
- ☆43Mar 9, 2026Updated 3 months ago
- Project to enable search of key words in text files extracted by the Querido Diário.☆14Jul 15, 2020Updated 5 years ago
- Rcpp Bindings to FastAD Automatic Differentiation☆13Jan 19, 2026Updated 4 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Unofficial faiss wheel builder for NVIDIA GPU☆36Apr 29, 2026Updated last month
- The code used to create and update the Open Australian Legal Embeddings, the first open-source embeddings of Australian legislative and j…☆14Feb 17, 2024Updated 2 years ago
- ChatGPT with access to the internet☆25Jun 16, 2023Updated 2 years ago
- Benchmark scripts for comparing different tokenizers and sentence segmenters of German☆12Feb 27, 2023Updated 3 years ago
- MCP Server für Deutsche Gesetzestexte☆46Dec 19, 2025Updated 5 months ago
- AUSTENDER OCDS Search API. This portal will provide users of AusTender data with documentation, code examples, bug notifications and feat…☆20Feb 12, 2024Updated 2 years ago
- code and data used to build a training dataset for dragnet models☆10Nov 29, 2020Updated 5 years ago