DAFSA-based dictionary-like read-only objects for Python. Based on the `dawgdic` C++ library.
☆305 · Jun 11, 2024 · Updated last year
Alternatives and similar repositories for DAWG
Users who are interested in DAWG compare it to the libraries listed below.
- Pure-python reader for DAWGs created by dawgdic C++ library or DAWG Python extension. ☆50 · Sep 11, 2023 · Updated 2 years ago
- Static memory-efficient Trie-like structures for Python based on marisa-trie C++ library. ☆1,123 · Dec 12, 2025 · Updated 2 months ago
- HAT-Trie for Python ☆87 · Feb 8, 2016 · Updated 10 years ago
- Fast, efficiently stored Trie for Python. Uses libdatrie. ☆546 · Jan 6, 2026 · Updated last month
- Python module (C extension and plain python) implementing DAWG ☆21 · Jan 6, 2022 · Updated 4 years ago
- Library for computing Deterministic Acyclic Finite State Automata (DAFSA) ☆27 · Feb 18, 2023 · Updated 3 years ago
- Python package for lexicon; Trie and DAWG implementation. ☆56 · Feb 23, 2026 · Updated last week
- MARISA: Matching Algorithm with Recursively Implemented StorAge ☆595 · Feb 11, 2026 · Updated 2 weeks ago
- A platform for collecting, analyzing, and visualizing social media data. ☆13 · Dec 27, 2020 · Updated 5 years ago
- An Efficient Language Model Using Double-Array Structures ☆17 · Aug 10, 2020 · Updated 5 years ago
- A set of distinct value estimators that give probabilistic bounds on a set's cardinality ☆22 · Dec 9, 2019 · Updated 6 years ago
- Morphological analyzer / inflection engine for Russian and Ukrainian languages. ☆1,166 · Jun 26, 2024 · Updated last year
- Examples of spark-lucenerdd ☆15 · Oct 6, 2023 · Updated 2 years ago
- Large silver standard Russian corpus with NER, morphology and syntax markup ☆73 · Jul 24, 2023 · Updated 2 years ago
- COrpus based Morphological Analyzer with INtegrated User dictionary ☆21 · Mar 30, 2025 · Updated 11 months ago
- An atomic class that guarantees atomic updates to its contained value. ☆24 · Aug 15, 2018 · Updated 7 years ago
- A multi-language segmenter using high-order CRF. ☆17 · Feb 27, 2020 · Updated 6 years ago
- Tools for building a Lucene index for Semantic Vectors ☆21 · Jul 16, 2015 · Updated 10 years ago
- python3 package supporting efficient storage and querying of sets of sets using the trie data structure. Supports finding all the superse… ☆23 · Sep 15, 2023 · Updated 2 years ago
- Python library implementing a trie data structure. ☆824 · Apr 10, 2021 · Updated 4 years ago
- [deprecated] reference code for string segmentation using LSTM(tensorflow) ☆19 · Feb 19, 2020 · Updated 6 years ago
- Sub-Character Representation Learning ☆25 · May 28, 2018 · Updated 7 years ago
- Retrofitting Word Vectors to Semantic Lexicons ☆375 · Apr 9, 2019 · Updated 6 years ago
- A comparison of various moving window median algorithms ☆17 · Jun 4, 2011 · Updated 14 years ago
- A C++ library providing fast language model queries in compressed space. ☆132 · Feb 25, 2023 · Updated 3 years ago
- Pyed Piper tool by Toby Rosen at Sony Imageworks converted to Python 3 ☆35 · Dec 7, 2021 · Updated 4 years ago
- Analytic UIMA pipelines using Spark ☆24 · Nov 27, 2015 · Updated 10 years ago
- A collection of various discourse segmenters ☆10 · Jun 30, 2017 · Updated 8 years ago
- INTERVAL field for PostgreSQL (and an approximation for other backends) ☆21 · Jul 27, 2023 · Updated 2 years ago
- Temporal and Causal Reasoning (dataset) ☆10 · Apr 19, 2022 · Updated 3 years ago
- Prospective search for python ☆26 · Dec 4, 2012 · Updated 13 years ago
- Hadoop integration code for working with Apache cTAKES ☆10 · Feb 11, 2014 · Updated 12 years ago
- [2007] Windows tool, offers the ability to dynamically and transparently modify incoming and outgoing network traffic, as well as to redi… ☆12 · Nov 27, 2017 · Updated 8 years ago
- Tool for tagging FLV files ☆12 · Nov 2, 2015 · Updated 10 years ago
- Machine Learning solution for Kaggle.com's "Partly Sunny with a Chance of Hashtags" ☆27 · Dec 6, 2013 · Updated 12 years ago
- String interpolation with printf syntax ☆11 · Jun 21, 2015 · Updated 10 years ago
- A pytest plugin to add markers based on fixtures used. ☆14 · Nov 13, 2022 · Updated 3 years ago
- A small java library for NLP Interchange Format (NIF) for NER(D) systems ☆10 · Sep 13, 2022 · Updated 3 years ago
- Rule-based token, sentence segmentation for Russian language ☆278 · Jul 24, 2023 · Updated 2 years ago