daac-tools / python-daachorse
๐ A fast implementation of the Aho-Corasick algorithm using the compact double-array data structure. (Python wrapper for daachorse)
โ15Updated last year
Related projects โ
Alternatives and complementary repositories for python-daachorse
- Rust implementation of SIF and uSIF: Simple and fast sentence embeddingโ19Updated 11 months ago
- AllenNLP integration for Shiba: Japanese CANINE modelโ12Updated 3 years ago
- The evaluation scripts of JMTEB (Japanese Massive Text Embedding Benchmark)โ33Updated this week
- โ18Updated last month
- Funer is Rule based Named Entity Recognition tool.โ22Updated 2 years ago
- ๐ฆ Rust library of natural language dictionaries using character-wise double-array tries.โ28Updated last year
- A library for semantic similarity searchโ23Updated 2 months ago
- Utility scripts for preprocessing Wikipedia texts for NLPโ76Updated 7 months ago
- Use custom tokenizers in spacy-transformersโ16Updated 2 years ago
- ๆฅๆฌ่ชใใญในใใซๅฏพใใ wikification ใฎใใใฎใฝใใใฆใงใขโ15Updated 7 years ago
- DIRECT: Direct and Indirect REsponses in Conversational Text Corpusโ16Updated 3 years ago
- Rust library providing fast language model queries in compressed spaceโ23Updated 2 years ago
- Testing tool to verify the search qualities of the Elasticsearch indicesโ29Updated last year
- This is the repository for TRF (text readability features) publication.โ39Updated 5 years ago
- Code for COLING 2020 Paperโ13Updated this week
- Repository of ACL2023 paper: Unbalanced Optimal Transport for Unbalanced Word Alignmentโ36Updated last year
- Yada is a yet another double-array trie library aiming for fast search and compact data representation.โ31Updated 8 months ago
- Finding all pairs of similar documents time- and memory-efficientlyโ58Updated 2 years ago
- Annotated Fuman Kaitori Center Corpusโ17Updated 10 months ago
- A Japanese dependency parser based on BERTโ22Updated 2 years ago
- โ18Updated 5 months ago
- Codes to pre-train Japanese T5 modelsโ40Updated 3 years ago
- ๐ Colt: Effortlessly configure and construct Python objects with colt, a lightweight library inspired by AllenNLP and Tangoโ24Updated last week
- Python Implementation of EmbedRankโ49Updated 5 years ago
- โ24Updated this week
- โ11Updated 2 months ago
- Edit and create Kubernetes job from cronjob template using your EDITORโ15Updated 3 months ago
- ๐ Implementation of Neural Network based Named Entity Recognizer (Lample+, 2016) using Chainer.โ45Updated last year
- Yet another sentence-level tokenizer for the Japanese textโ22Updated 2 years ago
- ๅฐ้็จ่ชๆฝๅบใขใซใดใชใบใ ใฎๅฎ่ฃ ใฎ็ทด็ฟโ18Updated 6 years ago