High performance Trie and Ahocorasick automata (AC automata) Keyword Match & Replace Tool for python. Correct case insensitive implementation!
☆94Oct 16, 2024Updated last year
Alternatives and similar repositories for cyac
Users that are interested in cyac are comparing it to the libraries listed below
Sorting:
- Fast and thread safe C++11 implementation of of the Aho-Corasick algorithm.☆10Mar 4, 2020Updated 5 years ago
- ☆12Sep 30, 2022Updated 3 years ago
- ☆15Updated this week
- 第一個開放的客語斷詞工具☆13Jun 10, 2018Updated 7 years ago
- ☆14Jul 9, 2018Updated 7 years ago
- Nordlys: Toolkit for entity-oriented and semantic search☆31Mar 23, 2021Updated 4 years ago
- ☆52Oct 9, 2025Updated 4 months ago
- ☆14Sep 22, 2016Updated 9 years ago
- Code for the paper: Combining Graph Degeneracy and Submodularity for Unsupervised Extractive Summarization☆17Apr 24, 2020Updated 5 years ago
- ☆14Feb 26, 2022Updated 4 years ago
- 🐎 A fast implementation of the Aho-Corasick algorithm using the compact double-array data structure. (Python wrapper for daachorse)☆20Mar 15, 2025Updated 11 months ago
- code and data for paper "GIANT: Scalable Creation of a Web-scale Ontology"☆39Apr 22, 2020Updated 5 years ago
- DeepCT and HDCT uses BERT to generate novel, context-aware bag-of-words term weights for documents and queries.☆325May 9, 2021Updated 4 years ago
- FlowDelta: Modeling Flow Information Gain in Reasoning for Conversational Machine Comprehension☆35Oct 4, 2022Updated 3 years ago
- FIGMENT☆15Jan 27, 2020Updated 6 years ago
- Vector search in Lucene based search attempting to use just the existing Lucene data structures (experimental)☆43Oct 29, 2019Updated 6 years ago
- Commonsense Inference on Events, Intents, and Reactions☆44Jun 14, 2019Updated 6 years ago
- hnsw implemented by python☆21Nov 28, 2019Updated 6 years ago
- This is a prototype of a multi-lingual suite for named-entity recognition in Python.☆21Apr 25, 2024Updated last year
- 医学预训练语言模型☆18Dec 17, 2020Updated 5 years ago
- 2019语言与智能技术竞赛-基于知识图谱的主动聊 天☆115May 24, 2019Updated 6 years ago
- Tool for parsing and converting various span encoding schemes.☆23Jan 13, 2024Updated 2 years ago
- Comparing LanceDB and Elasticsearch for full-text search and vector search performance☆29Feb 8, 2026Updated 3 weeks ago
- 一个基于Together AI的强大图像生成工具,支持文生图、图生图和提示词分析功能。☆24Nov 24, 2024Updated last year
- Google's Conceptual Captions Dataset translated into Korean☆23Aug 28, 2022Updated 3 years ago
- multiprocess unsupervised chinese_detect_words ngram_combination☆23Jan 2, 2019Updated 7 years ago
- Code repo for "Read Anywhere Pointed: Layout-aware GUI Screen Reading with Tree-of-Lens Grounding"☆28Jul 31, 2024Updated last year
- A datasets and methods survey about task-oriented dialogue, including recent datasets and SOTA leaderboards.☆1,244Nov 8, 2022Updated 3 years ago
- Implementation of <Improving Neural Question Generation Using Answer Separation> by Yanghoon Kim et al., AAAI 2019☆52Jul 18, 2020Updated 5 years ago
- 开天-新词,中文新词发现工具,Chinese New Word Discovery Tool☆22Dec 5, 2019Updated 6 years ago
- ☆25Jun 1, 2016Updated 9 years ago
- ☆24Nov 29, 2017Updated 8 years ago
- cicada: a hypergraph-based toolkit for statistical machine translation based on {tree, string}-to-{tree, string} models☆42Aug 9, 2021Updated 4 years ago
- A curated list of papers dedicated to neural text (semantic) matching.☆779Dec 8, 2023Updated 2 years ago
- 使用BERT解决lic2019机器阅读理解☆89May 31, 2019Updated 6 years ago
- This code repository presents the pytorch implementation of the paper “Implicit Deep Latent Variable Models for Text Generation”(EMNLP 20…☆55Mar 11, 2022Updated 3 years ago
- Versatile framework designed to streamline the integration of your models, as well as those sourced from Hugging Face, into complex progr…☆34Aug 21, 2025Updated 6 months ago
- This is a chinese Bert model specific for question answering☆27Aug 8, 2019Updated 6 years ago
- A pytorch implementation of Information Bottleneck GAN☆28Mar 6, 2019Updated 6 years ago