Snowball compiler and stemming algorithms
☆841 · Mar 13, 2026 · Updated last week
Alternatives and similar repositories for snowball
Users who are interested in snowball are comparing it to the libraries listed below
- Like HyperLogLog, but slower 🛶 ☆10 · Feb 5, 2020 · Updated 6 years ago
- Implementation Project for relation extraction in NLP using kernel based methods. ☆33 · Aug 10, 2016 · Updated 9 years ago
- ☆16 · Dec 19, 2022 · Updated 3 years ago
- CoreNLP: A Java suite of core NLP tools for tokenization, sentence segmentation, NER, parsing, coreference, sentiment analysis, etc. ☆10,062 · Feb 10, 2026 · Updated last month
- Apache OpenNLP ☆1,586 · Updated this week
- A CRF based Chinese Named-entity Recognition Library written in Rust ☆14 · Jan 23, 2021 · Updated 5 years ago
- Clojure bindings to Apache Tika project ☆24 · Jul 4, 2013 · Updated 12 years ago
- Resurrection of the EuLisp definition and the Youtoo, EuXLisp and Eu2C implementations ☆64 · Feb 27, 2011 · Updated 15 years ago
- C implementation of a compressed trie lookup map ☆23 · May 14, 2019 · Updated 6 years ago
- A subfield of the complex numbers for exact calculation. ☆21 · May 22, 2020 · Updated 5 years ago
- Library for fast text representation and classification. ☆26,504 · Mar 22, 2024 · Updated last year
- The Heirloom Documentation Tools: troff, nroff, and related utilities ☆147 · Aug 11, 2024 · Updated last year
- Interned string and more for rust ☆10 · Dec 29, 2021 · Updated 4 years ago
- 💫 Industrial-strength Natural Language Processing (NLP) in Python ☆33,352 · Updated this week
- Example of om.next end to end app with frontend communicating with backend ☆12 · Mar 17, 2016 · Updated 10 years ago
- The approximate regex matching library and agrep command line tool. ☆878 · Jan 8, 2026 · Updated 2 months ago
- The Solr Package Directory and Sanctuary ☆13 · Oct 14, 2025 · Updated 5 months ago
- PostgreSQL extension (in C) to expose functionality from the ICU library ☆32 · Jan 2, 2026 · Updated 2 months ago
- Python functions for popular relevance metrics (ndcg, err, etc) ☆17 · Jul 28, 2023 · Updated 2 years ago
- 🦔 Fast, lightweight & schema-less search backend. An alternative to Elasticsearch that runs on a few MBs of RAM. ☆21,153 · Feb 17, 2026 · Updated last month
- provides a common interface to many IR measure tools ☆97 · Feb 17, 2026 · Updated last month
- PISA: Performant Indexes and Search for Academia ☆1,045 · Feb 16, 2026 · Updated last month
- Berkeley YACC (git mirror) ☆15 · Mar 1, 2015 · Updated 11 years ago
- Zed UI – cross-platform graphical user interfaces ☆11 · Jun 25, 2019 · Updated 6 years ago
- Implementation (in progress) of Dieng et al.'s TopicRNN intended to be used as a baseline and starting point. ☆10 · Jun 26, 2018 · Updated 7 years ago
- Topic Modelling for Humans ☆16,375 · Nov 1, 2025 · Updated 4 months ago
- Provide a reference implementation for the TTM programming language. ☆10 · Oct 5, 2014 · Updated 11 years ago
- RE2 is a fast, safe, thread-friendly alternative to backtracking regular expression engines like those used in PCRE, Perl, and Python. It… ☆9,620 · Jan 22, 2026 · Updated last month
- A time-series database for high-performance real-time analytics packaged as a Postgres extension ☆22,143 · Updated this week
- Improve your OpenSearch, Elasticsearch, Solr, Vectara, Algolia and Custom Search search quality. ☆338 · Updated this week
- Documentation for Macro SPITBOL ☆13 · Jan 28, 2026 · Updated last month
- Represent large sets and maps compactly with finite state transducers. ☆2,053 · Sep 25, 2024 · Updated last year
- postgresql.conf comparison tool ☆16 · Oct 28, 2025 · Updated 4 months ago
- Prime number testing and generation in R ☆10 · Jan 16, 2024 · Updated 2 years ago
- Original Joy ☆11 · Dec 17, 2024 · Updated last year
- mimalloc is a compact general purpose allocator with excellent performance. ☆12,612 · Updated this week
- Demo re-implementation of the Hadoop MapReduce scheduler in Python ☆13 · Mar 1, 2016 · Updated 10 years ago
- Common Lisp RabbitMQ client based on IOLib ☆15 · Nov 5, 2023 · Updated 2 years ago
- 🔮 A refreshing functional take on deep learning, compatible with your favorite libraries ☆2,892 · Feb 9, 2026 · Updated last month