commoncrawl / language-detection-cld2Links
Natural language detection, Java bindings for CLD2
☆14Updated last month
Alternatives and similar repositories for language-detection-cld2
Users that are interested in language-detection-cld2 are comparing it to the libraries listed below
Sorting:
- Lightning fast spell correction / fuzzy search library based on SymSpell by Commerce-Experts☆81Updated 7 years ago
- Java port of SymSpell: 1 million times faster through Symmetric Delete spelling correction algorithm☆67Updated 4 months ago
- Various utilities regarding Levenshtein transducers. (Java)☆58Updated 3 years ago
- Search relevance evaluation toolkit☆74Updated 3 years ago
- A fast and simple JavaScript library specifically targeted at collecting search and search related browser events.☆43Updated this week
- Querqy for Elasticsearch☆47Updated last week
- A Mixed Trie and Levenshtein distance implementation in Java for extremely fast prefix string searching and string similarity.☆45Updated 3 years ago
- Set of Jupyter notebooks demonstrating Learning to Rank integrated with Solr and Elasticsearch☆173Updated 8 months ago
- CommonCrawl WARC/WET/WAT examples and processing code for Java + Hadoop☆37Updated 11 months ago
- Query preprocessor for Java-based search engines (Querqy Core and Lucene implementation)☆188Updated last week
- Lightning Fast Language Prediction 🚀☆167Updated 3 months ago
- NLP framework for JVM languages.☆152Updated 4 years ago
- Tools and other things for people who work on search relevance & information retrieval☆87Updated 2 years ago
- Open-Source Information Retrieval Reproducibility Challenge☆50Updated 9 years ago
- CuVS integration for Lucene☆37Updated 5 months ago
- A text tagger based on Lucene / Solr, using FST technology☆177Updated last year
- Rust implementation of Duckling☆79Updated 4 years ago
- DKPro C4CorpusTools is a collection of tools for processing CommonCrawl corpus, including Creative Commons license detection, boilerplate…☆52Updated 5 years ago
- Lucene for Information Retrieval☆50Updated 2 years ago
- A Java library for byte pattern matching and searching☆41Updated 4 years ago
- Hardened Fork of Ranklib learning to rank library☆45Updated 3 years ago
- Context-sensitive word embeddings with subwords. In Rust.☆88Updated 2 years ago
- Performance evaluation of nearest neighbor search using Vespa, Elasticsearch and Open Distro for Elasticsearch K-NN☆117Updated 4 years ago
- Java client for spaCy and more.☆15Updated 3 years ago
- Rust library for parsing and resolving entity values based on a gazetteer☆15Updated last year
- Dice.com's relevancy feedback solr plugin created by Simon Hughes (Dice). Contains request handlers for doing MLT style recommendations, …☆23Updated 4 years ago
- ☆17Updated 9 years ago
- Solr Dictionary Annotator (Microservice for Spark)☆71Updated 5 years ago
- Search relevance evaluation toolkit☆34Updated 3 years ago
- provide preprocessing platform for Lucene indexing and comprehensive Learning-to-Rank modules☆13Updated 7 years ago