commoncrawl / language-detection-cld2
Natural language detection, Java bindings for CLD2
☆14Updated 2 months ago
Alternatives and similar repositories for language-detection-cld2:
Users that are interested in language-detection-cld2 are comparing it to the libraries listed below
- Rust crate for entity parsing☆16Updated 2 years ago
- finalfusion embeddings in Rust☆94Updated last year
- Context-sensitive word embeddings with subwords. In Rust.☆86Updated last year
- Rust implementation of Duckling☆78Updated 3 years ago
- Lightning fast spell correction / fuzzy search library based on SymSpell by Commerce-Experts☆81Updated 6 years ago
- Rust binding to crfsuite☆25Updated 2 years ago
- Lightning Fast Language Prediction 🚀☆165Updated 5 years ago
- Performance evaluation of nearest neighbor search using Vespa, Elasticsearch and Open Distro for Elasticsearch K-NN☆117Updated 3 years ago
- fastText Rust binding☆58Updated last year
- Fast English word segmentation in Rust☆94Updated last week
- Finds the likelihood that one string is a typo of another and generates likely typos from a given string☆61Updated 13 years ago
- 🌸 fastText + Bloom embeddings for compact, full-coverage vectors with spaCy☆300Updated last year
- Named Entity Recognition data for Europeana Newspapers☆171Updated last year
- allennlp + streamlit demo☆22Updated 5 years ago
- Search for similar short strings☆53Updated 4 years ago
- My most frequently used learning-to-rank algorithms ported to rust for efficiency. Try it: "pip install fastrank".☆51Updated last month
- Graph-based Approximate Nearest Neighbor Search☆317Updated 6 months ago
- WARC (Web Archive) Input and Output Formats for Hadoop☆35Updated 10 years ago
- Hunspell extension for spaCy 2.0.☆94Updated 5 months ago
- Neural syntax annotator, supporting sequence labeling, lemmatization, and dependency parsing.☆72Updated last year
- ☆62Updated last year
- Code and data accompanying the paper "Approaching nested named entity recognition with parallel LSTM-CRFs."☆26Updated 2 years ago
- Colibri core is an NLP tool as well as a C++ and Python library for working with basic linguistic constructions such as n-grams and skipg…☆125Updated last month
- Rust wrapper for Microsoft's ONNX Runtime with CUDA support (version 1.7)☆24Updated 2 years ago
- A tool for learning significant phrase/term models, and efficiently labeling with them.☆31Updated last year
- Simple NLP in Rust with Python bindings☆150Updated last year
- Open-Source Information Retrieval Reproducibility Challenge☆50Updated 9 years ago
- Common Index File Format to to support interoperability between open-source IR engines☆31Updated 4 months ago
- A small tool that EXPLains spACY parse results. See what I did there?☆83Updated 2 years ago
- Succeeded by SyntaxDot: https://github.com/tensordot/syntaxdot☆25Updated 3 years ago