wissam-sib / universal-sentence-encoder-java
Convert sentences to fixed-size embeddings in Java
☆11Updated 4 years ago
Alternatives and similar repositories for universal-sentence-encoder-java:
Users interested in universal-sentence-encoder-java are comparing it to the libraries listed below.
- A Recurrent Neural Network for classifying the grammaticality of English sentences☆13Updated 11 years ago
- A small, non-commercial, fair-use subset of the Penn Treebank, in JSON.☆15Updated 6 years ago
- Examples, tutorials and use cases for Marian, including our WMT-2017/18 baselines.☆76Updated last year
- A web interface to understand language-specific BERT-models☆17Updated 11 months ago
- SPRINT Toolkit helps you evaluate diverse neural sparse models easily using a single click on any IR dataset.☆44Updated last year
- Scorer for grammatical error correction systems.☆14Updated 9 years ago
- Evaluation tools shared across anserini, pyserini, and pygaggle☆31Updated 2 months ago
- The repository for the paper "When Do You Need Billions of Words of Pretraining Data?"☆21Updated 4 years ago
- Bilingual sentence similarity classifier using Tensorflow☆21Updated 5 years ago
- CRF-based Chinese word segmenter☆19Updated 10 years ago
- Provides a preprocessing platform for Lucene indexing and comprehensive Learning-to-Rank modules☆13Updated 7 years ago
- SuperMinHash: A New Minwise Hashing Algorithm for Jaccard Similarity Estimation, Simhash and SimhashIndex☆19Updated 2 years ago
- Vector search in Lucene-based search, attempting to use just the existing Lucene data structures (experimental)☆43Updated 5 years ago
- Code for the EMNLP 2020 paper titled "Chapter Captor: Text Segmentation in Novels"☆30Updated 4 years ago
- Dice.com repo to accompany the dice.com 'Vectors in Search' talk by Simon Hughes, from the Activate 2018 search conference, and the 'Sear…☆85Updated 3 years ago
- An easy-to-use library to linguistically compare one sentence and its words to another, in the same language or a different one. For inst…☆22Updated 3 years ago
- Model for learning document embeddings along with their uncertainties☆35Updated last year
- Supporting example for "A Rust SentencePiece implementation"☆18Updated 4 years ago
- My NER Experiments with ModernBERT☆18Updated 2 months ago
- Learning BPE embeddings by first learning a segmentation model and then training word2vec☆19Updated 2 years ago
- GrammarTagger — A Neural Multilingual Grammar Profiler for Language Learning☆27Updated 3 years ago
- A Mixed Trie and Levenshtein distance implementation in Java for extremely fast prefix string searching and string similarity.☆43Updated 2 years ago
- Short Text Similarity as described in https://dl.acm.org/citation.cfm?id=2806475☆16Updated 6 years ago
- Lightweight method based on shortest paths on word graphs and NLP to generate single-sentence summaries that are highly relevant and grammatic…☆19Updated 8 years ago
- StAtutory Reasoning Assessment☆13Updated 2 years ago
- Benchmarks for evaluating MT models☆12Updated 9 months ago
- Code for "Contextualized Embeddings in Named-Entity Recognition", ECIR 2020☆13Updated 8 months ago
- Data collection, alignment and TAUS repository☆23Updated 7 years ago