wissam-sib / universal-sentence-encoder-java
Convert sentences to fixed-size embeddings in Java
☆11Updated 5 years ago
Alternatives and similar repositories for universal-sentence-encoder-java
Users that are interested in universal-sentence-encoder-java are comparing it to the libraries listed below
- Trainable embedding transformation for confidence estimation, feature extraction, explainability and conversion from dense to sparse.☆26Updated 8 months ago
- A Mixed Trie and Levenshtein distance implementation in Java for extremely fast prefix string searching and string similarity.☆45Updated 3 years ago
- Neural Solr = Solr 9 + Mighty Inference + Node☆18Updated 3 years ago
- Applying "Load What You Need: Smaller Versions of Multilingual BERT" to LaBSE☆19Updated 4 years ago
- ☆69Updated 5 years ago
- Subword Language Model for Query Auto-Completion☆67Updated 6 years ago
- This repository contains code and datasets for our paper on the effects of document multiplicity while the context size is fixed in Retri…☆18Updated 10 months ago
- SuperMinHash: A New Minwise Hashing Algorithm for Jaccard Similarity Estimation, Simhash and SimhashIndex☆19Updated 3 years ago
- YASEM - Yet Another Splade|Sparse Embedder - A simple and efficient library for SPLADE embeddings☆13Updated 8 months ago
- Label shift estimation for transfer difficulty with Familiarity.☆10Updated last year
- 🎹 Instruct.KR 2025 Summer Meetup: Open-source LLMs, all the way to production with vLLM 🎹☆24Updated 6 months ago
- Generate fixed dimensional embeddings for multi-dimensional vectors in python based on Muvera from Google.☆19Updated 7 months ago
- Lightweight method based on shortest paths on word graphs and NLP to generate single-sentence summaries that are highly relevant and grammatic…☆19Updated 9 years ago
- A framework for benchmarking embedding models in hybrid search scenarios (BM25 + vector search) using Weaviate.☆38Updated 3 weeks ago
- SPRINT Toolkit helps you evaluate diverse neural sparse models easily using a single click on any IR dataset.☆47Updated 2 years ago
- Word2VecfJava: Java implementation of Dependency-Based Word Embeddings and extensions☆16Updated 7 years ago
- *high-load* benchmarking tool☆16Updated this week
- Examples, tutorials and use cases for Marian, including our WMT-2017/18 baselines.☆81Updated 2 years ago
- Extractive and Compressive Neural Summarization Based on Summary State Representations (NAACL 2019)☆16Updated 5 years ago
- My NER Experiments with ModernBERT and Ettin☆26Updated 6 months ago
- Simple extension of WikiExtractor (https://github.com/attardi/wikiextractor)☆16Updated 9 years ago
- Evaluation tools shared across anserini, pyserini, and pygaggle☆35Updated last week
- BERT models for many languages created from Wikipedia texts☆33Updated 5 years ago
- Dice.com repo to accompany the dice.com 'Vectors in Search' talk by Simon Hughes, from the Activate 2018 search conference, and the 'Sear…☆86Updated 4 years ago
- My most frequently used learning-to-rank algorithms ported to Rust for efficiency. Try it: "pip install fastrank".☆52Updated 11 months ago
- User-friendly viewer for Parquet files☆10Updated 3 weeks ago
- FAMIE: A Fast Active Learning Framework for Multilingual Information Extraction☆24Updated 3 years ago
- Seahorse is a dataset for multilingual, multi-faceted summarization evaluation. It consists of 96K summaries with human ratings along 6 q…☆89Updated last year
- Symphony Machine Translation☆38Updated 5 years ago
- Model implementation for the contextual embeddings project☆40Updated 8 months ago