JamesMcGuigan / elasticsearch-faiss-cosine-similarity-search
Cosine Similary Search in ElasticSearch + FAISS GPU
☆11Updated 2 years ago
Related projects: ⓘ
- Pyinfer is a model agnostic tool for ML developers and researchers to benchmark the inference statistics for machine learning models or f…☆24Updated 3 years ago
- Search system on top of Elasticsearch, Kubeflow and Katib☆29Updated last year
- Language detection using Spacy and Fasttext☆53Updated 9 months ago
- In the wild extraction of entities that are found using Flair and displayed using a very elegant front-end.☆69Updated last year
- Tutorial for developing a topic-specific autocomplete function for Jupyter notebooks based on PyTorch and Google Colaboratory.☆36Updated 5 years ago
- Supporting code for Learning to Rank (LTR) presentation☆16Updated 5 years ago
- A collection of scripts to preprocess ASR datasets and finetune language-specific Wav2Vec2 XLSR models☆31Updated 3 years ago
- Source code for the Apple reproduction☆30Updated 3 years ago
- Dice.com repo to accompany the dice.com 'Vectors in Search' talk by Simon Hughes, from the Activate 2018 search conference, and the 'Sear…☆84Updated 3 years ago
- semantically distinct key phrase extraction using hilbert hashes.☆46Updated 2 years ago
- zero-vocab or low-vocab embeddings☆16Updated 2 years ago
- An open-source NLP library: fast text cleaning and preprocessing☆23Updated 2 years ago
- ☆13Updated 3 years ago
- Neural Elastic Inference and Search☆19Updated 4 years ago
- A lightweight but powerful library to build token indices for NLP tasks, compatible with major Deep Learning frameworks like PyTorch and …☆49Updated 3 years ago
- ☆69Updated 3 years ago
- An extension package of 🤗 Datasets that provides support for executing arbitrary SQL queries on HF datasets☆31Updated 7 months ago
- 🦖 Streamlined Recommender Systems with TensorFlow and KubeFlow☆18Updated last year
- Alternate Implementation for Zero Shot Text Classification: Instead of reframing NLI/XNLI, this reframes the text backbone of CLIP models…☆37Updated 2 years ago
- Extracts a latent knowledge graph from text and index/query it in elasticsearch or solr☆19Updated 2 years ago
- allennlp + streamlit demo☆21Updated 4 years ago
- The code describes how to load fastText vectors onto spaCy☆18Updated 3 years ago
- sequence tagging with spaCy and crfsuite☆18Updated last year
- An easy-to-use Python module that helps you to extract the BERT embeddings for a large text dataset (Bengali/English) efficiently.☆37Updated last year
- Fast edit distance Python extension written in Cython/C++. Supports Levenshtein distance and Damerau Optimal String Alignment (OSA) dista…☆23Updated last week
- Hashformers is a framework for hashtag segmentation with Transformers and Large Language Models (LLMs).☆67Updated last month
- ClickModels for Search Engines Implemented on top of Cython.☆13Updated 3 years ago
- ☆16Updated 3 years ago
- spaCy match and replace, maintaining conjugation☆34Updated last year
- Tokenization across languages. Useful as preprocessing for subword tokenization.☆21Updated last year