bumble-tech / buzzwordsLinks
GPU-Powered Topic Modelling
☆70Updated 2 years ago
Alternatives and similar repositories for buzzwords
Users that are interested in buzzwords are comparing it to the libraries listed below
Sorting:
- Semantic search through a vectorized Wikipedia (SentenceBERT) with the Weaviate vector search engine☆242Updated 2 years ago
- UniSim is a package for efficient similarity computation, fuzzy matching, and clustering of data.☆138Updated 3 months ago
- Production-grade embedding generation, for any length of text, for transformer models.☆23Updated last month
- ☆47Updated 2 years ago
- Efficient BM25 with DuckDB 🦆☆52Updated 6 months ago
- 💫 SpaCy wrapper for ConceptNet 💫☆94Updated last year
- Tools for interactive visual exploration of semantic embeddings.☆35Updated 10 months ago
- ☄️ Parallel and distributed training with spaCy and Ray☆54Updated last year
- Common crawl extractor☆78Updated last year
- With embedders, you can easily convert your texts into sentence- or token-level embeddings within a few lines of code. Use cases for this…☆21Updated this week
- This is the repo for the container that holds the models for the text2vec-transformers module☆51Updated last week
- Document level Attitude and Relation Extraction toolkit (AREkit) for sampling and processing large text collections with ML and for ML☆63Updated 6 months ago
- Information extraction from English and German texts based on predicate logic☆137Updated 2 years ago
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy.☆79Updated last year
- Various Jupyter notebooks about Common Crawl data☆55Updated 3 months ago
- Explore vector similarity in Redis☆115Updated last year
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.☆108Updated last year
- RaKUn 2.0 - A fast keyword detection algorithm☆67Updated 3 months ago
- This repository is designed for deploying and managing server processes that handle embeddings using the Infinity Embedding model or Larg…☆23Updated 4 months ago
- Expose a Top2Vec model with a REST API.☆90Updated 2 years ago
- Tools to construct and process Common Crawl webgraphs☆92Updated 2 weeks ago
- A production-ready, scalable Indexer for the Jina neural search framework, based on HNSW and PSQL☆30Updated 2 years ago
- Aim-spaCy integration☆34Updated 2 years ago
- H&M Fashion Image similarity search with Weaviate and DocArray☆43Updated last year
- Supervised instruction finetuning for LLM with HF trainer and Deepspeed☆35Updated 2 years ago
- 📚 Datasets and models for instruction-tuning☆238Updated last year
- XAI based human-in-the-loop framework for automatic rule-learning.☆49Updated last year
- Neural Solr = Solr 9 + Mighty Inference + Node☆17Updated 3 years ago
- 🤗 Disaggregators: Curated data labelers for in-depth analysis.☆66Updated 2 years ago
- The definitive guide to using Vector Search to solve your semantic search production workload needs.☆273Updated 2 years ago