Semantic search through a vectorized Wikipedia (SentenceBERT) with the Weaviate vector search engine
☆245 · May 31, 2023 · Updated 2 years ago
Alternatives and similar repositories for semantic-search-through-wikipedia-with-weaviate
Users that are interested in semantic-search-through-wikipedia-with-weaviate are comparing it to the libraries listed below.
- Search through Facebook Research's PyTorch BigGraph Wikidata-dataset with the Weaviate vector search engine ☆31 · Dec 12, 2021 · Updated 4 years ago
- This repository will contain a demo using Weaviate with data and metadata from the arXiv dataset. ☆15 · Mar 8, 2022 · Updated 4 years ago
- SpikeX - SpaCy Pipes for Knowledge Extraction ☆403 · Jul 30, 2021 · Updated 4 years ago
- H&M Fashion Image similarity search with Weaviate and DocArray ☆43 · Mar 18, 2024 · Updated 2 years ago
- 🌸 fastText + Bloom embeddings for compact, full-coverage vectors with spaCy ☆339 · Apr 25, 2025 · Updated last year
- This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' pu… ☆41 · Jan 5, 2022 · Updated 4 years ago
- Binary Passage Retriever (BPR) - an efficient passage retriever for open-domain question answering ☆174 · Jun 6, 2021 · Updated 4 years ago
- An easy-to-use Python module that helps you to extract the BERT embeddings for a large text dataset (Bengali/English) efficiently. ☆36 · May 18, 2023 · Updated 2 years ago
- Converter from UD-trees to BART representation ☆35 · Mar 6, 2024 · Updated 2 years ago
- AI apps/benchmark for legaltech ☆115 · Sep 22, 2021 · Updated 4 years ago
- ☆19 · Oct 10, 2020 · Updated 5 years ago
- ☆13 · Aug 13, 2020 · Updated 5 years ago
- code and data for paper "One-shot Text Field Labeling using Attention and BeliefPropagation for Structure Information Extraction" ☆61 · Aug 9, 2020 · Updated 5 years ago
- Neural Solr = Solr 9 + Mighty Inference + Node ☆18 · Jun 9, 2022 · Updated 3 years ago
- Entity linking, entity typing and relation extraction: Matching CSV to a Wikibase instance (e.g., Wikidata) via Meta-lookup ☆71 · Jun 9, 2025 · Updated 11 months ago
- Agglomerative hierarchical clustering in JavaScript ☆19 · Dec 17, 2024 · Updated last year
- 🛠️ Tools for Transformers compression using PyTorch Lightning ⚡ ☆85 · Feb 1, 2026 · Updated 3 months ago
- BookNLP, a natural language processing pipeline for books ☆915 · Jul 31, 2024 · Updated last year
- Ukrainian ELECTRA model ☆12 · Mar 11, 2023 · Updated 3 years ago
- Helm charts to deploy Weaviate to k8s ☆66 · Apr 16, 2026 · Updated 3 weeks ago
- Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with struc… ☆16,141 · Updated this week
- PyTorch implementation for MRL ☆23 · Feb 22, 2024 · Updated 2 years ago
- Semantic search using Transformers and others ☆110 · Aug 27, 2020 · Updated 5 years ago
- Dense Passage Retrieval using tensorflow-keras on TPU ☆17 · Jun 27, 2021 · Updated 4 years ago
- A library to synthesize text datasets using Large Language Models (LLM) ☆152 · Jan 17, 2023 · Updated 3 years ago
- Compute Sentence Embeddings Fast! ☆626 · Mar 2, 2023 · Updated 3 years ago
- Powerful rapid automatic EDA and feature engineering library with a very easy to use API 🌟 ☆52 · Jan 8, 2022 · Updated 4 years ago
- Tutorial and talk about the Reasonable Ontology Language at the Knowledge Graph Conference 2022. ☆12 · May 9, 2023 · Updated 3 years ago
- A framework for detecting, highlighting and correcting grammatical errors on natural language text. Created by Prithiviraj Damodaran. Ope… ☆1,580 · Feb 15, 2023 · Updated 3 years ago
- Dead simple cron service for making HTTP calls on a regular schedule. ☆14 · Jul 11, 2020 · Updated 5 years ago
- Search with BERT vectors in Solr, Elasticsearch, OpenSearch and GSI APU ☆166 · Aug 28, 2024 · Updated last year
- SummVis is an interactive visualization tool for text summarization. ☆253 · Jun 17, 2022 · Updated 3 years ago
- A demo that shows how to build a semantic search experience with Typesense's vector search feature and Instantsearch.js ☆28 · Aug 22, 2023 · Updated 2 years ago
- QED: A Framework and Dataset for Explanations in Question Answering ☆119 · Aug 3, 2021 · Updated 4 years ago
- Remove outdated content via https://www.google.com/webmasters/tools/removals in bulk! ☆22 · Dec 25, 2020 · Updated 5 years ago
- Active Learning for Text Classification in Python ☆640 · Apr 17, 2026 · Updated 3 weeks ago
- Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: … ☆341 · Jul 6, 2023 · Updated 2 years ago
- Doubt your data, find bad labels. ☆516 · Jul 15, 2024 · Updated last year
- A curated list of awesome resources related to Semantic Search🔎 and Semantic Similarity tasks. ☆362 · Dec 9, 2025 · Updated 5 months ago