godatadriven / build-your-own-search-engine
This repository contains code to build an MVP search engine with google like interface.
☆15Updated 4 years ago
Alternatives and similar repositories for build-your-own-search-engine
Users that are interested in build-your-own-search-engine are comparing it to the libraries listed below
Sorting:
- This repository auto-configures an Apache Pinot and Superset cluster for analyzing IRA tweets from FiveThirtyEight.☆11Updated 4 years ago
- Events about the open source data stack☆13Updated 3 years ago
- A set of tools to accelerate work in Jupyter notebooks.☆11Updated 5 years ago
- A few end to end examples that use data-describe☆16Updated 2 years ago
- ☆10Updated 3 years ago
- DataHub on AWS demonstration resources☆10Updated 2 years ago
- Repository to allow collaboration between Cycle Labs Cloud community in support of the community.☆9Updated 3 years ago
- Example how to pre-process news articles with textbox and index on Elastic Search☆13Updated 7 years ago
- Awesome list of dataops products, open source and resources☆24Updated 3 years ago
- Asynchronous tasks on the cloud☆21Updated last year
- Supported datasources for MindsDB☆16Updated last week
- A curated list of awesome open source tools and commercial products to catalog, version, and manage data 🚀☆32Updated 3 years ago
- 🔍Your Data Quality Detector / Gain insight into your data and get it ready for use before you start working with it 💡📊🛠💎☆16Updated 2 years ago
- A Data Mesh demo repository☆13Updated 7 months ago
- Chatlytics is a data query and visualization platform for chat!☆13Updated 8 years ago
- 💻 CLI for reporting events to Faros platform☆14Updated 6 months ago
- duckdb-etl-framework☆10Updated 4 months ago
- Neural Solr = Solr 9 + Mighty Inference + Node☆17Updated 2 years ago
- The IBM DB2 adapter plugin for dbt (data build tool)☆10Updated 11 months ago
- A library for creating full representations of Mozilla telemetry pings.☆11Updated last month
- Public source code for the Batch Processing with Apache Beam (Python) online course☆18Updated 4 years ago
- Codeless Deep Learning with KNIME☆14Updated 2 years ago
- Example Set up For DBT Cloud using Github Integrations☆11Updated 5 years ago
- Awesome Orchest projects, both official and submitted by the community.☆25Updated last year
- Git scrapers for scraping the fediverse☆16Updated this week
- A lightweight tool to measure the full memory of a Python session☆19Updated 2 months ago
- Write data & AI pipelines in (SQL, Spark, Pandas) and deploy to the cloud, simplified☆34Updated 3 weeks ago
- Instant search for and access to many datasets in Pyspark.☆34Updated 2 years ago
- Orchest quickstart pipeline☆18Updated 2 years ago
- A utility for labeling clusters of text data.☆28Updated 3 years ago