godatadriven / build-your-own-search-engine
This repository contains code to build an MVP search engine with google like interface.
☆16Updated 4 years ago
Alternatives and similar repositories for build-your-own-search-engine:
Users that are interested in build-your-own-search-engine are comparing it to the libraries listed below
- A few end to end examples that use data-describe☆16Updated last year
- This repository auto-configures an Apache Pinot and Superset cluster for analyzing IRA tweets from FiveThirtyEight.☆11Updated 4 years ago
- Events about the open source data stack☆13Updated 2 years ago
- ☆10Updated 3 years ago
- A curated list of awesome open source tools and commercial products to catalog, version, and manage data 🚀☆29Updated 2 years ago
- Awesome Orchest projects, both official and submitted by the community.☆25Updated last year
- 🔍Your Data Quality Detector / Gain insight into your data and get it ready for use before you start working with it 💡📊🛠💎☆16Updated 2 years ago
- DataHub on AWS demonstration resources☆10Updated last year
- Apache Spark based framework for analysis A/B experiments☆13Updated 2 months ago
- Using the Parquet file format with Python☆15Updated last year
- A lightweight tool to measure the full memory of a Python session☆19Updated 3 months ago
- ☆14Updated last month
- The sane way of building a data layer in Airflow☆24Updated 5 years ago
- 💻 CLI for reporting events to Faros platform☆14Updated 2 months ago
- Codeless Deep Learning with KNIME☆14Updated last year
- ☆12Updated last year
- Awesome list of dataops products, open source and resources☆24Updated 2 years ago
- Chatlytics is a data query and visualization platform for chat!☆13Updated 7 years ago
- Creating Apps with the ChatGPT API☆15Updated last year
- Full stack data engineering tools and infrastructure set-up☆47Updated 3 years ago
- ☆30Updated 3 years ago
- Repository to allow collaboration between Cycle Labs Cloud community in support of the community.☆9Updated 3 years ago
- How to do data science with Optimus, Spark and Python.☆19Updated 5 years ago
- A collection of my favorite tech-related blog posts.☆9Updated last week
- Building 3D Trusted Data Pipelines With Dagster, Dbt, and Duckdb☆19Updated last year
- Common Paper Service Level Agreement☆13Updated 9 months ago
- InGen is a command line tool written on top of pandas and great_expectations to perform small scale data transformations and validations …☆14Updated last month
- Astronomer Vendor Images☆12Updated this week
- Supported datasources for MindsDB☆16Updated 8 months ago