godatadriven / build-your-own-search-engine
This repository contains code to build an MVP search engine with a Google-like interface.
☆15Updated 4 years ago
Alternatives and similar repositories for build-your-own-search-engine:
Users who are interested in build-your-own-search-engine are comparing it to the repositories listed below
- A few end to end examples that use data-describe☆16Updated last year
- ☆12Updated last year
- DataHub on AWS demonstration resources☆10Updated 2 years ago
- This is a real-life, high throughput streaming ELT data pipeline for ecommerce☆13Updated last year
- This repository auto-configures an Apache Pinot and Superset cluster for analyzing IRA tweets from FiveThirtyEight.☆11Updated 4 years ago
- Intended for internal use: deploys all infrastructure required for Astronomer to run on GCP☆10Updated 7 months ago
- data-mesh-demo☆13Updated 2 years ago
- 💻 CLI for reporting events to Faros platform☆14Updated 4 months ago
- Public source code for the Batch Processing with Apache Beam (Python) online course☆18Updated 4 years ago
- Events about the open source data stack☆13Updated 2 years ago
- pycaret-git-actions☆15Updated 4 years ago
- ☆10Updated 3 years ago
- Async bulk data ingestion and querying in various document, graph and vector databases via their Python clients☆36Updated last year
- A curated list of awesome Databricks resources, including Spark☆17Updated 8 months ago
- Robotic Process Automation Projects, published by Packt☆35Updated 2 years ago
- Code examples for the Introduction to Kubeflow course☆14Updated 4 years ago
- This project is created to promote and advocate the use of FOSS machine learning.☆43Updated 3 weeks ago
- Learning and building APIs using FastAPI☆14Updated 3 years ago
- ☆11Updated 3 years ago
- Example Set up For DBT Cloud using Github Integrations☆11Updated 5 years ago
- How to do data science with Optimus, Spark and Python.☆19Updated 5 years ago
- A curated list of awesome open source tools and commercial products to catalog, version, and manage data 🚀☆32Updated 2 years ago
- Skeleton project for Apache Airflow training participants to work on.☆16Updated 4 years ago
- Full stack data engineering tools and infrastructure set-up☆50Updated 4 years ago
- ☆30Updated 3 years ago
- bamboolib - template for creating your own binder notebook☆21Updated 3 years ago
- Codeless Deep Learning with KNIME☆14Updated 2 years ago
- A Python sampling profiler for AWS Lambda functions (and not only).☆12Updated 3 years ago