godatadriven / build-your-own-search-engineLinks
This repository contains code to build an MVP search engine with google like interface.
☆15Updated 4 months ago
Alternatives and similar repositories for build-your-own-search-engine
Users that are interested in build-your-own-search-engine are comparing it to the libraries listed below
Sorting:
- Events about the open source data stack☆13Updated 3 years ago
- Demos of Materialize, the operational data warehouse.☆52Updated 9 months ago
- Basic tutorial of using Apache Airflow☆36Updated 7 years ago
- Public source code for the Batch Processing with Apache Beam (Python) online course☆18Updated 5 years ago
- How to do data science with Optimus, Spark and Python.☆19Updated 6 years ago
- Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution.☆64Updated this week
- 💻 CLI for reporting events to Faros platform☆14Updated last month
- Scalable Feature Store☆54Updated 2 years ago
- Supporting content (slides and exercises) for the Pearson video series covering best practices for developing scalable applications with …☆53Updated 11 months ago
- Chatlytics is a data query and visualization platform for chat!☆13Updated 8 years ago
- dbd is a database prototyping tool that enables data analysts and engineers to quickly load and transform data in SQL databases.☆57Updated 3 years ago
- Streamlit example showing Scikit Learn & Pyspark ML over Healthcare data ! Its simple !!☆31Updated 5 years ago
- Code examples for the Introduction to Kubeflow course☆14Updated 4 years ago
- Beneath is a serverless real-time data platform ⚡️☆84Updated 3 years ago
- Documentation and resources for deploying JupyterHub on Hadoop☆19Updated 6 years ago
- ICIJ #Fincen Files in Neo4j☆40Updated 5 years ago
- Challenge Data Engineer☆25Updated 3 years ago
- Full stack data engineering tools and infrastructure set-up☆57Updated 4 years ago
- Big Data Demystified meetup and blog examples☆31Updated last year
- ☆30Updated last year
- pglineage is a tool to create data flow diagrams for PostgreSQL by analyzing SQL☆17Updated last year
- ☆10Updated 4 years ago
- Build and deploy a serverless data pipeline on AWS with no effort.☆111Updated 2 years ago
- ☆76Updated this week
- Data Catalog for Databases and Data Warehouses☆35Updated last year
- This repo provides a starting point for building applications using SingleStore, Redpanda (by Vectorized), and the Go language. SingleSto…☆23Updated last year
- ☆31Updated 4 years ago
- Notebooks for the ML Link Prediction Course☆14Updated 5 years ago
- A real-time tech course finder, created using Elasticsearch, Python, React+Redux, Docker, and Kubernetes.☆146Updated 3 weeks ago
- Simple samples for writing ETL transform scripts in Python☆24Updated 2 weeks ago