godatadriven / build-your-own-search-engine
This repository contains code to build an MVP search engine with google like interface.
☆15Updated 4 years ago
Alternatives and similar repositories for build-your-own-search-engine:
Users that are interested in build-your-own-search-engine are comparing it to the libraries listed below
- DataHub on AWS demonstration resources☆10Updated 2 years ago
- Events about the open source data stack☆13Updated 3 years ago
- This repository auto-configures an Apache Pinot and Superset cluster for analyzing IRA tweets from FiveThirtyEight.☆11Updated 4 years ago
- A few end to end examples that use data-describe☆16Updated last year
- ☆10Updated 3 years ago
- duckdb-etl-framework☆10Updated 4 months ago
- Chatlytics is a data query and visualization platform for chat!☆13Updated 8 years ago
- data-mesh-demo☆13Updated 3 years ago
- Robotic Process Automation Projects, published by Packt☆34Updated 2 years ago
- My dot files in one place - extensively edited over time. Your mileage may vary☆2Updated 8 years ago
- A curated list of awesome open source tools and commercial products to catalog, version, and manage data 🚀☆32Updated 3 years ago
- Repository to allow collaboration between Cycle Labs Cloud community in support of the community.☆9Updated 3 years ago
- A starter project to create Arc jobs using the Jupyter Notebook interface☆22Updated 4 years ago
- Asynchronous tasks on the cloud☆21Updated last year
- ☆12Updated 5 years ago
- dbd is a database prototyping tool that enables data analysts and engineers to quickly load and transform data in SQL databases.☆57Updated 3 years ago
- This is a real-life, high throughput streaming ELT data pipeline for ecommerce☆13Updated last year
- 💻 CLI for reporting events to Faros platform☆14Updated 6 months ago
- 🔍Your Data Quality Detector / Gain insight into your data and get it ready for use before you start working with it 💡📊🛠💎☆16Updated 2 years ago
- Lightweight configuration and access to multiple databases in a single project☆38Updated last year
- Python implementation of Age-Partitioned Bloom Filter with S3 periodic backup support.☆11Updated 3 months ago
- Herd-UI is a search and discovery tool for business and technical users. Everyone in your organization can use Herd-UI to browse and unde…☆16Updated 2 years ago
- Git scrapers for scraping the fediverse☆16Updated this week
- Example Set up For DBT Cloud using Github Integrations☆11Updated 5 years ago
- How to do data science with Optimus, Spark and Python.☆19Updated 5 years ago
- DuckDB Extension for cryptographic hash functions and HMAC☆17Updated this week
- Intended for internal use: deploys all infrastructure required for Astronomer to run on GCP☆10Updated 8 months ago
- Plugin for Intake to read from SQL servers☆15Updated last year
- Connect DBVisualizer to Hortonwork HiveServer2☆9Updated 10 years ago
- ☆11Updated 4 months ago