godatadriven / build-your-own-search-engine
This repository contains code to build an MVP search engine with google like interface.
☆15Updated 4 years ago
Alternatives and similar repositories for build-your-own-search-engine:
Users that are interested in build-your-own-search-engine are comparing it to the libraries listed below
- A few end to end examples that use data-describe☆16Updated last year
- This repository auto-configures an Apache Pinot and Superset cluster for analyzing IRA tweets from FiveThirtyEight.☆11Updated 4 years ago
- ☆10Updated 3 years ago
- data-mesh-demo☆13Updated 2 years ago
- ☆11Updated 11 months ago
- Events about the open source data stack☆13Updated 2 years ago
- A curated list of awesome open source tools and commercial products to catalog, version, and manage data 🚀☆30Updated 2 years ago
- A collection of my favorite tech-related blog posts.☆9Updated 2 weeks ago
- A set of tools to accelerate work in Jupyter notebooks.☆11Updated 4 years ago
- Build a directory full of files into a SQLite database☆12Updated last year
- This project is created to promote and advocate the use of FOSS machine learning.☆44Updated this week
- 🔍Your Data Quality Detector / Gain insight into your data and get it ready for use before you start working with it 💡📊🛠💎☆16Updated 2 years ago
- Documentation and resources for deploying JupyterHub on Hadoop☆18Updated 5 years ago
- There are many secrets management utilities, this one is ours … shhh☆11Updated last year
- 💻 CLI for reporting events to Faros platform☆14Updated 3 months ago
- Awesome Orchest projects, both official and submitted by the community.☆25Updated last year
- Awesome list of dataops products, open source and resources☆24Updated 2 years ago
- A curated list of awesome Databricks resources, including Spark☆16Updated 7 months ago
- Asynchronous tasks on the cloud☆21Updated last year
- ☆12Updated 5 years ago
- Chatlytics is a data query and visualization platform for chat!☆13Updated 7 years ago
- Flask based UI for displaying & segmenting a single database table☆15Updated 2 years ago
- Repository to allow collaboration between Cycle Labs Cloud community in support of the community.☆9Updated 3 years ago
- This is a real-life, high throughput streaming ELT data pipeline for ecommerce☆13Updated last year
- Async bulk data ingestion and querying in various document, graph and vector databases via their Python clients☆36Updated last year
- Machine Learning with BigQuery ML, published by Packt☆31Updated 2 years ago
- pycaret-git-actions☆15Updated 4 years ago
- bamboolib - template for creating your own binder notebook☆21Updated 3 years ago
- Ssebowa is free and open source library in Python that provides generative-ai models.☆14Updated last year
- ☆17Updated 2 years ago