leerssej / SEDESchema
A Python script for generating an Entity Relationship Diagram for the Stack Exchange Data Explorer Schema
☆57 Updated 6 years ago
Alternatives and similar repositories for SEDESchema
Users who are interested in SEDESchema are comparing it to the libraries listed below:
- A real-time tech course finder, created using Elasticsearch, Python, React+Redux, Docker, and Kubernetes. ☆146 Updated last week
- Databases: Concepts, commands, codes, interview questions and more... ☆57 Updated 3 years ago
- Build a Search Engine with Python + Elasticsearch ☆95 Updated 2 years ago
- ∞ Priceloop Engineering Conventions for Scala, Python, Git Workflow etc ☆100 Updated 3 years ago
- Python Algorithm Visualization ☆48 Updated 8 years ago
- How to build an awesome data engineering team ☆101 Updated 6 years ago
- This project is created to promote and advocate the use of FOSS machine learning. ☆47 Updated 7 months ago
- A SQL implementation of an ancient handwriting recognition algorithm. ☆80 Updated 6 years ago
- Insight Data Engineering project: A platform built in HDFS, Spark and Airflow to help you to find social influencers from GitHub Net… ☆16 Updated last year
- Data pipelines from re-usable components ☆107 Updated last month
- Use Kafka and Apache Spark streaming to perform click stream analytics ☆76 Updated 5 years ago
- Repo for building docker based airflow image. Containers support multiple features like writing logs to local or S3 folder and Initializi… ☆32 Updated 6 years ago
- Using Debezium to capture data changes from databases and populate these as historic evolution and table replication in Snowflake ☆24 Updated 2 years ago
- Resources for tackling record linkage / deduplication / data matching problems ☆125 Updated last year
- List of anti patterns in SQL ☆198 Updated 8 years ago
- ☆128 Updated 5 years ago
- Basic tutorial of using Apache Airflow ☆36 Updated 7 years ago
- Simple alert system implemented in Kafka and Python ☆95 Updated 7 years ago
- Python scripts to import StackExchange data dump into Postgres DB. ☆89 Updated 3 years ago
- Sharing interesting and noteworthy Data Engineering content ☆70 Updated 9 years ago
- A LSM-Tree key/value database in Python. ☆24 Updated last year
- Supporting content (slides and exercises) for the Pearson video series covering best practices for developing scalable applications with … ☆53 Updated 11 months ago
- Prefix Search as a Service, built on top of a prefix hash tree using Node.js, Express, Redis, and MongoDB ☆99 Updated 7 years ago
- PySpark phonetic and string matching algorithms ☆39 Updated last year
- Data engineering interviews Q&A for data community by data community ☆65 Updated 5 years ago
- Airflow basics tutorial ☆396 Updated 4 years ago
- 🐍💨 Airflow tutorial for PyCon 2019 ☆87 Updated 3 years ago
- Public source code for the Batch Processing with Apache Beam (Python) online course ☆18 Updated 5 years ago
- ☆19 Updated 7 years ago
- Random dataframe and database table generator ☆311 Updated 4 years ago