rayyan17 / jobAnalytics_and_search
JobAnalytics system consumes data from multiple sources and provides valuable information to both job hunters and recruiters.
☆31Updated 2 years ago
Alternatives and similar repositories for jobAnalytics_and_search:
Users that are interested in jobAnalytics_and_search are comparing it to the libraries listed below
- Blog post on ETL pipelines with Airflow☆23Updated 4 years ago
- The goal of this project is to offer an AWS EMR template using Spot Fleet and On-Demand Instances that you can use quickly. Just focus on…☆26Updated 2 years ago
- Realtime social media data analytics with Apache Spark, Python, Kafka, Pandas, etc☆51Updated 8 years ago
- Learning from multiple companies in Silicon Valley. Netflix, Facebook, Google, Startups☆16Updated 6 years ago
- Data Engineering pipeline hosted entirely in the AWS ecosystem utilizing DocumentDB as the database☆13Updated 3 years ago
- Udacity Data Pipeline Exercises☆15Updated 4 years ago
- pyspark dataframe made easy☆16Updated 3 years ago
- Full stack data engineering tools and infrastructure set-up☆49Updated 4 years ago
- Various data stream/batch process demo with Apache Scala Spark 🚀☆11Updated 4 years ago
- A real-time event pipeline around Kafka Ecosystem for Chicago Transit Authority.☆29Updated last year
- Apache Spark Guide☆30Updated 3 years ago
- Spark Application for analysis of Apache Access logs and detect anamolies! Along with Medium Article.☆20Updated 6 years ago
- ☆12Updated 6 years ago
- A repo to track data engineering projects☆13Updated 2 years ago
- Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validatio…☆53Updated last year
- Data engineering interviews Q&A for data community by data community☆64Updated 4 years ago
- Ingest tweets with Kafka. Use Spark to track popular hashtags and trendsetters for each hashtag☆29Updated 8 years ago
- PySpark-ETL☆23Updated 5 years ago
- A Scalable Data Cleaning Library for PySpark.☆26Updated 5 years ago
- PipeRider dbt workshop for DataTalksClub DE Zoomcamp☆17Updated last year
- A full data warehouse infrastructure with ETL pipelines running inside docker on Apache Airflow for data orchestration, AWS Redshift for …☆134Updated 4 years ago
- A curated list of awesome Databricks resources, including Spark☆16Updated 7 months ago
- A code-based tutorial for production level data streaming with PySpark plus Optimus for data cleaning, Confluent Kafka, & Apache Drill u…☆26Updated 5 years ago
- Containerized end-to-end analytics of Spotify data using Python, dbt, Postgres, and Metabase☆124Updated 2 years ago
- Implementation of Spark code in Jupyter notebook. Topics include: RDDs and DataFrame, exploratory data analysis (EDA), handling multiple …☆29Updated 4 years ago
- Streamlit example showing Scikit Learn & Pyspark ML over Healthcare data ! Its simple !!☆30Updated 4 years ago
- PySpark phonetic and string matching algorithms☆39Updated last year
- ☆18Updated 3 years ago
- A modern ELT demo using airbyte, dbt, snowflake and dagster☆26Updated 2 years ago
- Insight Data Engineering Project☆15Updated 3 years ago