elbaulp / DPASF
My MSc on Data Science final project. This is a library for Data Pre-processing Algorithms for Streaming in Flink (DPASF)
☆18Updated 5 years ago
Related projects ⓘ
Alternatives and complementary repositories for DPASF
- Anomaly Detection model uses Spark for training and Spark Streaming for testing☆66Updated 8 years ago
- Flink stream filtering examples☆19Updated 8 years ago
- ☆11Updated 8 years ago
- Spark package to "plug" holes in data using SQL based rules ⚡️ 🔌☆28Updated 4 years ago
- machine learning playground☆12Updated 7 years ago
- Avro Schema Shredder is a REST API that enables storage of Avro Schemas in Apache Atlas. This API enables an organization to use Apache A…☆13Updated 7 years ago
- Data Exploration Using Spark 2.0☆14Updated 6 years ago
- type-class based data cleansing library for Apache Spark SQL☆79Updated 5 years ago
- A bridge to Apache Atlas for provenance metadata created in course of using Apache NiFi☆15Updated last year
- Featureselection methods as Spark MLlib Pipelines☆30Updated 6 years ago
- A Spark datasource for the HadoopCryptoLedger library☆13Updated 2 years ago
- Flink Examples☆39Updated 8 years ago
- Real-time query spark and visualise it as graph.☆24Updated 7 years ago
- Real-time time series prediction library with standalone server☆36Updated 3 years ago
- PySpark for ETL jobs including lineage to Apache Atlas in one script via code inspection☆18Updated 7 years ago
- ☆14Updated 8 years ago
- Spark structured streaming with Kafka data source and writing to Cassandra☆64Updated 4 years ago
- Schema Registry integration for Apache Spark☆39Updated 2 years ago
- Bullet is a streaming query engine that can be plugged into any singular data stream using a Stream Processing framework like Apache Stor…☆41Updated last year
- Data Sketches for Apache Spark☆21Updated last year
- Parameter Server implementation in Apache Flink.☆14Updated 6 years ago
- Building blocks and patterns for building data prep transformations and feature engineering in Spark.☆16Updated 8 years ago
- A NiFi client library for JVM languages☆13Updated 8 years ago
- Convert a CSV fle to ORCFile☆26Updated 5 years ago
- ☆41Updated 7 years ago
- Detecting outliers in a dataset using Spark☆41Updated 8 years ago
- An Apache Spark app for making data movement between Apache Hive and Apache Phoenix/HBase☆14Updated 8 years ago
- Labs and data files for a full-day Spark workshop☆24Updated last year
- This lab teaches you how to create a realtime dashboard of stock prices using Hortonworks Data Platform and NiFi☆24Updated 8 years ago
- ☆18Updated 5 years ago