Breaka84 / Spooq
☆9Updated 9 months ago
Alternatives and similar repositories for Spooq
Users that are interested in Spooq are comparing it to the libraries listed below
Sorting:
- Presentation and notebook sources for Scala IO and Scale by the Bay 2018 Spark and Frameless talk☆11Updated 6 years ago
- Some Avro operations in Scala☆10Updated 5 months ago
- Extensible streaming ingestion pipeline on top of Apache Spark☆44Updated last year
- Connect DBVisualizer to Hortonwork HiveServer2☆9Updated 10 years ago
- Profiles the data, validates the schema and runs data quality checks and produces a report☆20Updated 5 years ago
- Software tool to manage your notes, scripts, code examples, configs,... to publish them as gists or snippets☆39Updated last week
- Data quality control tool built on spark and deequ☆24Updated 2 months ago
- Hadoop/Hive/Spark container to perform CI tests☆11Updated 4 years ago
- Ranking Evaluation and Batch Processing. Kolibri provides a framework for concurrent multi-node (synced via storage) executions, written …☆28Updated last year
- Skeleton project for Apache Airflow training participants to work on.☆16Updated 4 years ago
- ☆14Updated 7 years ago
- Shunting Yard is a real-time data replication tool that copies data between Hive Metastores.☆20Updated 3 years ago
- Testing Scala code with scalatest☆12Updated 2 years ago
- Lambdas covering supporter operations, mostly in life operations☆11Updated this week
- A library that brings useful functions from various modern database management systems to Apache Spark☆58Updated last year
- Standalone alternatives to Kafka Connect Connectors☆43Updated this week
- machine learning playground☆12Updated 8 years ago
- UI to run SQL on Delta Lake tables and visualize the variations of the result among tables versions☆12Updated 5 years ago
- Sample processing code using Spark 2.1+ and Scala☆52Updated 4 years ago
- A sbt plugin for creating NiFi Archive bundles to support the classloader isolation model of NiFi.☆10Updated 2 years ago
- Nested Data (JSON/AVRO/XML) Parsing and Flattening in Spark☆16Updated last year
- ☆10Updated 2 years ago
- Common components used across the datamountaineer kafka connect connectors☆21Updated 4 years ago
- Using the Parquet file format (with Avro) to process data with Apache Flink☆14Updated 9 years ago
- Atomic Scala Book Solutions - for Beginners and first time Functional Programmers☆11Updated 5 years ago
- Scala API for Apache Spark SQL high-order functions☆14Updated last year
- Utilities for writing tests that use Apache Spark.☆24Updated 6 years ago
- Basic framework utilities to quickly start writing production ready Apache Spark applications☆36Updated 5 months ago
- Lighthouse is a library for data lakes built on top of Apache Spark. It provides high-level APIs in Scala to streamline data pipelines an…☆61Updated 8 months ago
- ☆23Updated 4 months ago