Salmon-Brain / dead-salmon-brainLinks
Apache Spark based framework for analysis A/B experiments
☆15Updated last year
Alternatives and similar repositories for dead-salmon-brain
Users that are interested in dead-salmon-brain are comparing it to the libraries listed below
Sorting:
- Data quality control tool built on spark and deequ☆25Updated 3 weeks ago
- TensorFlow Processor for Spring Cloud Dataflow☆24Updated 8 years ago
- dbd is a database prototyping tool that enables data analysts and engineers to quickly load and transform data in SQL databases.☆57Updated 3 years ago
- This project is created to promote and advocate the use of FOSS machine learning.☆47Updated 7 months ago
- This repository contains code to build an MVP search engine with google like interface.☆15Updated 3 months ago
- Nested Data (JSON/AVRO/XML) Parsing and Flattening in Spark☆16Updated last year
- An open-source framework that allows you to easily monitor your web applications using end-end browser tests.☆15Updated 4 years ago
- A curated list of awesome open source tools and commercial products to catalog, version, and manage data 🚀☆38Updated 3 years ago
- Extract, Transform, Load (ETL) refers to a process in database usage and especially in data warehousing. This repository contains a s…☆21Updated 8 years ago
- Text similarity based on Word2Vec vectors.☆10Updated 8 years ago
- Data Catalog for Databases and Data Warehouses☆35Updated last year
- ☆10Updated 3 years ago
- 💻 CLI for reporting events to Faros platform☆14Updated last week
- Lighthouse is a library for data lakes built on top of Apache Spark. It provides high-level APIs in Scala to streamline data pipelines an…☆62Updated last year
- Provide an easy way with Python to protect your data sources by searching its metadata. 🛡️☆18Updated last week
- Dremio Flight connector. Access Dremio using Arrow flight☆39Updated 5 years ago
- A better logging service☆101Updated 6 months ago
- Streaming data changes to a Data Lake with Debezium and Delta Lake pipeline☆76Updated 2 years ago
- Code, Examples, Templates and Scripts for DataWorksSummit 2017 Sydney Talk☆17Updated 8 years ago
- Friendly ML feature store☆45Updated 3 years ago
- A Spark-based data comparison tool at scale which facilitates software development engineers to compare a plethora of pair combinations o…☆52Updated 5 months ago
- Bullet is a streaming query engine that can be plugged into any singular data stream using a Stream Processing framework like Apache Stor…☆42Updated 2 years ago
- Hands-on workshop with Iceberg, Redpanda, Debezium and Kafka-Connect☆13Updated last year
- Support for generating modern platforms dynamically with services such as Kafka, Spark, Streamsets, HDFS, ....☆77Updated last week
- Flink stream filtering examples☆19Updated 9 years ago
- Big Data Demystified meetup and blog examples☆31Updated last year
- Service for automatically managing and cleaning up unreferenced data☆48Updated 4 months ago
- Getting Great Expectations setup to run on DataBricks with Spark Dataframes.☆13Updated 3 years ago
- ☆21Updated 9 years ago
- Library of Prefect tasks and utilities.☆10Updated last year