Salmon-Brain / dead-salmon-brainLinks
Apache Spark based framework for analysis A/B experiments
β15Updated 11 months ago
Alternatives and similar repositories for dead-salmon-brain
Users that are interested in dead-salmon-brain are comparing it to the libraries listed below
Sorting:
- A curated list of awesome open source tools and commercial products to catalog, version, and manage data πβ35Updated 3 years ago
- Open source task scheduler with dependency managementβ15Updated 7 years ago
- π» CLI for reporting events to Faros platformβ14Updated 2 months ago
- Using the Parquet file format (with Avro) to process data with Apache Flinkβ14Updated 10 years ago
- Bullet is a streaming query engine that can be plugged into any singular data stream using a Stream Processing framework like Apache Storβ¦β41Updated 2 years ago
- A component which takes nifi flow xml file as input and converts it into terraform script for creating/updating a flow on nifiβ28Updated 3 years ago
- Friendly ML feature storeβ45Updated 3 years ago
- An Example Dremio ARP driven connector that supports SQLLiteβ19Updated last year
- Preliminary Solr DQ / Data Quality experiments and prototype, and SolrJ wrapper utilitiesβ26Updated 9 months ago
- Text similarity based on Word2Vec vectors.β10Updated 8 years ago
- An implementation of the DatasourceV2 interface of Apache Sparkβ’ for writing Spark Datasets to Apache Druidβ’.β43Updated 2 weeks ago
- Data quality control tool built on spark and deequβ25Updated 7 months ago
- dbd is a database prototyping tool that enables data analysts and engineers to quickly load and transform data in SQL databases.β57Updated 3 years ago
- Data Catalog for Databases and Data Warehousesβ35Updated last year
- spark-drools tutorialsβ16Updated last year
- β31Updated 2 years ago
- pysh-db - The Data Science Toolkit (DSK)β13Updated 6 years ago
- β21Updated 2 years ago
- Lighthouse is a library for data lakes built on top of Apache Spark. It provides high-level APIs in Scala to streamline data pipelines anβ¦β62Updated last year
- Provide an easy way with Python to protect your data sources by searching its metadata. π‘οΈβ18Updated this week
- Code, Examples, Templates and Scripts for DataWorksSummit 2017 Sydney Talkβ17Updated 8 years ago
- Schema registry for CSV, TSV, JSON, AVRO and Parquet schema. Supports schema inference and GraphQL API.β112Updated 5 years ago
- β10Updated 3 years ago
- Collection of generic Apache Flink operatorsβ17Updated 8 years ago
- Angular JS Solr and Elasticsearch and OpenSearch Diagnostic Search Servicesβ27Updated last week
- An open-source framework that allows you to easily monitor your web applications using end-end browser tests.β14Updated 4 years ago
- β18Updated 3 years ago
- Basic framework utilities to quickly start writing production ready Apache Spark applicationsβ36Updated 10 months ago
- Library of Prefect tasks and utilities.β10Updated last year
- This project is created to promote and advocate the use of FOSS machine learning.β47Updated 5 months ago