Salmon-Brain / dead-salmon-brainLinks
Apache Spark based framework for analysis A/B experiments
☆15Updated last year
Alternatives and similar repositories for dead-salmon-brain
Users that are interested in dead-salmon-brain are comparing it to the libraries listed below
Sorting:
- dbd is a database prototyping tool that enables data analysts and engineers to quickly load and transform data in SQL databases.☆57Updated 3 years ago
- This project is created to promote and advocate the use of FOSS machine learning.☆47Updated 8 months ago
- 💻 CLI for reporting events to Faros platform☆14Updated last month
- Bullet is a streaming query engine that can be plugged into any singular data stream using a Stream Processing framework like Apache Stor…☆42Updated 3 years ago
- Chatlytics is a data query and visualization platform for chat!☆13Updated 8 years ago
- Data quality control tool built on spark and deequ☆25Updated last month
- Discover the simplicity and strength of Duckdb, dbt, and Iceberg in this project. Create an efficient, versatile data analytics solution …☆39Updated 2 years ago
- hooqu is a library built on top of Pandas-like Dataframes for defining "unit tests for data". This is a spiritual port of Apache Deequ to…☆29Updated last year
- A curated list of awesome open source tools and commercial products to catalog, version, and manage data 🚀☆39Updated 3 years ago
- Lighthouse is a library for data lakes built on top of Apache Spark. It provides high-level APIs in Scala to streamline data pipelines an…☆62Updated last year
- Marquez Web UI☆21Updated 5 years ago
- Schema registry for CSV, TSV, JSON, AVRO and Parquet schema. Supports schema inference and GraphQL API.☆114Updated 5 years ago
- Open source task scheduler with dependency management☆15Updated 7 years ago
- Glue is an enterprise data model for the buy side, tailored for Wealth and Asset Managers and covering key entities such as Party, Busine…☆23Updated 2 years ago
- Automation, Data Mash, Message Learning, AI Ops, Quantum Ops☆13Updated this week
- Data Catalog for Databases and Data Warehouses☆35Updated last year
- An implementation of the DatasourceV2 interface of Apache Spark™ for writing Spark Datasets to Apache Druid™.☆43Updated 3 weeks ago
- This repository contains code to build an MVP search engine with google like interface.☆15Updated 4 months ago
- Streaming data changes to a Data Lake with Debezium and Delta Lake pipeline☆76Updated 2 years ago
- Common API for all "second gen" AutoML APIs: Auger.AI, Google Cloud AutoML and Azure AutoML☆41Updated last year
- Library of Prefect tasks and utilities.☆10Updated last year
- ☆20Updated 3 years ago
- ☆18Updated 3 years ago
- pysh-db - The Data Science Toolkit (DSK)☆13Updated 7 years ago
- Herd-MDL, a turnkey managed data lake in the cloud. See https://finraos.github.io/herd-mdl/ for more information.☆15Updated last year
- Example for simple Apache Arrow Flight service with Apache Spark and TensorFlow clients☆37Updated 4 years ago
- ☆30Updated last week
- Provide an easy way with Python to protect your data sources by searching its metadata. 🛡️☆18Updated last month
- Using the Parquet file format (with Avro) to process data with Apache Flink☆14Updated 10 years ago
- Friendly ML feature store☆45Updated 3 years ago