A Time Series Library for Apache Spark
☆1,022Jul 3, 2020Updated 5 years ago
Alternatives and similar repositories for flint
Users that are interested in flint are comparing it to the libraries listed below
Sorting:
- A library for time series analysis on Apache Spark☆1,196Oct 13, 2020Updated 5 years ago
- MLeap: Deploy ML Pipelines to Production☆1,535Jan 12, 2026Updated last month
- The missing MatPlotLib for Scala + Spark☆731Jan 30, 2022Updated 4 years ago
- Beaker Extensions for Jupyter Notebook☆2,841Dec 4, 2023Updated 2 years ago
- Expressive types for Spark.☆896Feb 22, 2026Updated last week
- Breeze is/was a numerical processing library for Scala.☆3,457Oct 4, 2025Updated 4 months ago
- Mirror of Apache Toree (Incubating)☆749Feb 21, 2026Updated last week
- Base classes to use when writing tests with Spark☆1,549Dec 22, 2025Updated 2 months ago
- High performance datastore for time series and tick data☆3,088Apr 8, 2024Updated last year
- Sparkling Water provides H2O functionality inside Spark cluster☆977Nov 5, 2025Updated 3 months ago
- A scalable machine learning library on Apache Spark☆796Aug 30, 2021Updated 4 years ago
- Livy is an open source REST interface for interacting with Apache Spark from anywhere☆1,007Oct 5, 2022Updated 3 years ago
- Simple and Distributed Machine Learning☆5,200Feb 14, 2026Updated 2 weeks ago
- REST job server for Apache Spark☆2,842Jul 8, 2025Updated 7 months ago
- Distributed Prometheus time series database☆1,462Updated this week
- Interactive and Reactive Data Science using Scala and Spark.☆3,152May 16, 2023Updated 2 years ago
- GraphFrames is a package for Apache Spark which provides DataFrame-based Graphs☆1,135Feb 6, 2026Updated 3 weeks ago
- Project SnappyData - memory optimized analytics database, based on Apache Spark™ and Apache Geode™. Stream, Transact, Analyze, Predict in…☆1,037Nov 21, 2022Updated 3 years ago
- Simplifying robust end-to-end machine learning on Apache Spark.☆475Apr 18, 2017Updated 8 years ago
- TransmogrifAI (pronounced trăns-mŏgˈrə-fī) is an AutoML library for building modular, reusable, strongly typed machine learning workflows…☆2,272Sep 29, 2023Updated 2 years ago
- A Scala API for Apache Beam and Google Cloud Dataflow.☆2,615Feb 12, 2026Updated 2 weeks ago
- TensorFlowOnSpark brings TensorFlow programs to Apache Spark clusters.☆3,858Jul 10, 2023Updated 2 years ago
- SADDLE: Scala Data Library☆508Mar 21, 2020Updated 5 years ago
- Real Time Analytics and Data Pipelines based on Spark Streaming☆531Oct 24, 2019Updated 6 years ago
- Jupyter magics and kernels for working with remote Spark clusters☆1,362Sep 9, 2025Updated 5 months ago
- Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, DeepSeek, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V,…☆8,694Jan 28, 2026Updated last month
- Automatic extraction of relevant features from time series:☆9,127Nov 15, 2025Updated 3 months ago
- Spark MLlib code optimized to efficiently support sparse data☆51Dec 22, 2016Updated 9 years ago
- A Scala feature transformation library for data science and machine learning☆474Feb 7, 2025Updated last year
- Open Source ML Model Versioning, Metadata, and Experiment Management☆1,744Jul 23, 2024Updated last year
- Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet f…☆1,879Jan 2, 2026Updated last month
- Open source time series library for Python☆2,140Oct 24, 2023Updated 2 years ago
- An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Tr…☆8,602Feb 21, 2026Updated last week
- Tool for producing high quality forecasts for time series data that has multiple seasonality with linear or non-linear growth.☆20,039Feb 20, 2026Updated last week
- Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.☆3,583Feb 17, 2026Updated last week
- Examples for High Performance Spark☆526Updated this week
- Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python)☆74Nov 9, 2023Updated 2 years ago
- A Scala kernel for Jupyter☆1,620Feb 9, 2026Updated 2 weeks ago
- Feather: fast, interoperable binary data frame storage for Python, R, and more powered by Apache Arrow☆2,753Dec 8, 2025Updated 2 months ago