scalable analysis of images and time series
☆822Jan 6, 2017Updated 9 years ago
Alternatives and similar repositories for thunder
Users that are interested in thunder are comparing it to the libraries listed below
Sorting:
- Unified interface for local and distributed ndarrays☆157Oct 13, 2018Updated 7 years ago
- PySpark + Scikit-learn = Sparkit-learn☆1,149Dec 31, 2020Updated 5 years ago
- Data Visualization Server☆958Nov 23, 2016Updated 9 years ago
- reproducible executable environments☆444Oct 27, 2017Updated 8 years ago
- Simplifying robust end-to-end machine learning on Apache Spark.☆475Apr 18, 2017Updated 8 years ago
- algorithms for mass univariate regression☆13Aug 21, 2018Updated 7 years ago
- Simplified interface for TensorFlow (mimicking Scikit Learn) for Deep Learning☆3,168Aug 30, 2021Updated 4 years ago
- Sparkling Pandas☆362Jul 6, 2023Updated 2 years ago
- A library for time series analysis on Apache Spark☆1,198Oct 13, 2020Updated 5 years ago
- Distributed Neural Networks for Spark☆611Jul 23, 2020Updated 5 years ago
- Sparkling Water provides H2O functionality inside Spark cluster☆977Nov 5, 2025Updated 4 months ago
- next generation slides for Jupyter Notebooks☆168Apr 17, 2024Updated last year
- Deprecated, please use https://github.com/jcrist/skein or https://github.com/dask/dask-yarn instead☆53Jul 3, 2018Updated 7 years ago
- Project SnappyData - memory optimized analytics database, based on Apache Spark™ and Apache Geode™. Stream, Transact, Analyze, Predict in…☆1,035Nov 21, 2022Updated 3 years ago
- Open source time series library for Python☆2,141Oct 24, 2023Updated 2 years ago
- Mirror of Apache Toree (Incubating)☆749Mar 9, 2026Updated last week
- Visualize streaming machine learning in Spark☆177Jun 29, 2017Updated 8 years ago
- Interactive notebooks for trying analyses and exploring datasets☆32Aug 10, 2015Updated 10 years ago
- Python helpers for building dashboards using Flask and React☆2,269Jun 2, 2025Updated 9 months ago
- friendly command-line tool for initializing python packages☆18Jun 24, 2020Updated 5 years ago
- Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning☆1,783Aug 16, 2021Updated 4 years ago
- Scala backend for IPython☆321Jan 9, 2019Updated 7 years ago
- A data science IDE for Python☆3,900Apr 16, 2018Updated 7 years ago
- Real time and offline time series analysis with Spark, Spark Streaming and Storm☆21Oct 20, 2020Updated 5 years ago
- ☆110Apr 17, 2017Updated 8 years ago
- Distributed Prometheus time series database☆1,459Updated this week
- Machine Learning Time-Series Platform☆683Jan 17, 2025Updated last year
- Interactive tools and developer experiences for Big Data on Google Cloud Platform.☆969Sep 2, 2022Updated 3 years ago
- A probabilistic programming language in TensorFlow. Deep generative models, variational inference.☆4,842Mar 18, 2024Updated 2 years ago
- Feather: fast, interoperable binary data frame storage for Python, R, and more powered by Apache Arrow☆2,753Dec 8, 2025Updated 3 months ago
- Distributed Deep Learning on Spark☆403Oct 8, 2016Updated 9 years ago
- Partitioned storage system based on blosc. **No longer actively maintained.**☆156Nov 21, 2016Updated 9 years ago
- Enabling queries on compressed data.☆282Dec 16, 2023Updated 2 years ago
- Parallel computing with task scheduling☆13,765Mar 12, 2026Updated last week
- Interactive and Reactive Data Science using Scala and Spark.☆3,150May 16, 2023Updated 2 years ago
- Z-Brain Viewer and Analysis Scripts for MAP-Mapping☆25Apr 11, 2025Updated 11 months ago
- Spyke Viewer is a multi-platform GUI application for navigating, analyzing and visualizing electrophysiological datasets.☆24Mar 7, 2016Updated 10 years ago
- REST job server for Apache Spark☆2,845Mar 3, 2026Updated 2 weeks ago
- Quickly and accurately render even the largest data.☆3,519Feb 19, 2026Updated last month