scalable analysis of images and time series
☆822Jan 6, 2017Updated 9 years ago
Alternatives and similar repositories for thunder
Users that are interested in thunder are comparing it to the libraries listed below
Sorting:
- Unified interface for local and distributed ndarrays☆157Oct 13, 2018Updated 7 years ago
- PySpark + Scikit-learn = Sparkit-learn☆1,151Dec 31, 2020Updated 5 years ago
- Data Visualization Server☆958Nov 23, 2016Updated 9 years ago
- reproducible executable environments☆444Oct 27, 2017Updated 8 years ago
- algorithms for mass univariate regression☆13Aug 21, 2018Updated 7 years ago
- Simplifying robust end-to-end machine learning on Apache Spark.☆475Apr 18, 2017Updated 8 years ago
- Simplified interface for TensorFlow (mimicking Scikit Learn) for Deep Learning☆3,170Aug 30, 2021Updated 4 years ago
- Distributed Neural Networks for Spark☆611Jul 23, 2020Updated 5 years ago
- Sparkling Pandas☆363Jul 6, 2023Updated 2 years ago
- A library for time series analysis on Apache Spark☆1,196Oct 13, 2020Updated 5 years ago
- Sparkling Water provides H2O functionality inside Spark cluster☆977Nov 5, 2025Updated 3 months ago
- next generation slides for Jupyter Notebooks☆168Apr 17, 2024Updated last year
- Deprecated, please use https://github.com/jcrist/skein or https://github.com/dask/dask-yarn instead☆53Jul 3, 2018Updated 7 years ago
- ☆110Apr 17, 2017Updated 8 years ago
- Mirror of Apache Toree (Incubating)☆749Feb 21, 2026Updated last week
- Project SnappyData - memory optimized analytics database, based on Apache Spark™ and Apache Geode™. Stream, Transact, Analyze, Predict in…☆1,037Nov 21, 2022Updated 3 years ago
- Python helpers for building dashboards using Flask and React☆2,270Jun 2, 2025Updated 8 months ago
- Open source time series library for Python☆2,140Oct 24, 2023Updated 2 years ago
- A data science IDE for Python☆3,901Apr 16, 2018Updated 7 years ago
- Interactive tools and developer experiences for Big Data on Google Cloud Platform.☆969Sep 2, 2022Updated 3 years ago
- Visualize streaming machine learning in Spark☆177Jun 29, 2017Updated 8 years ago
- Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning☆1,783Aug 16, 2021Updated 4 years ago
- Write reproducible reports in Markdown☆440Dec 21, 2018Updated 7 years ago
- Parallel computing with task scheduling☆13,746Feb 22, 2026Updated last week
- Distributed Prometheus time series database☆1,462Updated this week
- Interactive and Reactive Data Science using Scala and Spark.☆3,152May 16, 2023Updated 2 years ago
- Feather: fast, interoperable binary data frame storage for Python, R, and more powered by Apache Arrow☆2,753Dec 8, 2025Updated 2 months ago
- Interactive notebooks for trying analyses and exploring datasets☆32Aug 10, 2015Updated 10 years ago
- A probabilistic programming language in TensorFlow. Deep generative models, variational inference.☆4,845Mar 18, 2024Updated last year
- Partitioned storage system based on blosc. **No longer actively maintained.**☆156Nov 21, 2016Updated 9 years ago
- Quickly and accurately render even the largest data.☆3,514Feb 19, 2026Updated last week
- Beaker Extensions for Jupyter Notebook☆2,841Dec 4, 2023Updated 2 years ago
- A wrapper for libhdfs3 to interact with HDFS from Python☆137Feb 9, 2021Updated 5 years ago
- A python tutorial on bayesian modeling techniques (PyMC3)☆2,508Apr 29, 2017Updated 8 years ago
- the portable Python dataframe library☆6,417Updated this week
- NumPy and Pandas interface to Big Data☆3,197Sep 29, 2023Updated 2 years ago
- Scala backend for IPython☆320Jan 9, 2019Updated 7 years ago
- rastercube is a python library for big data analysis of georeferenced time series data (e.g. MODIS NDVI)☆19Jul 27, 2017Updated 8 years ago
- friendly command-line tool for initializing python packages☆18Jun 24, 2020Updated 5 years ago