twosigma / flint
A Time Series Library for Apache Spark
☆1,004 Updated 4 years ago
Related projects
Alternatives and complementary repositories for flint
- A library for time series analysis on Apache Spark ☆1,194 Updated 4 years ago
- MLeap: Deploy ML Pipelines to Production ☆1,504 Updated 4 months ago
- Sparkling Water provides H2O functionality inside Spark cluster ☆967 Updated this week
- Mirror of Apache Toree (Incubating) ☆740 Updated this week
- Sparkling Pandas ☆361 Updated last year
- A scalable machine learning library on Apache Spark ☆792 Updated 3 years ago
- Jupyter magics and kernels for working with remote Spark clusters ☆1,329 Updated 2 weeks ago
- ☆511 Updated 2 years ago
- ☆999 Updated this week
- Livy is an open source REST interface for interacting with Apache Spark from anywhere ☆1,009 Updated 2 years ago
- Simplifying robust end-to-end machine learning on Apache Spark. ☆469 Updated 7 years ago
- PySpark + Scikit-learn = Sparkit-learn ☆1,154 Updated 3 years ago
- Robustly estimate trend and periodicity in a timeseries. ☆373 Updated 6 years ago
- Turbodbc is a Python module to access relational databases via the Open Database Connectivity (ODBC) interface. The module complies with … ☆623 Updated last week
- The missing MatPlotLib for Scala + Spark ☆730 Updated 2 years ago
- The Internals of Spark Structured Streaming ☆415 Updated last year
- HandySpark - bringing pandas-like capabilities to Spark dataframes ☆188 Updated 5 years ago
- A Scala feature transformation library for data science and machine learning ☆467 Updated 2 months ago
- A tool for monitoring and tuning Spark jobs for efficiency. ☆357 Updated 2 years ago
- Spark Gotchas. A subjective compilation of the Apache Spark tips and tricks ☆359 Updated 7 years ago
- A Scala kernel for Jupyter ☆1,594 Updated this week
- Joblib Apache Spark Backend ☆242 Updated 2 months ago
- A columnar data container that can be compressed. ☆959 Updated 2 years ago
- python implementation of the parquet columnar file format. ☆784 Updated 2 weeks ago