mindfulmachines / ladydiLinks
Code Less, Build More. Clean, automated Feature Generation and Selection for Apache Spark!
☆13Updated 8 years ago
Alternatives and similar repositories for ladydi
Users that are interested in ladydi are comparing it to the libraries listed below
Sorting:
- Example code for building your own MemSQL Streamliner Pipelines☆23Updated 8 years ago
- Apache Hadoop HDFS Data Node Scheduler☆13Updated 9 years ago
- Spark package to "plug" holes in data using SQL based rules ⚡️ 🔌☆29Updated 5 years ago
- Cascading on Apache Flink®☆54Updated last year
- Embedded Kafka for testing and quick prototyping.☆14Updated 9 years ago
- Use Cascading Taps and Scalding DSL with Spark☆49Updated 8 years ago
- Kafka Connect Cassandra Connector. This project includes source/sink connectors for Cassandra to/from Kafka.☆78Updated 9 years ago
- Starter project for building MemSQL Streamliner Pipelines☆32Updated 8 years ago
- Interactive Audience Analytics with Spark and HyperLogLog☆55Updated 10 years ago
- Data-Driven Spark allows quick data exploration based on Apache Spark.☆29Updated 8 years ago
- Docker Image and Kubernetes Configurations for Spark 2.x☆41Updated 5 years ago
- This is an introduction of Apache Spark DataFrames.☆41Updated 10 years ago
- Bullet is a streaming query engine that can be plugged into any singular data stream using a Stream Processing framework like Apache Stor…☆41Updated 2 years ago
- Few things we've met during our etl project based on spark☆24Updated 7 years ago
- Advanced Analytics Engine for NoSQL Data☆403Updated 11 years ago
- Aerospike Spark Connector☆35Updated 8 years ago
- Schema registry for CSV, TSV, JSON, AVRO and Parquet schema. Supports schema inference and GraphQL API.☆112Updated 5 years ago
- A nozzle to spray a kafka topic at an HTTP endpoint. This project is deprecated and not maintained.☆49Updated 5 years ago
- A package full of linear algebra operators for Apache Spark MLlib's linalg package☆10Updated 10 years ago
- Apache Yarn cluster docker image☆35Updated 7 years ago
- Mirror of Apache Iota (Incubating)☆34Updated 8 years ago
- Experiments with the GDELT dataset and Cassandra schemas.☆25Updated 9 years ago
- A connector for SingleStore and Spark☆162Updated 3 weeks ago
- Elastic Sentiment Analysis (using Apache Mesos, Marathon and Apache Spark)☆35Updated 10 years ago
- A utility for generating Oozie workflows from a YAML definition☆49Updated 6 years ago
- functionstest☆33Updated 8 years ago
- Spark job for compacting avro files together☆12Updated 7 years ago
- Practical examples of using Apache Spark in several different use cases☆102Updated 9 years ago
- On demand presto cluster with mesos, marathon and docker.☆30Updated 7 years ago
- NuCypher for Kafka. Start building from this module (it fetches the appropriate branch from Kafka repository)☆18Updated 8 years ago