xmlking / ml-experiments
machine learning playground
☆12Updated 8 years ago
Alternatives and similar repositories for ml-experiments:
Users that are interested in ml-experiments are comparing it to the libraries listed below
- Spooker is a dynamic framework for processing high volume data streams via processing pipelines☆29Updated 9 years ago
- Use Cascading Taps and Scalding DSL with Spark☆49Updated 8 years ago
- Starter project for building MemSQL Streamliner Pipelines☆32Updated 8 years ago
- Avro Schema Shredder is a REST API that enables storage of Avro Schemas in Apache Atlas. This API enables an organization to use Apache A…☆13Updated 8 years ago
- ☆12Updated 8 years ago
- Flink stream filtering examples☆19Updated 8 years ago
- Use cases built on SnappyData. Use cases contained here: 1. Ad Analytics 2. Streaming data ingestion from RabbitMQ.☆32Updated 2 years ago
- A simple Twitter-Streaming Application for Apache Flink☆21Updated 9 years ago
- Dependency and data pipeline management framework for Spark and Scala☆15Updated 8 years ago
- Cascading on Apache Flink®☆54Updated last year
- Complete Pipeline Training at Big Data Scala By the Bay☆71Updated 9 years ago
- Sample App. Amazon Product Descriptions Wordcloud. Spark Streaming, Algebird, Storehaus, Redis, Scala Scraper, OpenNLP, Play Framework, D…☆12Updated 9 years ago
- functionstest☆33Updated 8 years ago
- Sketching data structures for scala, including t-digest☆15Updated 3 years ago
- Embedded Kafka for testing and quick prototyping.☆14Updated 9 years ago
- An asynchronous Scala wrapper around the Apache Curator Framework.☆16Updated 9 years ago
- Supporting material (code, schemas etc) for Unified Log Processing (Manning Publications)☆98Updated 2 years ago
- Twitter Streaming API Example with Kafka Streams in Scala☆49Updated 8 years ago
- A framework for creating composable and pluggable data processing pipelines using Apache Spark, and running them on a cluster.☆47Updated 8 years ago
- something to help you spark☆65Updated 6 years ago
- Examples for Fast Data Processing with Spark☆59Updated 11 years ago
- Common components used across the datamountaineer kafka connect connectors☆21Updated 4 years ago
- A library for strong, schema based conversion between 'natural' JSON documents and Avro☆18Updated last year
- Scriptable scheduler for periodical Hadoop workflows☆22Updated 7 years ago
- Experiments with the GDELT dataset and Cassandra schemas.☆25Updated 9 years ago
- Interactive Audience Analytics with Spark and HyperLogLog☆55Updated 9 years ago
- Data-Driven Spark allows quick data exploration based on Apache Spark.☆28Updated 8 years ago
- Data Science with Apache Spark and Spark Notebook☆30Updated 7 years ago
- Lighthouse is a library for data lakes built on top of Apache Spark. It provides high-level APIs in Scala to streamline data pipelines an…☆61Updated 7 months ago
- phData Pulse application log aggregation and monitoring☆13Updated 5 years ago