Sqooba / snorkel
Snorkel - Bootstrap your Data Science
☆24 Updated 7 years ago
Alternatives and similar repositories for snorkel:
Users who are interested in snorkel are comparing it to the libraries listed below
- machine learning playground ☆12 Updated 8 years ago
- Sample custom Nifi processor to process tcpdump ☆18 Updated 9 years ago
- Avro Schema Shredder is a REST API that enables storage of Avro Schemas in Apache Atlas. This API enables an organization to use Apache A… ☆13 Updated 8 years ago
- ☆13 Updated last year
- phData Pulse application log aggregation and monitoring ☆13 Updated 5 years ago
- Apache Spark under Docker ☆9 Updated 8 years ago
- Chorus, now for Elasticsearch! ☆16 Updated 10 months ago
- Sandbox for Apache nifi ☆24 Updated 3 years ago
- Cascading on Apache Flink® ☆54 Updated last year
- Schedoscope is a scheduling framework for painfree agile development, testing, (re)loading, and monitoring of your datahub, lake, or what… ☆96 Updated 5 years ago
- Apache NiFi Custom Processor for working with Stanford CoreNLP for Sentiment Analysis in Java 8 ☆11 Updated 6 years ago
- Extract statistics from Wikipedia Dump files. ☆26 Updated 3 years ago
- Fusion demo app searching open-source project data from the Apache Software Foundation ☆42 Updated 6 years ago
- Scala port of the word2vec toolkit. ☆11 Updated 8 years ago
- Automates Spark standalone cluster tasks with Puppet and Fabric. ☆43 Updated 10 years ago
- NiFi provenance reporting tasks ☆14 Updated last year
- Groovy client library for Apache Ambari's REST API ☆20 Updated 3 years ago
- Utilities and examples to assist in working with PySpark and Cassandra. ☆36 Updated 10 years ago
- Scala utilities for teaching computational linguistics and prototyping algorithms. ☆42 Updated 12 years ago
- ☆15 Updated 7 years ago
- Data pipeline automation tool ☆26 Updated last year
- Preliminary Solr DQ / Data Quality experiments and prototype, and SolrJ wrapper utilities ☆26 Updated 3 months ago
- An analysis of adverse drug event data using Hadoop, R, and Gephi ☆44 Updated 9 years ago
- Twitter Streaming API Example with Kafka Streams in Scala ☆49 Updated 8 years ago
- Common components used across the datamountaineer kafka connect connectors ☆21 Updated 4 years ago
- Examples for Fast Data Processing with Spark ☆59 Updated 11 years ago
- Scriptable scheduler for periodical Hadoop workflows ☆22 Updated 7 years ago
- Use cases built on SnappyData. Use cases contained here: 1. Ad Analytics 2. Streaming data ingestion from RabbitMQ. ☆32 Updated 2 years ago
- ☆41 Updated 7 years ago
- A collection of datasets and databases ☆24 Updated 6 years ago