Sqooba / snorkelLinks
Snorkel - Bootstrap your Data Science
☆24Updated 7 years ago
Alternatives and similar repositories for snorkel
Users that are interested in snorkel are comparing it to the libraries listed below
Sorting:
- Streaming tweets with spark, language detection & sentiment analysis, dashboard with Kibana☆104Updated 9 years ago
- ☆41Updated 8 years ago
- A python tool to manage developing and testing with lots of microservices☆59Updated 2 years ago
- Coding exercises for Apache Spark☆104Updated 10 years ago
- Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python)☆74Updated last year
- Fusion demo app searching open-source project data from the Apache Software Foundation☆43Updated 6 years ago
- Lighthouse is a library for data lakes built on top of Apache Spark. It provides high-level APIs in Scala to streamline data pipelines an…☆62Updated last year
- Oracle Data Science Bootcamp 2014☆25Updated 10 years ago
- Named Entity Extraction on Twitter Stream using Apache Spark Streaming and Stanford CoreNLP☆15Updated 8 years ago
- Print an Elasticsearch inverted index as a CSV table or JSON object.☆11Updated last year
- machine learning playground☆12Updated 8 years ago
- Apache NiFi NLP Processor☆18Updated last year
- Big Data Science Swiss Army Knife - http://www.tuktu.io --☆61Updated 7 years ago
- Elasticsearch entity resolution plugin based on Duke☆209Updated 5 years ago
- Schedoscope is a scheduling framework for painfree agile development, testing, (re)loading, and monitoring of your datahub, lake, or what…☆97Updated 5 years ago
- functionstest☆33Updated 8 years ago
- Docker image for apache zeppelin☆38Updated 8 years ago
- Generates more or less realistic log data for testing simple aggregation queries.☆260Updated last year
- DEBS 2015 - Realtime Analytics Patterns with WSO2 CEP, Siddhi & Apache Storm☆16Updated 2 years ago
- Some notebook examples related to Apache Spark, IPython / Jupyter, Zeppelin☆52Updated 9 years ago
- Serverless proxy for Spark cluster☆325Updated 4 years ago
- Demonstrates NiFi template deployment and configuration via a REST API☆70Updated 8 years ago
- Scala SDK for working with Snowplow enriched events in Spark, AWS Lambda, Flink et al.☆21Updated 10 months ago
- Complete Pipeline Training at Big Data Scala By the Bay☆71Updated 9 years ago
- Combines Apache OpenNLP and Apache Tika and provides facilities for automatically deriving sentiment from text.☆34Updated 2 years ago
- A demo explaining how to use Zeppelin notebook to access Apache Cassandra data via Apache Spark or CQL language☆17Updated 4 years ago
- ☆107Updated 2 years ago
- NER tagger for English, Spanish, Dutch, Italian and German and French.☆35Updated 9 years ago
- Docker compose files for various kafka stacks☆32Updated 7 years ago
- Tutorial on parsing Enron email to Avro and then explore the email set using Spark.☆52Updated last year