intuit / thrive
Thrive is an ETL framework that runs single-row transformations on HDFS data and makes the data available in relational databases (Hive and Vertica).
☆10Updated 8 years ago
Alternatives and similar repositories for thrive
Users who are interested in thrive are comparing it to the libraries listed below.
- A guide on how to set up Jupyter with Pyspark painlessly on AWS EC2 clusters, with S3 I/O support☆260Updated 8 years ago
- BayesDB on SQLite. A Bayesian database table for querying the probable implications of data as easily as SQL databases query the data its…☆939Updated 2 years ago
- Observations from Ian on successfully delivering data science products☆543Updated 4 years ago
- A Python implementation of Douglas Hofstadter formal systems, from his book "Gödel, Escher, Bach"☆626Updated 4 years ago
- Jari's collection of interesting papers.☆496Updated 3 weeks ago
- ☆263Updated 6 years ago
- VM based deployment for prototyping Big Data tools on Amazon Web Services☆129Updated 5 years ago
- ☆21Updated 8 years ago
- Content for architecting a data science platform for products using Luigi, Spark & Flask.☆163Updated 6 years ago
- MacroBase: A Search Engine for Fast Data☆671Updated 3 years ago
- The Berkeley Document Summarizer is a learning-based, single-document summarization system that extracts source document content, exploit…☆745Updated 6 years ago
- Some notes on things I find interesting and important.☆2,101Updated last month
- ☆116Updated 9 months ago
- ☆461Updated 2 years ago
- ☆61Updated 7 years ago
- Some talks I think my friends would be interested in from pycon2016.☆16Updated 9 years ago
- Notes for the courses in the Machine Learning Specialization created by the University of Washington on Coursera☆45Updated 5 years ago
- Simplifying robust end-to-end machine learning on Apache Spark.☆475Updated 8 years ago
- This is a repo documenting the best practices in PySpark.☆464Updated 3 years ago
- A scalable machine learning library on Apache Spark☆796Updated 4 years ago
- Foundations of Machine Learning☆340Updated last year
- Helping students assess course difficulty and workload.☆36Updated 8 years ago
- Standard evaluations for binary classifiers so you don't have to☆315Updated 7 years ago
- A Bayesian database table for querying the probable implications of data as easily as SQL databases query the data itself. New implementa…☆888Updated 10 years ago
- Information for setting up for the BerkeleyX Spark Intro MOOC, and lab assignments for the course☆346Updated 4 years ago
- Hacker News: Crunching the Numbers☆73Updated 10 years ago
- Experiments towards neural network theorem proving☆789Updated 5 years ago
- Course materials for "Get Started with NLP in Python"☆62Updated 7 years ago
- Feature engineering and machine learning: together at last!☆25Updated 5 years ago
- A single handwritten digit classifier, using the MNIST dataset. Pure Numpy.☆787Updated 6 years ago