sirrice / dbtruckLinks
just put my data in a database!
☆39Updated 9 years ago
Alternatives and similar repositories for dbtruck
Users that are interested in dbtruck are comparing it to the libraries listed below
Sorting:
- ☆92Updated 9 years ago
- A prototype of Hive UDFs/UDTFs that execute nested SQL queries within rows.☆54Updated 9 years ago
- Analyze the structure and dynamics of an open source project's developer community, using graph algorithms, etc.☆58Updated 4 years ago
- Functional, Typesafe, Declarative Data Pipelines☆139Updated 7 years ago
- An open-source, vendor-neutral data context service.☆160Updated 7 years ago
- ☆110Updated 8 years ago
- Looking at big data? Add a little salt.☆59Updated 2 years ago
- A RESTful web service that runs microtasks across multiple crowds, provides quality control techniques, and is easily extensible.☆52Updated 8 years ago
- A Cascading Workflow Visualizer☆83Updated 2 years ago
- Myria is a scalable Analytics-as-a-Service platform based on relational algebra.☆116Updated 3 years ago
- Pig on Apache Spark☆82Updated 10 years ago
- HiScore makes creating sophisticated scores easy☆216Updated 4 years ago
- Standard evaluations for binary classifiers so you don't have to☆315Updated 6 years ago
- One way of using Plot.ly on Zeppelin notebooks☆28Updated 9 years ago
- ☆84Updated 7 years ago
- A platform for real-time streaming search☆102Updated 9 years ago
- A Python wrapper over the GraphGen system☆37Updated 7 years ago
- Generates more or less realistic log data for testing simple aggregation queries.☆259Updated last year
- Lossy Counting and Sticky Sampling implementation for efficient frequency counts on data streams.☆63Updated 9 years ago
- A Python library for creating fast, repeatable and self-documenting data analysis pipelines.☆238Updated this week
- Live-updating Spark UI built with Meteor☆189Updated 4 years ago
- Distributed decision tree ensemble learning in Scala☆391Updated 6 years ago
- Simplifying robust end-to-end machine learning on Apache Spark.☆473Updated 8 years ago
- Distributed DataFrame: Productivity = Power x Simplicity For Scientists & Engineers, on any Data Engine☆167Updated 4 years ago
- MADlib has moved to Apache MADlib (incubating). Please send pull requests to the Apache repository.☆507Updated 7 years ago
- MLeap allows for easily putting Spark ML pipelines into production☆78Updated 8 years ago
- Serving system for batch generated data sets☆176Updated 8 years ago
- zenvisage's foundational framework☆69Updated 2 years ago
- Timberlake is a Job Tracker for Hadoop.☆177Updated 5 years ago
- S3 backed ContentsManager for jupyter notebooks☆14Updated 9 years ago