sirrice / dbtruckLinks
just put my data in a database!
☆39Updated 9 years ago
Alternatives and similar repositories for dbtruck
Users that are interested in dbtruck are comparing it to the libraries listed below
Sorting:
- ☆92Updated 10 years ago
- A prototype of Hive UDFs/UDTFs that execute nested SQL queries within rows.☆54Updated 10 years ago
- Analyze the structure and dynamics of an open source project's developer community, using graph algorithms, etc.☆58Updated 4 years ago
- Myria is a scalable Analytics-as-a-Service platform based on relational algebra.☆116Updated 4 years ago
- An open-source, vendor-neutral data context service.☆160Updated 7 years ago
- A platform for real-time streaming search☆102Updated 9 years ago
- ☆84Updated 7 years ago
- Utils around luigi.☆66Updated 4 months ago
- zenvisage's foundational framework☆70Updated 3 years ago
- Material for some talks I have given☆62Updated last year
- Luigi Plugin for Hubot☆36Updated 9 years ago
- ☆110Updated 8 years ago
- A RESTful web service that runs microtasks across multiple crowds, provides quality control techniques, and is easily extensible.☆52Updated 8 years ago
- Partitioned storage system based on blosc. **No longer actively maintained.**☆155Updated 9 years ago
- A Python wrapper over the GraphGen system☆37Updated 8 years ago
- A Python library for creating fast, repeatable and self-documenting data analysis pipelines.☆242Updated 3 weeks ago
- Standard evaluations for binary classifiers so you don't have to☆315Updated 7 years ago
- Code + Jupyter notebook for analyzing and visualizing Reddit Data quickly and easily☆111Updated 10 years ago
- Generates more or less realistic log data for testing simple aggregation queries.☆263Updated 2 years ago
- Portland Python Meetup March 2015☆40Updated 10 years ago
- One way of using Plot.ly on Zeppelin notebooks☆28Updated 9 years ago
- Sample repo for luigi tasks & config☆36Updated 9 years ago
- Distributed DataFrame: Productivity = Power x Simplicity For Scientists & Engineers, on any Data Engine☆166Updated 4 years ago
- Serving system for batch generated data sets☆177Updated 8 years ago
- Code to transform Hillary's emails from raw PDF documents to a SQLite database☆161Updated 10 years ago
- An experimental hosted platform (GitHub-like) for organizing, managing, sharing, collaborating, and making sense of data.☆211Updated 7 years ago
- S3 backed ContentsManager for jupyter notebooks☆14Updated 9 years ago
- An external PySpark module that works like R's read.csv or Panda's read_csv, with automatic type inference and null value handling. Parse…☆90Updated 10 years ago
- Looking at big data? Add a little salt.☆59Updated 2 years ago
- SociaLite: query language for large-scale graph analysis and data mining☆111Updated 9 years ago