deanmalmgren / flo
enable rapid iteration and development of complex data pipelines
☆28Updated 6 years ago
Alternatives and similar repositories for flo:
Users who are interested in flo are comparing it to the libraries listed below
- Utils around luigi.☆65Updated 4 years ago
- A Python version (almost a port) of ProPublica's TableFu☆233Updated 11 years ago
- A polite, minimal interface for sending python objects to and from Amazon S3.☆57Updated 8 years ago
- Simple spill-to-disk dictionary☆60Updated 3 years ago
- JSON -> Relational DB Column Types☆63Updated 2 years ago
- Creating Rickshaw.js visualizations with Python Pandas☆265Updated 8 years ago
- A framework (command line tool + libraries) for creating flexible compute pipelines☆56Updated 4 years ago
- Portland Python Meetup March 2015☆40Updated 9 years ago
- Randomly sample lines from a csv, tsv, or other line-based data file☆123Updated 9 years ago
- Utilities for data cleaning and ETL processing☆24Updated 7 years ago
- Simple spill-to-disk dictionary☆17Updated 8 years ago
- More tools for Python☆51Updated 5 years ago
- A python package for defensive data analysis.☆17Updated 9 years ago
- Task Orchestration Tool Based on SWF and boto3☆38Updated 6 years ago
- IPython Notebook + D3☆128Updated 10 years ago
- Create and manage instances for data science☆20Updated 9 years ago
- Fetch and plot AWS spot pricing history☆23Updated 8 years ago
- Utilities for dealing with URIs, invented and maintained by Yelp.☆14Updated last year
- S3 backed ContentsManager for jupyter notebooks☆13Updated 9 years ago
- ArchiveKit manages data and documents during ETL processes, either on a local file system or on S3.☆15Updated 9 years ago
- Partitioned storage system based on blosc. **No longer actively maintained.**☆152Updated 8 years ago
- A Python library for dealing with splittable files☆42Updated 5 years ago
- Convert URLs to a normalized unicode format☆67Updated 7 years ago
- Data analysis tool.☆84Updated last year
- A Python library for creating fast, repeatable and self-documenting data analysis pipelines.☆239Updated last week
- Streaming newline delimited JSON I/O.☆12Updated last year
- Deprecated, please use https://github.com/jcrist/skein or https://github.com/dask/dask-yarn instead☆52Updated 6 years ago
- workflow support for reproducible deduplication and merging☆16Updated last year
- Scheduled task execution on top of AWS Data Pipeline☆43Updated 9 years ago
- Dat python client☆46Updated 8 years ago