KaveIO / KaveToolbox
Data analytics toolkit part of the KAVE, installable stand-alone
☆16Updated 6 years ago
Related projects ⓘ
Alternatives and complementary repositories for KaveToolbox
- ☆33Updated 10 years ago
- A small extension of Ambari to support KAVE services installed into a cluster☆20Updated 5 years ago
- Code examples supporting the "Introduction to Apache Spark" video published by O'Reilly Media☆37Updated 2 years ago
- S3 backed ContentsManager for jupyter notebooks☆13Updated 8 years ago
- BigML Components for Talend Open Studio☆20Updated 7 years ago
- A collection of datasets and databases☆24Updated 6 years ago
- Quick informal survey at the Los Angeles Machine learning meetup about tools used for machine learning.☆51Updated 9 years ago
- A simple example of containerized data science with python and Docker.☆51Updated 6 years ago
- Proposals for new Jupyter subprojects to enter into incubation☆18Updated 4 years ago
- ☆15Updated 7 years ago
- Spark Tutorial at the University of Maryland☆38Updated 10 years ago
- CustomerML is an open source customer science platform leveraging the power of Predictiveworks and fully integrated with Elasticsearch an…☆47Updated 9 years ago
- Simple validator for submissions to DrivenData competitions☆19Updated 5 years ago
- Template for developing new or documenting existing predictive systems that are based on machine learning techniques. Currently in HTML.☆52Updated 5 years ago
- A Binder-compatibible repo with a Dockerfile☆11Updated 7 years ago
- Class files for Fast Track to Python☆20Updated 10 years ago
- Snippets of code used in blog posts and other media.☆13Updated last year
- Tool to cleanse and semantify datasets from CKAN repositories. Based on OpenRefine.☆23Updated 8 years ago
- Light-weight, Python-based data-analysis framework☆12Updated 5 years ago
- Deploy an interactive data science environment with JupyterHub on Docker Swarm☆21Updated 8 years ago
- Building Python Data Application Tutorials☆23Updated 3 months ago
- Analyze the structure and dynamics of an open source project's developer community, using graph algorithms, etc.☆57Updated 3 years ago
- Material for some talks I have given☆62Updated 2 months ago
- personal cheatsheets on various technologies☆25Updated 8 years ago
- A set of tools for working with Omniture daily data files (hit_data.tsv) in big or small tools like Spark, Hadoop or just Python.☆38Updated 5 years ago
- PredictionIO Classification Engine Template (Scala-based parallelized engine)☆39Updated 5 years ago
- Python bindings for Matroid API☆16Updated last month
- Using terraform, deploy multiple dataproc clusters using a shared hive metastore☆14Updated 2 years ago