A fast way of getting a Spark cluster up and running on AWS with the friendly IPython interface.
☆10 · May 8, 2015 · Updated 10 years ago
Alternatives and similar repositories for sparknotebook
Users who are interested in sparknotebook are comparing it to the repositories listed below.
- A website for sharing machine learning datasets · ☆21 · Mar 25, 2016 · Updated 9 years ago
- European Parliament website Python scraper · ☆12 · Oct 19, 2016 · Updated 9 years ago
- OSoMe API mashups · ☆11 · Jan 29, 2019 · Updated 7 years ago
- 🥪💾 A sample of data from the `jaffle-shop-generator` that powers the Jaffle Shop spanning one year. · ☆15 · Jan 23, 2025 · Updated last year
- Rank Aggregation Algorithms · ☆12 · Jul 22, 2014 · Updated 11 years ago
- Additional files for the Otto Group Challenge hosted by Kaggle · ☆37 · Mar 30, 2015 · Updated 10 years ago
- The Data Product Specification · ☆11 · Jan 28, 2025 · Updated last year
- Command-line tool for building Gephi force-directed graph diagrams. · ☆10 · Nov 10, 2017 · Updated 8 years ago
- ☆11 · Aug 4, 2022 · Updated 3 years ago
- Manage Unity Catalog tables with Pydantic Models · ☆10 · Mar 5, 2025 · Updated 11 months ago
- Workshop materials for scraping Twitter with Python · ☆13 · May 25, 2016 · Updated 9 years ago
- Scala library for parsing fixed length file format · ☆13 · Oct 19, 2021 · Updated 4 years ago
- Implementation of data dimensionality reduction algorithms SVD and CUR without using library functions. · ☆10 · Jul 24, 2017 · Updated 8 years ago
- Github action for running python unit tests · ☆10 · Jun 16, 2025 · Updated 8 months ago
- Playground site for creating/validating data contracts · ☆11 · Aug 9, 2025 · Updated 6 months ago
- prebuilt configurations for docker-rpm-builder · ☆11 · Feb 5, 2021 · Updated 5 years ago
- Architecture principles · ☆13 · May 23, 2025 · Updated 9 months ago
- A collection of practical circom circuits · ☆14 · May 19, 2022 · Updated 3 years ago
- The Tweets2013 Internet Archive collection · ☆10 · Aug 7, 2020 · Updated 5 years ago
- Python module for sending metrics to Graphite over UDP · ☆26 · Jan 12, 2022 · Updated 4 years ago
- Event data in a box, basically. · ☆15 · Nov 4, 2014 · Updated 11 years ago
- Rackspace / Open Stack Cloud Files Erlang Client · ☆27 · Mar 16, 2013 · Updated 12 years ago
- Clustering library for Elixir · ☆12 · Jul 9, 2017 · Updated 8 years ago
- Pachyderm/MLeap team up to provide versioned datasets + models · ☆10 · Jun 7, 2017 · Updated 8 years ago
- An implementation of Dijkstra in Clojure · ☆19 · Aug 7, 2012 · Updated 13 years ago
- Repository with materials for HSE minor students (group iad16) · ☆15 · Jan 26, 2017 · Updated 9 years ago
- Price options by fitting a Lévy distribution · ☆10 · Jan 20, 2021 · Updated 5 years ago
- A moment-free estimator of the Sharpe (signal-to-noise) ratio. · ☆12 · Dec 27, 2022 · Updated 3 years ago
- The COVID-19 Digital Observatory collects, aggregates, and distributes data from social media, search engine results, and Wikipedia to su… · ☆11 · Dec 17, 2020 · Updated 5 years ago
- Repository for AWS Workshop on IoT. Control and collect data from a swarm of IoT devices simulated locally. · ☆10 · Jun 10, 2017 · Updated 8 years ago
- Ansible Role for JBoss/Wildfly and JBoss-based products. · ☆11 · Oct 7, 2019 · Updated 6 years ago
- Operating documents for the technical steering committee. · ☆15 · Updated this week
- vcsh config base repository (required before all others) · ☆14 · Jun 20, 2017 · Updated 8 years ago
- Collaborative web framework for analyzing text (e.g., tweets). Supports standard labeling and pairwise comparison. · ☆14 · Sep 15, 2021 · Updated 4 years ago
- Temporary repository for Japanese · ☆11 · Apr 25, 2019 · Updated 6 years ago
- A TCP replication server, or broadcaster, that replicates TCP commands to other TCP servers · ☆30 · Sep 15, 2021 · Updated 4 years ago
- Blind second price auction for ERC721 mints · ☆11 · Feb 7, 2023 · Updated 3 years ago
- Setup Apache Airflow on Kubernetes · ☆10 · Jul 20, 2018 · Updated 7 years ago
- Recover temporal information from grown trees, using Python · ☆11 · Mar 11, 2021 · Updated 4 years ago