toddwschneider / nyc-taxi-data
Import public NYC taxi and for-hire vehicle (Uber, Lyft) trip data into a PostgreSQL or ClickHouse database
☆2,018Updated 11 months ago
Alternatives and similar repositories for nyc-taxi-data:
Users that are interested in nyc-taxi-data are comparing it to the libraries listed below
- Uber trip data from a freedom of information request to NYC's Taxi & Limousine Commission☆717Updated 2 years ago
- NYC Citi Bike system data and analysis☆245Updated 11 months ago
- A minimal benchmark for scalability, speed and accuracy of commonly used open source implementations (R packages, Python scikit-learn, H2…☆1,879Updated 2 years ago
- Anomaly Detection with R☆3,592Updated 5 years ago
- Breakout Detection via Robust E-Statistics☆758Updated 7 years ago
- MacroBase: A Search Engine for Fast Data☆665Updated 2 years ago
- Import and analyze Chicago public taxi and ride-hailing data☆85Updated 5 years ago
- Data workflow tool, like a "Make for data"☆1,483Updated 2 years ago
- Feather: fast, interoperable binary data frame storage for Python, R, and more powered by Apache Arrow☆2,743Updated 3 years ago
- USC urban data science course series in Python☆1,278Updated 5 months ago
- A Jupyter notebook extension for geospatial visualization and analysis☆1,081Updated 6 years ago
- Forecasting Functions for Time Series and Linear Models☆1,137Updated 7 months ago
- An exhaustive reference to problems seen in real-world data along with suggestions on how to resolve them.☆4,063Updated 3 years ago
- Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning☆1,782Updated 3 years ago
- Geo Spatial Data Analytics on Spark☆532Updated 3 years ago
- A library for time series analysis on Apache Spark☆1,192Updated 4 years ago
- A scalable machine learning library on Apache Spark☆793Updated 3 years ago
- R frontend for Spark☆641Updated 8 years ago
- Platform for building statistical models of cities and regions☆495Updated last year
- Information for setting up for the BerkeleyX Spark Intro MOOC, and lab assignments for the course☆349Updated 4 years ago
- PySpark + Scikit-learn = Sparkit-learn☆1,154Updated 4 years ago
- Lifetime value in Python☆1,462Updated 9 months ago
- Data and code behind the Economist's Graphic Detail section.☆401Updated 4 years ago
- Urban Informatics and Visualization (UC Berkeley CP255)☆228Updated 6 years ago
- A data science IDE for Python☆3,919Updated 6 years ago
- Data Visualization Server☆962Updated 8 years ago
- nyc taxi data☆73Updated 8 years ago
- Repo for NYC Taxis: A Day in the Life, a data visualization that shows the movements and earnings of a single NYC taxi over 24 hours.☆454Updated 10 months ago
- A python tutorial on bayesian modeling techniques (PyMC3)☆2,490Updated 7 years ago
- A package for plotting maps in R with ggplot2☆771Updated last year