toddwschneider / nyc-taxi-data
Import public NYC taxi and for-hire vehicle (Uber, Lyft) trip data into a PostgreSQL or ClickHouse database
☆2,024Updated last year
Alternatives and similar repositories for nyc-taxi-data:
Users that are interested in nyc-taxi-data are comparing it to the libraries listed below
- Uber trip data from a freedom of information request to NYC's Taxi & Limousine Commission☆719Updated 2 years ago
- Anomaly Detection with R☆3,593Updated 5 years ago
- An exhaustive reference to problems seen in real-world data along with suggestions on how to resolve them.☆4,064Updated 3 years ago
- Import and analyze Chicago public taxi and ride-hailing data☆85Updated 5 years ago
- ggplot port for python☆3,700Updated 2 years ago
- Samples for users of the Yelp Academic Dataset☆1,253Updated last year
- Interactive NBA and NCAA Shot Charts with R and Shiny☆605Updated 2 years ago
- A scalable machine learning library on Apache Spark☆793Updated 3 years ago
- Quickly and accurately render even the largest data.☆3,398Updated 2 weeks ago
- A library for reading text files over multiple cores.☆1,055Updated last year
- ☆304Updated 5 years ago
- A proofreader for your data☆693Updated 2 years ago
- Breakout Detection via Robust E-Statistics☆759Updated 7 years ago
- Code accompanying the book "Machine Learning for Hackers"☆3,679Updated 5 years ago
- dplyr for python☆762Updated 8 years ago
- Public material for CS109☆1,480Updated 2 years ago
- http://DataScienceSpecialization.github.io☆1,643Updated 4 years ago
- Visualizing MBTA Data☆1,057Updated 2 years ago
- PlanOut is a library and interpreter for designing online experiments.☆1,687Updated 4 years ago
- Loan-level analysis of Fannie Mae and Freddie Mac data☆219Updated 5 years ago
- Feather: fast, interoperable binary data frame storage for Python, R, and more powered by Apache Arrow☆2,744Updated 3 years ago
- A minimal benchmark for scalability, speed and accuracy of commonly used open source implementations (R packages, Python scikit-learn, H2…☆1,882Updated 2 years ago
- python toolbox for visualizing geographical data and making maps☆1,034Updated 2 years ago
- Geo Spatial Data Analytics on Spark☆532Updated 3 years ago
- Course materials for the Data Science Specialization: https://www.coursera.org/specialization/jhudatascience/1☆4,087Updated 4 years ago
- NumPy and Pandas interface to Big Data☆3,199Updated last year
- HeavyDB (formerly OmniSciDB)☆2,991Updated 7 months ago
- An implementation of the Grammar of Graphics in R☆6,665Updated this week
- Data sets created for stories on The Pudding, open to the public.☆1,012Updated 5 months ago
- Forecasting Functions for Time Series and Linear Models☆1,138Updated last week