toddwschneider / nyc-taxi-data
Import public NYC taxi and for-hire vehicle (Uber, Lyft) trip data into a PostgreSQL or ClickHouse database
☆2,001 · Updated 7 months ago
Related projects
Alternatives and complementary repositories for nyc-taxi-data
- Uber trip data from a freedom of information request to NYC's Taxi & Limousine Commission ☆713 · Updated 2 years ago
- NYC Citi Bike system data and analysis ☆243 · Updated 6 months ago
- A minimal benchmark for scalability, speed and accuracy of commonly used open source implementations (R packages, Python scikit-learn, H2… ☆1,869 · Updated 2 years ago
- Anomaly Detection with R ☆3,567 · Updated 5 years ago
- A data science IDE for Python ☆3,924 · Updated 6 years ago
- An exhaustive reference to problems seen in real-world data along with suggestions on how to resolve them. ☆4,057 · Updated 3 years ago
- SFrame: Scalable tabular and graph data-structures built for out-of-core data analysis and machine learning. ☆890 · Updated 6 years ago
- Import and analyze Chicago public taxi and ride-hailing data ☆82 · Updated 4 years ago
- A proofreader for your data ☆691 · Updated last year
- Interactive JS Charts from R ☆1,191 · Updated 8 years ago
- Feather: fast, interoperable binary data frame storage for Python, R, and more powered by Apache Arrow ☆2,742 · Updated 3 years ago
- Python toolbox for visualizing geographical data and making maps ☆1,027 · Updated 2 years ago
- HeavyDB (formerly OmniSciDB) ☆2,957 · Updated 2 months ago
- Loan-level analysis of Fannie Mae and Freddie Mac data ☆216 · Updated 4 years ago
- Information for setting up for the BerkeleyX Spark Intro MOOC, and lab assignments for the course ☆350 · Updated 3 years ago
- Breakout Detection via Robust E-Statistics ☆755 · Updated 7 years ago
- A collection of public data sets ☆511 · Updated 9 months ago
- A Python data analysis library that is optimized for humans instead of machines. ☆1,173 · Updated 3 months ago
- dplyr for Python ☆764 · Updated 7 years ago
- ggplot port for Python ☆3,700 · Updated last year
- Practical tutorials and labs for TensorFlow used by Nvidia, FFN, CNN, RNN, Kaggle, AE ☆1,946 · Updated 8 years ago
- Interactive NBA and NCAA Shot Charts with R and Shiny ☆598 · Updated last year
- A library for reading text files over multiple cores. ☆1,060 · Updated last year
- A fast, offline reverse geocoder in Python ☆1,875 · Updated last year
- ☆459 · Updated last year
- Plotting library for IPython/Jupyter notebooks ☆3,627 · Updated this week
- A list of awesome interactive journalism projects. ☆1,901 · Updated 5 years ago
- Please visit https://github.com/h2oai/h2o-3 for latest H2O ☆2,224 · Updated 3 weeks ago
- Run MapReduce jobs on Hadoop or Amazon Web Services ☆2,614 · Updated last year
- PySpark + Scikit-learn = Sparkit-learn ☆1,155 · Updated 3 years ago