☆17Jun 23, 2024Updated last year
Alternatives and similar repositories for one_billion_row_challenge_python
Users that are interested in one_billion_row_challenge_python are comparing it to the libraries listed below.
- This repository contains an example of how to leverage Cloud Composer and Cloud Dataflow to move data from a Microsoft SQL Server to BigQ…☆20Mar 25, 2026Updated 2 months ago
- Plane moment analysis with Apache Flink complex event processing☆18Jun 14, 2025Updated 11 months ago
- Part-of-speech tagger for the English language☆10Jul 31, 2018Updated 7 years ago
- A work-in-progress book on Dask☆12Jul 15, 2023Updated 2 years ago
- Rust + Python Lake House Health Analyzer | Detect • Diagnose • Optimize • Flow☆65Oct 20, 2025Updated 7 months ago
- Deploy a complete data stack in just a couple of minutes.☆15Mar 6, 2024Updated 2 years ago
- Deployed a Kafka instance on an AWS EC2 instance to streamline the data into Databricks☆10Aug 15, 2023Updated 2 years ago
- Save and load entire workspaces containing pandas objects and numpy arrays☆15Oct 5, 2018Updated 7 years ago
- 🚗 Downloads a Google Drive folder that you can query with gatsby-source-filesystem.☆12Mar 2, 2023Updated 3 years ago
- This repo contains all the codes and sample files for the "Short and Long-term Pattern Discovery Over Large-Scale Geo-Spatiotemporal Data…☆13May 19, 2022Updated 4 years ago
- Exploring some issues related to churn☆17Mar 19, 2024Updated 2 years ago
- This project builds an end-to-end e-commerce data pipeline from data source into data warehouse and visualization.☆13Sep 5, 2024Updated last year
- NoSQL extract, transform, load (ETL) toolkit with Python☆16Updated this week
- DuckDB with Dashboarding tools demo evidence, streamlit and rill☆23Dec 18, 2023Updated 2 years ago
- Gatsby transformer plugin for jupyter notebooks☆10Jan 7, 2019Updated 7 years ago
- ☆18May 20, 2026Updated 3 weeks ago
- Most recent/important talks given at conferences/meetups☆14Nov 27, 2020Updated 5 years ago
- R interface to Kusto/Azure Data Explorer. Submit issues and PRs at https://github.com/Azure/AzureKusto☆18Oct 13, 2023Updated 2 years ago
- Sample code for a Rasa virtual assistant with an Alexa connector.☆19Apr 6, 2022Updated 4 years ago
- This project supplies utility classes and functions for dynamic loading and management of data from designated directories. As well as co…☆16Mar 5, 2024Updated 2 years ago
- Terraform module to deploy Apache Druid in Kubernetes☆18Sep 2, 2020Updated 5 years ago
- Corona Invader is a replica of the famous Space Invader, with a twist. In the game, you impersonate the United States, and you will have …☆12Sep 10, 2020Updated 5 years ago
- Predicting the medal table of the Summer Games☆12Jul 6, 2023Updated 2 years ago
- Python code for Bayesian Conditional Cointegration☆18May 28, 2017Updated 9 years ago
- CS 489/698 Big Data Infrastructure (Winter 2017) at the University of Waterloo☆15Apr 17, 2017Updated 9 years ago
- ☆13May 3, 2022Updated 4 years ago
- A markdown wiki and dashboarding system for Datasette☆21Nov 2, 2021Updated 4 years ago
- Streamlit App to create interactive visualizations☆12Jun 21, 2024Updated last year
- python talk on data visualizations - focused on matplotlib and bokeh☆14Apr 14, 2026Updated last month
- [Python / Streamlit] Pokemon Sleep helper (Pokémon potential calculation, recipe filtering, Pokémon information)☆14May 4, 2024Updated 2 years ago
- docker images for class☆10Jul 27, 2021Updated 4 years ago
- an end-to-end data pipeline extracting music listening habits and producing an insightful dashboard☆18Mar 31, 2024Updated 2 years ago
- Making a Class Schedule Using a Genetic Algorithm with Python☆45May 8, 2026Updated last month
- Tutorial for easy-to-manage data pipelines with Airflow☆10Jun 26, 2022Updated 3 years ago
- dbt plugin for Palm CLI☆20Mar 20, 2024Updated 2 years ago
- Archive of talks for SF Python meetups☆14Jun 13, 2019Updated 6 years ago
- Landing Page for Pycon ID 2020☆12Aug 29, 2021Updated 4 years ago
- Discover the perfect harmony of tunes and movies!☆10Aug 17, 2023Updated 2 years ago
- Source code for the PyPI package with the same name