Comparisons of big data technologies for cleaning, manipulating, and generally wrangling data for analysis and machine learning.
☆65Jun 3, 2020Updated 6 years ago
Alternatives and similar repositories for big_data_benchmarks
Users that are interested in big_data_benchmarks are comparing it to the libraries listed below.
- Instant search for and access to many datasets in Pyspark.☆34Oct 6, 2022Updated 3 years ago
- Mobile Artificial Intelligence Projects, published by Packt☆11Jan 30, 2023Updated 3 years ago
- A low-level execution library for analytic data processing.☆32May 9, 2024Updated 2 years ago
- A work-in-progress book on Dask☆12Jul 15, 2023Updated 2 years ago
- Code Repository for Interactive Chatbots with TensorFlow[V], published by Packt☆20Jan 18, 2021Updated 5 years ago
- A Data Mesh demo repository☆13Oct 10, 2024Updated last year
- Spatial Data Analysis with High Performance Computing (HPC)☆10Oct 8, 2025Updated 8 months ago
- A minimal regression library for Julia☆12Apr 24, 2018Updated 8 years ago
- ☆42Oct 24, 2020Updated 5 years ago
- Apache Kafka Overview☆12Jun 9, 2023Updated 3 years ago
- Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per s…☆8,504Apr 1, 2026Updated 2 months ago
- ☆10Jun 29, 2021Updated 4 years ago
- Assignments and Projects for Udacity's Data Wrangling with MongoDB course☆16Oct 17, 2016Updated 9 years ago
- Styles for TaskPaper 3☆11Jan 27, 2018Updated 8 years ago
- Pandas helper functions☆31Feb 19, 2023Updated 3 years ago
- Examples for Econ 712, Fall 2013☆16Feb 17, 2020Updated 6 years ago
- A shiny application in order to ease up planning for the next hiking or bike trip.☆12Jun 26, 2022Updated 3 years ago
- Supplementary xts functionality, and development platform for GSoC projects☆14Feb 9, 2015Updated 11 years ago
- Tools for working with CSV files in IPython.☆10Feb 17, 2016Updated 10 years ago
- Spring Boot CRUD Rest APIs with Spring Data Cassandra☆15Apr 30, 2021Updated 5 years ago
- ☆11Mar 28, 2024Updated 2 years ago
- Re-implementation of selected PolSARpro functions in Python, following the scientific recommendations of PolInSAR 2021 (Work In Progress)…☆20Jun 1, 2026Updated 2 weeks ago
- Simulate infectious disease transmission with contact tracing☆17Jun 8, 2026Updated last week
- ☆14Dec 10, 2019Updated 6 years ago
- A solution enabling customers to quickly deploy an architecture to identify and mask sensitive health data☆25Jul 6, 2023Updated 2 years ago
- A cloud native data mesh implementation☆12Jan 15, 2021Updated 5 years ago
- This repository includes two jupyter notebooks. The first one retrains the already pre-trained ResNet-50 using transfer learning in order…☆10Jul 23, 2020Updated 5 years ago
- Data Engineering Course Website☆14Apr 2, 2026Updated 2 months ago
- Based on wurstmeister's kafka-docker, with Prometheus JMX Exporter included☆12Nov 24, 2016Updated 9 years ago
- Examples for computing regression standard errors in Python with statsmodels☆14Feb 1, 2024Updated 2 years ago
- Fuzzy Data Benchmark☆18Feb 8, 2024Updated 2 years ago
- convert PubLayNet data into METS/PAGE-XML☆10Mar 17, 2020Updated 6 years ago
- Multi-task regression in Python☆25Feb 3, 2021Updated 5 years ago
- A lightweight, continuously-updated catalog of research papers on AI agents.☆29Oct 13, 2025Updated 8 months ago
- Pangeo & OpenEO Joint tutorial for BiDS23 - "Scaling Big Data Analysis with Pangeo and OpenEO: Unlocking the Power of Space Data"☆11Feb 29, 2024Updated 2 years ago
- Enclosures for Smart Citizen☆18May 14, 2026Updated last month
- Instructor: Xiaojiang Li☆17Sep 28, 2023Updated 2 years ago
- Source code related to the article "Deep splitting method for parabolic PDEs" by Christian Beck, Sebastian Becker, Patrick Cheridito, Arn…☆15Nov 1, 2020Updated 5 years ago
- PyTorch Flexible Hash Embeddings☆29Feb 4, 2020Updated 6 years ago