big data technologies comparisons for cleaning, manipulating and generally wrangling data in purpose of analysis and machine learning.
☆65Jun 3, 2020Updated 5 years ago
Alternatives and similar repositories for big_data_benchmarks
Users that are interested in big_data_benchmarks are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Instant search for and access to many datasets in Pyspark.☆34Oct 6, 2022Updated 3 years ago
- An R package containing utilities for institutional researchers. This package is also used to support the Introduction to R and LaTeX doc…☆15Mar 13, 2019Updated 7 years ago
- Overleaf from your terminal — pull, push, two-way sync, compile, and manage LaTeX projects via CLI. Smart .olignore filtering, deletion p…☆58May 18, 2026Updated last week
- Mobile Artificial Intelligence Projects, published by Packt☆11Jan 30, 2023Updated 3 years ago
- A work-in-progress book on Dask☆12Jul 15, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Basic framework utilities to quickly start writing production ready Apache Spark applications☆36Dec 15, 2024Updated last year
- Code Repository for Interactive Chatbots with TensorFlow[V], published by Packt☆20Jan 18, 2021Updated 5 years ago
- A Data Mesh demo repository☆13Oct 10, 2024Updated last year
- Spatial Data Analysis with High Performance Computing (HPC)☆10Oct 8, 2025Updated 7 months ago
- A minimal regression library for Julia☆12Apr 24, 2018Updated 8 years ago
- best practices and standards for the delivery of alternative data to the investment industry☆11Apr 21, 2026Updated last month
- make your statistical research faster☆12Jul 7, 2023Updated 2 years ago
- Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per s…☆8,504Apr 1, 2026Updated last month
- ☆23Aug 20, 2021Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆10Jun 29, 2021Updated 4 years ago
- Assignments and Projects for Udacity's Data Wrangling with MongoDB course☆16Oct 17, 2016Updated 9 years ago
- Pandas helper functions☆31Feb 19, 2023Updated 3 years ago
- Describes how to comply to the CEOS CARD4L specifications (SAR and Optical) with STAC☆12Oct 17, 2023Updated 2 years ago
- Parse usercss styles supported by the Stylus userstyle manager☆17Mar 14, 2023Updated 3 years ago
- prosEO – A Processing System for Earth Observation Data☆19Updated this week
- Examples for Econ 712, Fall 2013☆16Feb 17, 2020Updated 6 years ago
- A shiny application in order to ease up planning for the next hiking or bike trip.☆12Jun 26, 2022Updated 3 years ago
- Supplementary xts functionality, and development platform for GSoC projects☆14Feb 9, 2015Updated 11 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Udacity Data Engineer Nanodegree - Capstone project☆11Dec 19, 2019Updated 6 years ago
- Introductory Statistics for Economists (Undergraduate Intro Course)☆17Jan 25, 2021Updated 5 years ago
- Tools for working with CSV files in IPython.☆10Feb 17, 2016Updated 10 years ago
- Spring Boot CRUD Rest APIs with Spring Data Cassandra☆15Apr 30, 2021Updated 5 years ago
- This course will provide a basic, yet rigorous, introduction to Time Series Econometrics. This course is intended for upper-level undergr…☆19May 9, 2019Updated 7 years ago
- Contain Interview Questions Solutions☆12May 18, 2018Updated 8 years ago
- Re-implementation of selected PolSARpro functions in Python, following the scientific recommendations of PolInSAR 2021 (Work In Progress)…☆20May 20, 2026Updated last week
- Demo notebook of Ibis for "Spark + Python + Dita science Festival"☆12Jul 28, 2016Updated 9 years ago
- RFC document, tooling and other content related to the array API standard☆267Apr 23, 2026Updated last month
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Simulate infectious disease transmission with contact tracing☆17Updated this week
- Python Localstack Examples☆11Mar 3, 2026Updated 2 months ago
- A solution enabling customers to quickly deploy an architecture to identify and mask sensitive health data☆25Jul 6, 2023Updated 2 years ago
- Modelo de dissertação e teses em latex☆13Oct 23, 2017Updated 8 years ago
- Data Engineering Course Website☆14Apr 2, 2026Updated last month
- Cylon is a fast, scalable, distributed memory, parallel runtime with a Pandas like DataFrame.☆302Mar 12, 2026Updated 2 months ago
- SNAP as a conda package☆13Jun 10, 2021Updated 4 years ago