big data technologies comparisons for cleaning, manipulating and generally wrangling data in purpose of analysis and machine learning.
☆65Jun 3, 2020Updated 5 years ago
Alternatives and similar repositories for big_data_benchmarks
Users that are interested in big_data_benchmarks are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Instant search for and access to many datasets in Pyspark.☆34Oct 6, 2022Updated 3 years ago
- An R package containing utilities for institutional researchers. This package is also used to support the Introduction to R and LaTeX doc…☆15Mar 13, 2019Updated 7 years ago
- ocr for historical data☆14Feb 23, 2025Updated last year
- Basic framework utilities to quickly start writing production ready Apache Spark applications☆36Dec 15, 2024Updated last year
- Code Repository for Interactive Chatbots with TensorFlow[V], published by Packt☆21Jan 18, 2021Updated 5 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Spatial Data Analysis with High Performance Computing (HPC)☆10Oct 8, 2025Updated 6 months ago
- A minimal regression library for Julia☆12Apr 24, 2018Updated 7 years ago
- Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per s…☆8,501Apr 1, 2026Updated 2 weeks ago
- Dataset corresponding to the paper: "Form2Seq : A Framework for Higher-Order Form Structure Extraction"☆10Feb 17, 2021Updated 5 years ago
- This is the course papg of PhD level advanced macroeconomics.☆10Sep 13, 2021Updated 4 years ago
- Assignments and Projects for Udacity's Data Wrangling with MongoDB course☆16Oct 17, 2016Updated 9 years ago
- Parse usercss styles supported by the Stylus userstyle manager☆17Mar 14, 2023Updated 3 years ago
- prosEO – A Processing System for Earth Observation Data☆19Apr 9, 2026Updated last week
- A shiny application in order to ease up planning for the next hiking or bike trip.☆12Jun 26, 2022Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Udacity Data Engineer Nanodegree - Capstone project☆11Dec 19, 2019Updated 6 years ago
- Spring Boot CRUD Rest APIs with Spring Data Cassandra☆15Apr 30, 2021Updated 4 years ago
- SBT template for projects written in Scala and other JVM languages☆13Dec 29, 2021Updated 4 years ago
- Repository for the UTN BA Data Science Course 2020☆14Jun 28, 2021Updated 4 years ago
- RFC document, tooling and other content related to the array API standard☆266Mar 19, 2026Updated 3 weeks ago
- A solution enabling customers to quickly deploy an architecture to identify and mask sensitive health data☆25Jul 6, 2023Updated 2 years ago
- Modelo de dissertação e teses em latex☆13Oct 23, 2017Updated 8 years ago
- Cylon is a fast, scalable, distributed memory, parallel runtime with a Pandas like DataFrame.☆302Mar 12, 2026Updated last month
- Optimization and Generalization Analysis of Transduction through Gradient Boosting and Application to Multi-scale Graph Neural Networks☆13Jun 16, 2020Updated 5 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Repository for CS282R: Robust Machine Learning at Harvard University.☆75Mar 30, 2018Updated 8 years ago
- An open source 3GPP LTE implementation. (GitHub import of https://sourceforge.net/projects/openlte/)☆10Mar 7, 2017Updated 9 years ago
- This repository contains FastAPI learning stuff☆36Nov 1, 2023Updated 2 years ago
- Multi-task regression in Python☆25Feb 3, 2021Updated 5 years ago
- Code for An Inductive Bias for Distances: Neural Nets that Respect the Triangle Inequality (ICLR 2020)☆11Mar 24, 2023Updated 3 years ago
- Pangeo & OpenEO Joint tutorial for BiDS23 - "Scaling Big Data Analysis with Pangeo and OpenEO: Unlocking the Power of Space Data"☆11Feb 29, 2024Updated 2 years ago
- (CRNN) Chinese Characters Recognition. add Backbone network resnet18 senet☆10Oct 20, 2021Updated 4 years ago
- ☆12Dec 30, 2020Updated 5 years ago
- Feature selection package of the mlr3 ecosystem.☆39Updated this week
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- In which I learn about score functions and how they can be used to generate data.☆16Apr 5, 2024Updated 2 years ago
- bamboolib - template for creating your own binder notebook☆21Dec 14, 2021Updated 4 years ago
- ☆11Aug 7, 2022Updated 3 years ago
- A collection of python utility functions☆11Mar 30, 2026Updated 2 weeks ago
- A curated list of awesome Dash (plotly) resources☆11Nov 24, 2017Updated 8 years ago
- Show effects of over-subscription and ways to fix that☆16Aug 15, 2024Updated last year
- A Rust display driver for the SSD1327☆16Nov 22, 2021Updated 4 years ago