big data technologies comparisons for cleaning, manipulating and generally wrangling data in purpose of analysis and machine learning.
☆65Jun 3, 2020Updated 5 years ago
Alternatives and similar repositories for big_data_benchmarks
Users that are interested in big_data_benchmarks are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Instant search for and access to many datasets in Pyspark.☆34Oct 6, 2022Updated 3 years ago
- Mobile Artificial Intelligence Projects, published by Packt☆11Jan 30, 2023Updated 3 years ago
- Using Python Tornado to serve Thrift HTTP requests☆13Dec 22, 2012Updated 13 years ago
- Basic framework utilities to quickly start writing production ready Apache Spark applications☆36Dec 15, 2024Updated last year
- Linear regression modelling of the Ames housing dataset, with the goal of predicting the house sale price, as published in Towards Data S…☆10Oct 30, 2025Updated 6 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Code Repository for Interactive Chatbots with TensorFlow[V], published by Packt☆20Jan 18, 2021Updated 5 years ago
- A minimal regression library for Julia☆12Apr 24, 2018Updated 8 years ago
- best practices and standards for the delivery of alternative data to the investment industry☆11Apr 21, 2026Updated 2 weeks ago
- ☆42Oct 24, 2020Updated 5 years ago
- Apache Kafka Overview☆12Jun 9, 2023Updated 2 years ago
- ☆12Mar 26, 2018Updated 8 years ago
- make your statistical research faster☆12Jul 7, 2023Updated 2 years ago
- Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per s…☆8,505Apr 1, 2026Updated last month
- a major mode for emacs for editing n3 and turtle RDF☆14Dec 13, 2017Updated 8 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Dataset corresponding to the paper: "Form2Seq : A Framework for Higher-Order Form Structure Extraction"☆10Feb 17, 2021Updated 5 years ago
- ☆10Jun 29, 2021Updated 4 years ago
- Pandas helper functions☆31Feb 19, 2023Updated 3 years ago
- Examples for Econ 712, Fall 2013☆16Feb 17, 2020Updated 6 years ago
- Supplementary xts functionality, and development platform for GSoC projects☆14Feb 9, 2015Updated 11 years ago
- Serverless hashtag recommendations using fastText and Python with AWS Lambda☆21Apr 12, 2018Updated 8 years ago
- Consulting Project with Manifold.co: Modeling System Resource Usage for Predictive Scheduling☆24Jul 8, 2018Updated 7 years ago
- Udacity Data Engineer Nanodegree - Capstone project☆11Dec 19, 2019Updated 6 years ago
- Introductory Statistics for Economists (Undergraduate Intro Course)☆17Jan 25, 2021Updated 5 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- This course will provide a basic, yet rigorous, introduction to Time Series Econometrics. This course is intended for upper-level undergr…☆19May 9, 2019Updated 6 years ago
- Contain Interview Questions Solutions☆12May 18, 2018Updated 7 years ago
- RFC document, tooling and other content related to the array API standard☆267Apr 23, 2026Updated last week
- Simulate infectious disease transmission with contact tracing☆17Apr 15, 2026Updated 3 weeks ago
- Python Localstack Examples☆11Mar 3, 2026Updated 2 months ago
- Julia Lang client for the ClickHouse TCP native protocol☆39Aug 8, 2024Updated last year
- A solution enabling customers to quickly deploy an architecture to identify and mask sensitive health data☆25Jul 6, 2023Updated 2 years ago
- Python Scripts for Backtesting SPX Put Strategies Using Black-Scholes Proxies☆14Feb 15, 2018Updated 8 years ago
- SNAP as a conda package☆13Jun 10, 2021Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Optimization and Generalization Analysis of Transduction through Gradient Boosting and Application to Multi-scale Graph Neural Networks☆13Jun 16, 2020Updated 5 years ago
- Repository for CS282R: Robust Machine Learning at Harvard University.☆75Mar 30, 2018Updated 8 years ago
- An open source 3GPP LTE implementation. (GitHub import of https://sourceforge.net/projects/openlte/)☆10Mar 7, 2017Updated 9 years ago
- Fuzzy Data Benchmark☆18Feb 8, 2024Updated 2 years ago
- convert PubLayNet data into METS/PAGE-XML☆10Mar 17, 2020Updated 6 years ago
- Multi-task regression in Python☆25Feb 3, 2021Updated 5 years ago
- Code for An Inductive Bias for Distances: Neural Nets that Respect the Triangle Inequality (ICLR 2020)☆11Mar 24, 2023Updated 3 years ago