big data technologies comparisons for cleaning, manipulating and generally wrangling data in purpose of analysis and machine learning.
☆65Jun 3, 2020Updated 5 years ago
Alternatives and similar repositories for big_data_benchmarks
Users that are interested in big_data_benchmarks are comparing it to the libraries listed below
Sorting:
- Instant search for and access to many datasets in Pyspark.☆34Oct 6, 2022Updated 3 years ago
- An R package containing utilities for institutional researchers. This package is also used to support the Introduction to R and LaTeX doc…☆15Mar 13, 2019Updated 6 years ago
- The easiest way to create jamstack sites, as simple or as complex as you like☆10Apr 10, 2022Updated 3 years ago
- Mobile Artificial Intelligence Projects, published by Packt☆11Jan 30, 2023Updated 3 years ago
- ☆42Oct 24, 2020Updated 5 years ago
- Examples for Econ 712, Fall 2013☆16Feb 17, 2020Updated 6 years ago
- A minimal regression library for Julia☆12Apr 24, 2018Updated 7 years ago
- Assignments and Projects for Udacity's Data Wrangling with MongoDB course☆16Oct 17, 2016Updated 9 years ago
- JVM integration for Weld☆16Sep 24, 2018Updated 7 years ago
- bamboolib - template for creating your own binder notebook☆21Dec 14, 2021Updated 4 years ago
- Multi-task regression in Python☆25Feb 3, 2021Updated 5 years ago
- Code and examples for O'Reilly's Data Wrangling with Python video course☆28Jun 8, 2016Updated 9 years ago
- IR Code Sharing for the Canadian Institutional Research and Planning Association☆13May 30, 2025Updated 9 months ago
- Course Materials for Practical Data Analysis with Python and SQL☆34Aug 4, 2024Updated last year
- ☆10Jun 29, 2021Updated 4 years ago
- Repository for the Statistical Modeling & Causal Inference 2020-I Tutorial☆28May 17, 2020Updated 5 years ago
- Course materials for UMBC DATA 690 - Statistical Analysis and Data Visualization with Python.☆12Dec 5, 2024Updated last year
- Statistical modeling lies at the heart of data science. Well crafted statistical models allow data scientists to draw conclusions about t…☆11Jan 21, 2026Updated last month
- spark-sight: Spark performance at a glance☆10Apr 6, 2023Updated 2 years ago
- Various machine learning approaches are widely applied for short-term solar power forecasting, which is highly demanded for renewable ene…☆13Feb 18, 2020Updated 6 years ago
- ☆11Dec 17, 2025Updated 2 months ago
- Cylon is a fast, scalable, distributed memory, parallel runtime with a Pandas like DataFrame.☆302Updated this week
- Pandas helper functions☆31Feb 19, 2023Updated 3 years ago
- Chinese translation of the polars-book user guide☆30Jan 2, 2024Updated 2 years ago
- Repository for CS282R: Robust Machine Learning at Harvard University.☆74Mar 30, 2018Updated 7 years ago
- 斯坦福大学CS231n课程作业项目:深度学习、卷积神经网络等☆11Mar 2, 2018Updated 8 years ago
- This repo consists of all courses of IBM - Data Science Professional Certificate, providing with techniques covering a wide array of data…☆16Aug 15, 2020Updated 5 years ago
- Dataset corresponding to the paper: "Form2Seq : A Framework for Higher-Order Form Structure Extraction"☆10Feb 17, 2021Updated 5 years ago
- ☆32Dec 22, 2025Updated 2 months ago
- Реализация sklearn-based Transformer-а для Weight of Evidence преобразования☆10May 6, 2020Updated 5 years ago
- ☆32Apr 4, 2022Updated 3 years ago
- Code Snippets & DataSets for Business Analytics & Data Mining/ Machine Learning Algorithms☆15Apr 23, 2018Updated 7 years ago
- Basic framework utilities to quickly start writing production ready Apache Spark applications☆36Dec 15, 2024Updated last year
- Sentiment Analysis of COVID-19 Vaccine-related Twitter Data☆10May 30, 2021Updated 4 years ago
- Repository for the UTN BA Data Science Course 2020☆14Jun 28, 2021Updated 4 years ago
- My scripts from "http://www.codecademy.com/"☆11May 1, 2014Updated 11 years ago
- Fundamental Accounting Concept Relations validation for International Financial Reporting Standards (IFRS).☆14Sep 20, 2018Updated 7 years ago
- Data Catalog for Databases and Data Warehouses☆36Jan 15, 2024Updated 2 years ago
- A Data Mesh demo repository☆13Oct 10, 2024Updated last year