Python and Pandas are known to have issues around scalability and efficiency. You will learn how to use libraries such as Modin, Dask, Ray, Vaex etc to overcome the problems faced by Pandas.
☆129Feb 20, 2024Updated 2 years ago
Alternatives and similar repositories for Python-big-data
Users that are interested in Python-big-data are comparing it to the libraries listed below
Sorting:
- ☆84Jul 20, 2024Updated last year
- This topic explains about the implementation of exploratory data analysis (EDA). A total of 21 EDA case studies have been implemented usi…☆196Jan 12, 2025Updated last year
- ☆102Dec 2, 2023Updated 2 years ago
- ☆62Oct 10, 2023Updated 2 years ago
- ☆35May 20, 2024Updated last year
- This course presents to the students recent research and industrial issues pertaining to data engineering, database systems and technolog…☆126Jan 2, 2025Updated last year
- High performance data processing employs high performance computing (HPC) to process data, which is then translated into information and …☆123Jan 17, 2026Updated last month
- ☆22May 3, 2024Updated last year
- Course covers big data fundamentals, processes, technologies, platform ecosystem, and management for practical application development.☆58Oct 8, 2025Updated 4 months ago
- ☆28Sep 12, 2025Updated 5 months ago
- Course material-related information.☆23Jan 8, 2025Updated last year
- The daily life of a PhD student may differ significantly from that of an undergraduate or Masters student. There will be much more indepe…☆37Updated this week
- Final Year Project or commonly known as a Projek Sarjana Muda (PSM) is a course whereby each undergraduate student must undertake and pas…☆66Feb 8, 2025Updated last year
- ☆18Mar 11, 2024Updated last year
- This course is designed to provide students with in depth knowledge on software project planning, cost estimation and scheduling, projec…☆23Jan 4, 2025Updated last year
- This repository is used to store the necessary materials for writing an original book. It serves as a centralized location for authors to…☆10Mar 18, 2024Updated last year
- This repository hosts an Obsidian vault tailored for conducting a Systematic Literature Review (SLR). This repository is part of the acti…☆86Mar 11, 2024Updated last year
- ☆10Dec 29, 2024Updated last year
- This course will cover the fundamental steps and implementation on developing the initial ideas to formal academic writing accordingly. S…☆95Jul 26, 2025Updated 7 months ago
- This repository offers educational resources, code examples, and implementations relating to data structures and algorithms using C++ pro…☆17Jun 15, 2024Updated last year
- Hands-on Exploratory Data Analysis with Python, published by Packt☆868Feb 5, 2026Updated 3 weeks ago
- AI-powered literature review tools leverage machine learning to expedite and enhance the scholarly process of identifying, analyzing, an…☆79Oct 13, 2025Updated 4 months ago
- ☆41Oct 13, 2024Updated last year
- snowball option pricing, Monte Carlo, PDE, Greeks☆10Apr 28, 2023Updated 2 years ago
- Utilizing a combination of Excel, SQL, and Power BI, I delved into an extensive dataset comprising over 50,000 entries of pizza sales dat…☆14Mar 12, 2024Updated last year
- This repository offers educational resources, code examples, and implementations relating to object oriented programming using C++ progra…☆19Jul 10, 2024Updated last year
- This UE4 project contains the Telekinesis Mechanic for Control☆11Jul 26, 2020Updated 5 years ago
- A high-performance Python SDK for the ProjectX Trading Platform Gateway API. This library enables developers to build sophisticated tradi…☆23Sep 23, 2025Updated 5 months ago
- An end-to-end open-source data stack for crawling and visualizing real estate data, facilitating insights into market trends.☆15May 23, 2024Updated last year
- My implemention of Hidden Markov Model(HMM) and Conditional Random Field(CRF) for Part of Speech tagging in python 3.6☆11Nov 7, 2018Updated 7 years ago
- ☆12Mar 26, 2020Updated 5 years ago
- TradeGPT is a full-stack cryptocurrency trading application that combines a modern Fresh (Deno) frontend with a Python (FastAPI) backend …☆15Mar 1, 2025Updated last year
- ChatTube: A Retrieval QA System to Youtube Videos☆10Jun 6, 2023Updated 2 years ago
- Pair Trading Analysis & Exercises Toolkit [Jupyter Notebook]☆12Nov 3, 2023Updated 2 years ago
- Combine Spark and Python to process large datasets and unlock the power of parallel computing and machine learning☆40May 10, 2019Updated 6 years ago
- This project uses PySpark and Python to analyze a Google Play Store dataset. It covers data cleaning, duplicate removal, and visual analy…☆12Apr 6, 2022Updated 3 years ago
- FastAPI wrapper for LLM, a fork of (oobabooga / text-generation-webui)☆10Jun 1, 2023Updated 2 years ago
- Notebooks exploring the Canadian Institute of Cybersecurity's IoT dataset.☆11Mar 6, 2024Updated last year
- strategy backtesting framework☆12Oct 22, 2024Updated last year