Pandas is a powerful, open-source data analysis and manipulation library for Python, designed to make data operations intuitive and fast for application developers. It provides two primary data structures: Series (one-dimensional) and DataFrame (two-dimensional), which allow for flexible data handling similar to spreadsheet or SQL table operations. With Pandas, developers can easily perform tasks like data cleaning, aggregation, transformation, and visualization, thereby simplifying workflows in data-intensive applications. Its seamless integration with other Python libraries, such as NumPy, Matplotlib, and SciPy, enhances its capability to handle a variety of data formats and perform complex scientific computations. Pandas also offers robust I/O functionalities, enabling easy reading and writing of data from formats like CSV, Excel, SQL databases, and JSON, which facilitates rapid development and deployment of analytics-driven apps.
View the most prominent open source pandas projects in the list below. Click on a specific project to view its alternative or complementary packages. Make comparisons and find the best package for your app.
- Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects,…☆44,532Updated this week
- 30 days of Python programming challenge is a step-by-step guide to learn the Python programming language in 30 days. This challenge may t…☆44,431Updated this week
- Python Data Science Handbook: full text in Jupyter Notebooks☆43,836Updated 7 months ago
- A Fast, Extensible Progress Bar for Python and CLI☆29,241Updated last week
- 10 Weeks, 20 Lessons, Data Science for All!☆28,792Updated 3 months ago
- Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce,…☆27,850Updated 10 months ago
- 🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools☆19,605Updated this week
- Download market data from Yahoo! Finance's API☆15,681Updated this week
- PyGWalker: Turn your pandas dataframe into an interactive UI for visual analysis☆14,044Updated last month
- TuShare is a utility for crawling historical data of China stocks☆13,069Updated 11 months ago
- Parallel computing with task scheduling☆12,923Updated this week
- Statistical data visualization in Python☆12,851Updated 2 weeks ago
- 1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.☆12,703Updated last week
- 阿布量化交易系统(股票,期权,期货,比特币,机器学习) 基于python的开源量化交易,量化投资架构☆12,752Updated 2 months ago
- Practice your pandas skills!☆11,058Updated 5 months ago
- 人工智能学习路线图,整理近200个实战案例与项目,免费提供配套教材,零基础入门,就业实战!包括:Python,数学,机器学习,数据分析,深度学习,计算机视觉,自然语言处理,PyTorch tensorflow machine-learning,deep-learning d…☆10,402Updated 8 months ago
- Modin: Scale your Pandas workflows by changing a single line of code☆10,018Updated this week
- Open Machine Learning Course☆9,912Updated last month
- cuDF - GPU DataFrame Library☆8,669Updated this week
- 《利用Python进行数据分析·第2版》☆8,104Updated 5 months ago
- A terminal spreadsheet multitool for discovering and arranging data☆8,041Updated this week
- Repository to store sample python programs for python learning☆6,943Updated 6 months ago
- Instant Kubernetes-Native Application Observability☆5,795Updated this week
- Technical Analysis Indicators - Pandas TA is an easy to use Python 3 Pandas Extension with 150+ Indicators☆5,763Updated 6 months ago
- the portable Python dataframe library☆5,511Updated this week
- Automatically visualize your pandas dataframe via a single print! 📊 💡☆5,244Updated 10 months ago
- 🍊 Orange: Interactive data analysis☆4,984Updated this week
- Danfo.js is an open source, JavaScript library providing high performance, intuitive, and easy to use data structures for manipulating an…☆4,841Updated 7 months ago
- Visualizer for pandas data structures☆4,843Updated last month
- pandas中文教程☆4,743Updated 9 months ago
- A comprehensive machine learning repository containing 30+ notebooks on different concepts, algorithms and techniques.☆4,742Updated last year
- 利用Python进行数据分析 第二版 (2017) 中文翻译笔记☆4,612Updated 6 years ago
- Python tools for geographic data☆4,628Updated 3 weeks ago
- Mimesis is a robust data generator for Python that can produce a wide range of fake data in multiple languages.☆4,488Updated this week
- Technical Analysis Library using Pandas and Numpy☆4,512Updated 6 months ago
- A python wrapper for Alpha Vantage API for financial data.☆4,385Updated 6 months ago
- Time series forecasting with PyTorch☆4,129Updated last week
- Missing data visualization module for Python.☆4,027Updated 8 months ago
- pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptune, OpenSearch, QuickSight, Chime, CloudWatchLogs, DynamoD…☆3,977Updated this week