Pandas is a powerful, open-source data analysis and manipulation library for Python, designed to make data operations intuitive and fast for application developers. It provides two primary data structures: Series (one-dimensional) and DataFrame (two-dimensional), which allow for flexible data handling similar to spreadsheet or SQL table operations. With Pandas, developers can easily perform tasks like data cleaning, aggregation, transformation, and visualization, thereby simplifying workflows in data-intensive applications. Its seamless integration with other Python libraries, such as NumPy, Matplotlib, and SciPy, enhances its capability to handle a variety of data formats and perform complex scientific computations. Pandas also offers robust I/O functionalities, enabling easy reading and writing of data from formats like CSV, Excel, SQL databases, and JSON, which facilitates rapid development and deployment of analytics-driven apps.
View the most prominent open source pandas projects in the list below. Click on a specific project to view its alternative or complementary packages. Make comparisons and find the best package for your app.
- The 30 Days of Python programming challenge is a step-by-step guide to learn the Python programming language in 30 days. This challenge m…☆58,787Feb 20, 2026Updated last week
- Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects,…☆47,961Updated this week
- Python Data Science Handbook: full text in Jupyter Notebooks☆46,850Jun 26, 2024Updated last year
- 10 Weeks, 20 Lessons, Data Science for All!☆34,014Updated this week
- A Fast, Extensible Progress Bar for Python and CLI☆30,976Feb 14, 2026Updated 2 weeks ago
- Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce,…☆28,896Mar 20, 2024Updated last year
- Chat with your database or your datalake (SQL, CSV, parquet). PandasAI makes data analysis conversational using LLMs and RAG.☆23,218Oct 28, 2025Updated 4 months ago
- Download market data from Yahoo! Finance's API☆21,798Feb 21, 2026Updated last week
- 🤗 The largest hub of ready-to-use datasets for AI models with fast, easy-to-use and efficient data manipulation tools☆21,228Updated this week
- 阿布量化交易系统(股票,期权,期货,比特币,机器学习) 基于python的开源量化交易,量化投资架构☆16,239Jan 24, 2026Updated last month
- PyGWalker: Turn your dataframe into an interactive UI for visual analysis☆15,644Dec 30, 2025Updated 2 months ago
- TuShare is a utility for crawling historical data of China stocks☆14,481Mar 13, 2024Updated last year
- Parallel computing with task scheduling☆13,746Updated this week
- Statistical data visualization in Python☆13,745Jan 22, 2026Updated last month
- 1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.☆13,389Feb 2, 2026Updated 3 weeks ago
- 人工智能学习路线图,整理近200个实战案例与项目,免费提供配套教材,零基础入门,就业实战!包括:Python,数学,机器学习,数据分析,深度学习,计算机视觉,自然语言处理,PyTorch tensorflow machine-learning,deep-learning d…☆12,665Jun 2, 2024Updated last year
- Practice your pandas skills!☆12,203Oct 17, 2025Updated 4 months ago
- Open Machine Learning Course☆10,452Jan 24, 2026Updated last month
- Modin: Scale your Pandas workflows by changing a single line of code☆10,362Feb 10, 2026Updated 2 weeks ago
- cuDF - GPU DataFrame Library☆9,498Updated this week
- A terminal spreadsheet multitool for discovering and arranging data☆8,836Feb 21, 2026Updated last week
- 《利用Python进行数据分析·第2版》☆8,794Feb 2, 2026Updated 3 weeks ago
- Repository to store sample python programs for python learning☆7,266Jul 24, 2025Updated 7 months ago
- the portable Python dataframe library☆6,417Updated this week
- Instant Kubernetes-Native Application Observability☆6,360Updated this week
- 🍊 Orange: Interactive data analysis☆5,558Updated this week
- Automatically visualize your pandas dataframe via a single print! 📊 💡☆5,371Mar 20, 2024Updated last year
- pandas中文教程☆5,085Apr 24, 2024Updated last year
- Visualizer for pandas data structures☆5,063Feb 20, 2026Updated last week
- Danfo.js is an open source, JavaScript library providing high performance, intuitive, and easy to use data structures for manipulating an…☆5,042Updated this week
- Python tools for geographic data☆5,052Feb 10, 2026Updated 2 weeks ago
- A comprehensive machine learning repository containing 30+ notebooks on different concepts, algorithms and techniques.☆4,983Sep 22, 2023Updated 2 years ago
- Technical Analysis Library using Pandas and Numpy☆4,896Jul 17, 2024Updated last year
- Mimesis is a fast Python library for generating fake data in multiple languages.☆4,795Jan 19, 2026Updated last month
- Time series forecasting with PyTorch☆4,797Updated this week
- A python wrapper for Alpha Vantage API for financial data.☆4,737Jul 27, 2025Updated 7 months ago
- 利用Python进行数据分析 第二版 (2017) 中文翻译笔记☆4,662May 8, 2018Updated 7 years ago
- Machine Learning Containers for NVIDIA Jetson and JetPack-L4T☆4,394Updated this week
- A light-weight, flexible, and expressive statistical data testing library☆4,212Feb 19, 2026Updated last week
- Missing data visualization module for Python.☆4,194May 14, 2024Updated last year