vaexio / vaex
Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second
★ 8,257, updated this week
Related projects:
- Modin: Scale your Pandas workflows by changing a single line of code (★ 9,747, updated this week)
- Declarative statistical visualization library for Python (★ 9,227, updated this week)
- 1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames. (★ 12,385, updated last week)
- Parallel computing with task scheduling (★ 12,405, updated this week)
- Voilà turns Jupyter notebooks into standalone web applications (★ 5,394, updated 2 weeks ago)
- An open source python library for automated feature engineering (★ 7,196, updated this week)
- Visual analysis and diagnostic tools to facilitate machine learning model selection. (★ 4,264, updated last year)
- Build and manage real-life ML, AI, and data science projects with ease! (★ 8,046, updated this week)
- Parameterize, execute, and analyze notebooks (★ 5,789, updated 3 weeks ago)
- ML Experiments and Data Management with Git (★ 13,608, updated this week)
- Jupyter Notebooks as Markdown Documents, Julia, Python or R scripts (★ 6,588, updated 2 weeks ago)
- A simple and efficient tool to parallelize Pandas operations on all available CPUs (★ 3,634, updated 2 months ago)
- A package which efficiently applies any function to a pandas dataframe or series in the fastest available manner (★ 2,515, updated 5 months ago)
- Create delightful software with Jupyter Notebooks (★ 4,883, updated this week)
- An open-source, low-code machine learning library in Python (★ 8,818, updated 2 weeks ago)
- Automated Machine Learning with scikit-learn (★ 7,542, updated last week)
- cuDF - GPU DataFrame Library (★ 8,259, updated this week)
- A Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming. (★ 9,655, updated last month)
- HiPlot makes understanding high dimensional data easy (★ 2,739, updated 8 months ago)
- Automatically visualize your pandas dataframe via a single print! (★ 5,136, updated 5 months ago)
- Visualize and compare datasets, target values and associations, with one line of code. (★ 2,909, updated last month)
- Hummingbird compiles trained ML models into tensor computation for faster inference. (★ 3,332, updated 3 weeks ago)
- Missing data visualization module for Python. (★ 3,896, updated 4 months ago)
- Statistical data visualization in Python (★ 12,401, updated last month)
- Fit interpretable models. Explain blackbox machine learning. (★ 6,211, updated this week)
- Open source platform for the machine learning lifecycle (★ 18,340, updated this week)
- Statsmodels: statistical modeling and econometrics in Python (★ 9,978, updated this week)
- Computing with Python functions. (★ 3,815, updated 3 weeks ago)
- A library of extension and helper modules for Python's data analysis and machine learning libraries. (★ 4,862, updated 2 months ago)
- Interactive Data Visualization in the browser, from Python (★ 19,227, updated this week)