vaexio / vaex
Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second
☆8,335 · Updated 4 months ago
Alternatives and similar repositories for vaex:
Users who are interested in vaex are comparing it to the libraries listed below:
- Modin: Scale your Pandas workflows by changing a single line of code ☆10,020 · Updated this week
- Parallel computing with task scheduling ☆12,950 · Updated this week
- Voilà turns Jupyter notebooks into standalone web applications ☆5,581 · Updated 2 weeks ago
- Create delightful software with Jupyter Notebooks ☆5,017 · Updated 2 weeks ago
- Parameterize, execute, and analyze notebooks ☆6,084 · Updated last month
- Declarative visualization library for Python ☆9,588 · Updated this week
- cuDF - GPU DataFrame Library ☆8,689 · Updated this week
- A Python package for manipulating 2-dimensional tabular data structures ☆1,821 · Updated 3 months ago
- A light-weight, flexible, and expressive statistical data testing library ☆3,623 · Updated this week
- A package which efficiently applies any function to a pandas dataframe or series in the fastest available manner ☆2,569 · Updated 10 months ago
- Visual analysis and diagnostic tools to facilitate machine learning model selection. ☆4,312 · Updated 4 months ago
- A simple and efficient tool to parallelize Pandas operations on all available CPUs ☆3,718 · Updated 7 months ago
- 1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames. ☆12,726 · Updated this week
- Automatically visualize your pandas dataframe via a single print! ☆5,249 · Updated 10 months ago
- Jupyter Notebooks as Markdown Documents, Julia, Python or R scripts ☆6,742 · Updated this week
- Open Source AI/ML Platform ☆8,535 · Updated this week
- Visualizer for pandas data structures ☆4,847 · Updated last month
- Hummingbird compiles trained ML models into tensor computation for faster inference. ☆3,388 · Updated 3 weeks ago
- Always know what to expect from your data. ☆10,201 · Updated this week
- Computing with Python functions. ☆3,981 · Updated 3 weeks ago
- NumPy and Pandas interface to Big Data ☆3,193 · Updated last year
- the portable Python dataframe library ☆5,515 · Updated this week
- Open source platform for the machine learning lifecycle ☆19,510 · Updated this week
- An open source python library for automated feature engineering ☆7,366 · Updated this week
- Tools for diffing and merging of Jupyter notebooks. ☆2,700 · Updated 4 months ago
- HiPlot makes understanding high dimensional data easy ☆2,777 · Updated last year
- Statsmodels: statistical modeling and econometrics in Python ☆10,434 · Updated last week
- An interactive grid for sorting, filtering, and editing DataFrames in Jupyter notebooks ☆3,064 · Updated last year
- STUMPY is a powerful and scalable Python library for modern time series analysis ☆3,781 · Updated 2 weeks ago
- Automatic extraction of relevant features from time series: ☆8,572 · Updated this week