vaexio / vaex
Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second π
β8,299Updated last month
Related projects β
Alternatives and complementary repositories for vaex
- Parallel computing with task schedulingβ12,604Updated this week
- VoilΓ turns Jupyter notebooks into standalone web applicationsβ5,465Updated 2 weeks ago
- Modin: Scale your Pandas workflows by changing a single line of codeβ9,898Updated last month
- Declarative statistical visualization library for Pythonβ9,384Updated this week
- HiPlot makes understanding high dimensional data easyβ2,756Updated 10 months ago
- π Parameterize, execute, and analyze notebooksβ5,977Updated last month
- An open source python library for automated feature engineeringβ7,272Updated this week
- A package which efficiently applies any function to a pandas dataframe or series in the fastest available mannerβ2,541Updated 8 months ago
- Jupyter Notebooks as Markdown Documents, Julia, Python or R scriptsβ6,653Updated 2 months ago
- Create delightful software with Jupyter Notebooksβ4,936Updated this week
- A simple and efficient tool to parallelize Pandas operations on all availableΒ CPUsβ3,686Updated 4 months ago
- A Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming.β9,739Updated 3 months ago
- Open Source Platform for developing, scaling and deploying serious ML, AI, and data science systemsβ8,256Updated this week
- Computing with Python functions.β3,880Updated last week
- Open source platform for the machine learning lifecycleβ18,812Updated this week
- An open-source, low-code machine learning library in Pythonβ8,958Updated this week
- π¦ Data Versioning and ML Experimentsβ13,927Updated this week
- Visual analysis and diagnostic tools to facilitate machine learning model selection.β4,293Updated last month
- Low-code framework for building custom LLMs, neural networks, and other AI modelsβ11,192Updated this week
- cuDF - GPU DataFrame Libraryβ8,451Updated this week
- the portable Python dataframe libraryβ5,318Updated this week
- Bayesian Modeling and Probabilistic Programming in Pythonβ8,723Updated this week
- STUMPY is a powerful and scalable Python library for modern time series analysisβ3,666Updated this week
- NumPy and Pandas interface to Big Dataβ3,187Updated last year
- Production infrastructure for machine learning at scaleβ8,021Updated 5 months ago
- Statistical data visualization in Pythonβ12,581Updated 3 months ago
- A light-weight, flexible, and expressive statistical data testing libraryβ3,401Updated this week
- Plotting library for IPython/Jupyter notebooksβ3,628Updated this week
- Panel: The powerful data exploration & web app framework for Pythonβ4,792Updated this week