airbnb / knowledge-repo
A next-generation curated knowledge sharing platform for data scientists and other technical professions.
☆5,482Updated 2 months ago
Related projects ⓘ
Alternatives and complementary repositories for knowledge-repo
- 📚 Parameterize, execute, and analyze notebooks☆5,977Updated last month
- the portable Python dataframe library☆5,318Updated this week
- Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, vis…☆17,883Updated last week
- Plotting library for IPython/Jupyter notebooks☆3,628Updated this week
- Data-Centric Pipelines and Data Versioning☆6,181Updated this week
- Open Source Platform for developing, scaling and deploying serious ML, AI, and data science systems☆8,256Updated this week
- NumPy and Pandas interface to Big Data☆3,187Updated last year
- Visual analysis and diagnostic tools to facilitate machine learning model selection.☆4,293Updated last month
- 📘 The interactive computing suite for you! ✨☆6,212Updated 10 months ago
- A data science IDE for Python☆3,925Updated 6 years ago
- A machine learning package built for humans.☆4,795Updated last month
- Beaker Extensions for Jupyter Notebook☆2,799Updated 11 months ago
- Voilà turns Jupyter notebooks into standalone web applications☆5,465Updated 2 weeks ago
- Web mining module for Python, with tools for scraping, natural language processing, machine learning, network analysis and visualization.☆8,753Updated 5 months ago
- Declarative statistical visualization library for Python☆9,384Updated this week
- Vowpal Wabbit is a machine learning system which pushes the frontier of machine learning with techniques such as online, hashing, allredu…☆8,488Updated last month
- Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark☆1,481Updated this week
- Tools for diffing and merging of Jupyter notebooks.☆2,677Updated last month
- Parallel computing with task scheduling☆12,604Updated this week
- A Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming.☆9,739Updated 3 months ago
- A logical, reasonably standardized, but flexible project structure for doing and sharing data science work.☆8,356Updated 3 months ago
- A curated list of data science blogs☆6,318Updated 5 months ago
- Introduction to Statistics and Basics of Mathematics for Data Science - The Hacker's Way☆1,445Updated 6 years ago
- Feather: fast, interoperable binary data frame storage for Python, R, and more powered by Apache Arrow☆2,742Updated 3 years ago
- dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build application…☆9,976Updated this week
- A curated list of awesome pipeline toolkits inspired by Awesome Sysadmin☆6,204Updated last month
- Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per s…☆8,299Updated last month
- H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random F…☆6,928Updated this week
- The Open Source Feature Store for Machine Learning☆5,613Updated this week
- A curated list of data engineering tools for software developers☆6,828Updated 3 weeks ago