"Surfing the Data Pipeline with Python" is a textbook that guides people through the steps of getting themselves unstuck, acquiring data, wrangling data, and exploring data.
☆16May 15, 2025Updated 11 months ago
Alternatives and similar repositories for surfing-the-data-pipeline
Users that are interested in surfing-the-data-pipeline are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Deep learning for interpreting chest x-rays☆13Feb 12, 2025Updated last year
- Introduction materials for Reproducible Research Curriculum with Jupyter notebook☆11Jan 24, 2018Updated 8 years ago
- Sonification middleware☆12Oct 7, 2020Updated 5 years ago
- A Wikidata puzzle game☆19Nov 5, 2016Updated 9 years ago
- generate code_swarm data from Wikipedia page histories & user contributions☆25Nov 19, 2008Updated 17 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Create a citation graph from pubmed data using R☆10Nov 4, 2016Updated 9 years ago
- Publication and sharing with Jupyter notebooks for reproducible research☆10May 10, 2019Updated 6 years ago
- Tutorial and hands-on notebook on using the Knowledge Graph Toolkit (KGTK)☆82Jul 7, 2022Updated 3 years ago
- a web based tool to monitor how your website content is used in wikipedia☆37Oct 22, 2020Updated 5 years ago
- Repository of the OpenCitations Index of Crossref open DOI-to-DOI citations (COCI)☆23Aug 25, 2019Updated 6 years ago
- Berkeley's Data8 Infrastructure specific documentation & guides☆10Jun 22, 2020Updated 5 years ago
- Python Hyptertext Preprocessor☆11Jan 26, 2022Updated 4 years ago
- ☆12Apr 7, 2026Updated last week
- ShEx schemas for common vocabularies and use cases.☆13Oct 7, 2019Updated 6 years ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Materials of FutureTDM project☆11Aug 22, 2017Updated 8 years ago
- A Twitter bot powered by Github Actions tweeting various Wikimedia milestones to some Twitter bots☆11Dec 28, 2023Updated 2 years ago
- Framework for running text mining tools on latest publications. Main page at:☆15Jul 13, 2022Updated 3 years ago
- Mastering spaCy, Second Edition published by Packt☆24Feb 4, 2025Updated last year
- Resources for Teaching Statistics Using Baseball, 2nd Edition by Jim Albert☆16May 4, 2017Updated 8 years ago
- ☆14Apr 19, 2022Updated 4 years ago
- Code to compute topic coherence for several topic cardinalities and aggregate scores across them☆21Sep 10, 2025Updated 7 months ago
- Diversity, Equity & Inclusion at OpenCon: A report to keep OpenCon transparent and accountable to our commitments to equity, diversity, a…☆11Jan 16, 2018Updated 8 years ago
- A simple IPython/Jupyter cell magic command to display name and version of imported modules.☆16Aug 27, 2017Updated 8 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- This is all my random garbage.☆26Jul 6, 2023Updated 2 years ago
- Code for analysing Wikidata SPARQL query logs☆12Dec 8, 2022Updated 3 years ago
- https://1000-plus.github.io/☆24Apr 7, 2026Updated last week
- Tools for laminar analysis of the cortical sheet in Python☆12Jul 23, 2019Updated 6 years ago
- Core do Portal Modelo com o buildout de desenvolvimento e produção☆16Nov 17, 2017Updated 8 years ago
- My supporting code for Google Cloud & NCAA® ML Competition 2019-Men's (4th place finish)☆19Apr 17, 2019Updated 7 years ago
- Materials for SDM 2023 tutorial: Augmentation Methods for Graph Learning☆21Apr 28, 2023Updated 2 years ago
- front-end web app for binder deployments☆16Aug 22, 2017Updated 8 years ago
- A BitTorrent tracker for legal open sharing of scientific data☆15Jul 20, 2011Updated 14 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ChatOps for Tinkerbell☆15Jun 11, 2020Updated 5 years ago
- ☆15Apr 12, 2022Updated 4 years ago
- This is the sequential Encoder-Decoder implementation of Neural Machine Translation using Keras☆17Aug 5, 2018Updated 7 years ago
- The home repository for the (Re)usable Data Project.☆14Mar 29, 2026Updated 3 weeks ago
- 📕Ansible playbooks for Raspberry Pi, Linux and Mac☆14Dec 22, 2024Updated last year
- 🏘️ Hubness reduced nearest neighbor search for entity alignment with knowledge graph embeddings☆29Apr 23, 2024Updated last year
- A repo for collecting content for the Jupyter Newsletter☆13Sep 28, 2016Updated 9 years ago