"Surfing the Data Pipeline with Python" is a textbook that guides people through the steps of getting themselves unstuck, acquiring data, wrangling data, and exploring data.
☆16May 15, 2025Updated last year
Alternatives and similar repositories for surfing-the-data-pipeline
Users who are interested in surfing-the-data-pipeline are comparing it to the libraries listed below.
- Scan and monitor your network effortlessly! Nmap Prometheus Exporter provides insights into network health and security with Prometheus-c…☆15Oct 2, 2023Updated 2 years ago
- Instance of the Hypermodern Python Cookiecutter☆11Oct 3, 2023Updated 2 years ago
- Common dependencies for data science workflows☆29Updated this week
- NeurIPS 2024 AutoGluon Workshop. See website: https://autogluon.github.io/neurips-autogluon-workshop/☆13Dec 10, 2024Updated last year
- A tool for harvesting media files from Open Access articles for upload into Wikimedia Commons☆25Jul 3, 2016Updated 9 years ago
- Sonification middleware☆12Oct 7, 2020Updated 5 years ago
- Create a citation graph from pubmed data using R☆10Nov 4, 2016Updated 9 years ago
- A web-based tool to monitor how your website content is used in Wikipedia☆37Oct 22, 2020Updated 5 years ago
- Repository of the OpenCitations Index of Crossref open DOI-to-DOI citations (COCI)☆24Aug 25, 2019Updated 6 years ago
- Berkeley's Data8 Infrastructure specific documentation & guides☆10Jun 22, 2020Updated 5 years ago
- Python Hypertext Preprocessor☆11Jan 26, 2022Updated 4 years ago
- [DEPRECATED] Intelligent media curation tool with filters for managing real-time feeds of information☆23Sep 26, 2011Updated 14 years ago
- ☆17Feb 6, 2018Updated 8 years ago
- submission to https://www.openscienceprize.org/☆11Mar 1, 2016Updated 10 years ago
- ShEx schemas for common vocabularies and use cases.☆13Oct 7, 2019Updated 6 years ago
- Parser LexML de documentos normativos☆18May 7, 2026Updated last month
- Materials of FutureTDM project☆11Aug 22, 2017Updated 8 years ago
- A Twitter bot powered by GitHub Actions tweeting various Wikimedia milestones to some Twitter bots☆11Dec 28, 2023Updated 2 years ago
- Framework for running text mining tools on latest publications. Main page at:☆15Jul 13, 2022Updated 3 years ago
- We created a topic modeling pipeline to evaluate different topic modeling algorithms, including their performance on short and long text,…☆21May 22, 2025Updated last year
- Mastering spaCy, Second Edition published by Packt☆24Feb 4, 2025Updated last year
- stub repo for prelim work on SSRN replacement☆15May 19, 2016Updated 10 years ago
- ☆14Apr 19, 2022Updated 4 years ago
- Populate tables with data from the server using JavaScript; includes sorting, and pagination.☆15Jan 8, 2017Updated 9 years ago
- Crowd-source tool to annotate, rank and generally enhance the clarity and accuracy of clinical trial information.☆16Mar 6, 2023Updated 3 years ago
- Tools for laminar analysis of the cortical sheet in Python☆12Jul 23, 2019Updated 6 years ago
- A visualization of the rate of edits to Wikipedia in various languages.☆12Aug 10, 2024Updated last year
- A user-friendly Command & Control (C&C) web platform for remote monitoring, management, and task automation across multiple devices.☆14Dec 15, 2024Updated last year
- ☆15Apr 12, 2022Updated 4 years ago
- Miscellaneous projects, too small to warrant their own github projects, in their own subdirectories here.☆14Aug 15, 2020Updated 5 years ago
- This is the sequential Encoder-Decoder implementation of Neural Machine Translation using Keras☆17Aug 5, 2018Updated 7 years ago
- The home repository for the (Re)usable Data Project.☆14Mar 29, 2026Updated 2 months ago
- 📕Ansible playbooks for Raspberry Pi, Linux and Mac☆14Dec 22, 2024Updated last year
- 🏘️ Hubness reduced nearest neighbor search for entity alignment with knowledge graph embeddings☆29Apr 23, 2024Updated 2 years ago
- A repo for collecting content for the Jupyter Newsletter☆13Sep 28, 2016Updated 9 years ago
- This repository is the source code of the Python Tutorial Based on the Official Documentation: https://youtu.be/ne4Xsoe5Br8☆25Jun 17, 2021Updated 4 years ago
- What can social media tell us about an article's impact?☆23Jun 7, 2012Updated 14 years ago
- ☆14Feb 17, 2025Updated last year
- Totally awesome Textmate bundle for Turtle – the terse RDF Triple Language.☆28Feb 12, 2018Updated 8 years ago