A tool to help you to test and develop pyspark code with sampled and local data
☆15May 3, 2026Updated last month
Alternatives and similar repositories for DDataFlow
Users that are interested in DDataFlow are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Keep your local python scripts installed and in sync with a databricks notebook. Shortens the feedback loop to develop projects using a h…☆16Jun 16, 2025Updated 11 months ago
- ☆12May 8, 2026Updated last month
- Native Polars I/O plugin for Delta Lake, backed by delta-kernel-rs.☆17Jun 2, 2026Updated last week
- Type-annotate your spark dataframes and validate them☆14Feb 5, 2026Updated 4 months ago
- tool to automatically create and update the config file for Dependabot (dependabot.yml)☆16Jun 3, 2026Updated last week
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Streamlit app for "Honey, I broke the PyTorch model" - Talk @ PyCon & PyData 2023☆18Apr 16, 2023Updated 3 years ago
- Ruby library for parsing, serializing, and manipulating GEXF graphs☆21Jan 31, 2014Updated 12 years ago
- The refactoring tutorial I wrote for PyConDE 2022. You can also work through the exercises on your own.☆19Apr 22, 2024Updated 2 years ago
- Tutorial session at PyConDE & Pydata 2024☆12Apr 23, 2024Updated 2 years ago
- R Test Adapter for the VSCode Test Explorer☆14Oct 27, 2025Updated 7 months ago
- An R package implementing an idiosyncratic Stan workflow☆15Aug 18, 2022Updated 3 years ago
- Unofficial MAX PLANCK INSTITUTE RMarkdown templates and ggplot themes☆15Jan 10, 2022Updated 4 years ago
- Repository for the Arrow Columnar Format Tutorial for PyCon DE 2024☆26Apr 24, 2024Updated 2 years ago
- A collection of tools that help me work with Avro☆23Jan 7, 2010Updated 16 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Convert local crontabs to UTC crontabs☆12Jul 9, 2021Updated 4 years ago
- Decorators for logging purposes for all your dataframes☆15Jan 31, 2025Updated last year
- Synthesising graphs and simulating things☆10Oct 25, 2022Updated 3 years ago
- Annotate data using Jupyter notebooks☆12Apr 1, 2022Updated 4 years ago
- Good Enough Practices in Scientific Computing☆24Updated this week
- real-time data + ML pipeline☆53May 11, 2026Updated last month
- Taskwarrior tasks reviewing script☆14May 25, 2015Updated 11 years ago
- A textual TUI for Prodigy☆16Jun 8, 2023Updated 3 years ago
- ✅ CLI to find broken URLs in files (awesome_bot alternative but significantly faster)☆18Aug 10, 2025Updated 10 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Stores email header and body information in JSON format☆12Mar 10, 2016Updated 10 years ago
- ☆11Oct 3, 2023Updated 2 years ago
- Basic Spark utilities☆13Feb 20, 2025Updated last year
- Web-scraping tool to extract and export current portfolio asset information from Scalable Capital and Trade Republic using Selenium libra…☆51Jan 21, 2026Updated 4 months ago
- ☆46Updated this week
- Analyze and model weekly calendar distributions using latent components☆21Updated this week
- ☆44Jun 15, 2023Updated 2 years ago
- Spark data pipeline that processes movie ratings data.☆31May 1, 2026Updated last month
- coco is an opensource conversation collector. or simply a fitness tracker for your conversations. coco is private by default. it runs on …☆31May 19, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆11Dec 29, 2024Updated last year
- A place to host demos for custom actions.☆14Feb 2, 2022Updated 4 years ago
- ☆16Sep 6, 2022Updated 3 years ago
- Browser extension to open GH repos on different online code editors☆12May 13, 2023Updated 3 years ago
- A collection of network-related python utilities.☆17Sep 8, 2023Updated 2 years ago
- A python script supporting my kanban workflow for taskwarrior☆21Aug 4, 2022Updated 3 years ago
- Apache NetBeans Maven Utils parent pom☆16Updated this week