☆24Dec 21, 2020Updated 5 years ago
Alternatives and similar repositories for data-analysis-with-python-and-pyspark
Users who are interested in data-analysis-with-python-and-pyspark are comparing it to the libraries listed below.
- Code repository for the "PySpark in Action" book☆218Jun 11, 2025Updated last year
- C++ 11 version of my Fortran code gmsh-to-vtk-and-tecplot☆12Jan 14, 2022Updated 4 years ago
- ☆19Dec 2, 2025Updated 6 months ago
- Kyoto Encyclopedia of Genes and Genomes (KEGG) NetworkX Topological parser automates downloading, parsing, and converting from a KEGG Mark…☆12May 8, 2024Updated 2 years ago
- Utilities for simple needs☆27Sep 7, 2025Updated 9 months ago
- ☆11Jun 15, 2019Updated 6 years ago
- ☆12Jun 20, 2024Updated last year
- Python library & CLI to create, view and edit PFB files☆14Apr 24, 2026Updated last month
- ☆16Nov 17, 2023Updated 2 years ago
- ☆10Feb 22, 2023Updated 3 years ago
- This is a simple demo for integrating Authing in AWS China region to protect API Gateway REST API and other AWS resources such as IoT, Po…☆12Oct 12, 2024Updated last year
- Open episode of the data engineering practice course☆32Jul 2, 2024Updated last year
- Ascertained Sequentially Markovian Coalescent☆18Oct 22, 2025Updated 7 months ago
- Transform natural language into beautiful, interactive data visualizations using the Model Context Protocol (MCP) with Claude Desktop int…☆19Jun 27, 2025Updated 11 months ago
- Automated Machine Learning for Environmental Data-Driven Genome Prediction☆14Sep 12, 2025Updated 9 months ago
- Robot Framework keyword library for CSV files☆19Sep 3, 2024Updated last year
- A self-contained, ready to run Airflow and Kafka project. Can be run locally or within codespaces.☆16Jul 15, 2023Updated 2 years ago
- ☆14Oct 29, 2024Updated last year
- This is the pipeline of our new article "Enzyme Co-Scientist: Harnessing Large Language Models for Enzyme Kinetic Data Extraction from Li…☆17May 23, 2025Updated last year
- ☆16Feb 19, 2025Updated last year
- DeepVariant-on-Spark is a germline short variant calling pipeline that runs Google DeepVariant on Apache Spark at scale.☆12May 4, 2022Updated 4 years ago
- Complete SQL + Databases Bootcamp: Zero to Mastery [2020]☆32Sep 29, 2020Updated 5 years ago
- Open-source opinionated Galaxy-based framework for microbiota analysis☆13Jan 21, 2021Updated 5 years ago
- Integrative protein sequence design with evolutionary multiobjective optimization.☆13Jul 16, 2024Updated last year
- Code necessary to reproduce experiments in "FloraBERT: cross-species transfer learning with attention-based neural networks for gene expr…☆13Jul 6, 2022Updated 3 years ago
- Knowledge Graph Embeddings (KGE) to implement Explainable Artificial Intelligence. As AI develops users must know how algorithms make the…☆11Apr 17, 2026Updated last month
- A tool for phasing and imputing haplotypes in 10k+ low coverage sequencing samples☆10Nov 20, 2020Updated 5 years ago
- ☆11Updated this week
- ☆12Jan 25, 2018Updated 8 years ago
- Simplify Big Data Analytics with Amazon EMR, published by Packt☆13Jan 18, 2023Updated 3 years ago
- Files to Build a Docker Image for Facebook Prophet☆13Feb 7, 2019Updated 7 years ago
- The Gen3 Workflow Execution Service☆10Mar 11, 2022Updated 4 years ago
- Python library for creating, validating, and executing machine learning pipelines represented as Executable Knowledge Graphs.☆22Mar 19, 2026Updated 2 months ago
- ☆16Mar 24, 2025Updated last year
- correlationMatrix is a Python powered library for the statistical analysis and visualization of correlations☆14Dec 17, 2024Updated last year
- Modeling the genomic regulatory codes of fly, mouse, worm, and fish with deep learning☆17Apr 6, 2021Updated 5 years ago
- ☆34Jan 27, 2024Updated 2 years ago
- ☆15Sep 26, 2019Updated 6 years ago
- This is the PyTorch Implementation for our model VRKG4Rec (WSDM'23)☆20Mar 30, 2023Updated 3 years ago