☆26Dec 21, 2020Updated 5 years ago
Alternatives and similar repositories for data-analysis-with-python-and-pyspark
Users that are interested in data-analysis-with-python-and-pyspark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆19Dec 2, 2025Updated 7 months ago
- Demo code to illustrate the execution of PyTest unit test cases for AWS Glue jobs in AWS CodePipeline using AWS CodeBuild projects☆50Dec 3, 2024Updated last year
- Kyoto Encylopedia of Genes and Genomes (KEGG) NetworkX Topological parser automates downloading, parsing, and converting from a KEGG Mark…☆12May 8, 2024Updated 2 years ago
- ☆12Jun 20, 2024Updated 2 years ago
- Chromax is a breeding simulator based on JAX.☆10Jun 6, 2025Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Python library & CLI to create, view and edit PFB files☆14Apr 24, 2026Updated 2 months ago
- An open and introductory book for the Python API of Apache Spark (pyspark) 📚📖☆12Sep 19, 2025Updated 9 months ago
- A simple python SDK around PubMed API.☆23Jan 1, 2025Updated last year
- ☆10Feb 22, 2023Updated 3 years ago
- This is a simple demo for integrating Authing in AWS China region to protect API Gateway REST API and other AWS resources such as IoT, Po…☆12Oct 12, 2024Updated last year
- Ascertained Sequentially Markovian Coalescent☆18Oct 22, 2025Updated 8 months ago
- Transform natural language into beautiful, interactive data visualizations using the Model Context Protocol (MCP) with Claude Desktop int…☆19Jun 27, 2025Updated last year
- Automated Machine Learning for Environmental Data-Driven Genome Prediction☆14Sep 12, 2025Updated 9 months ago
- Optimization solvers in pure Python: LP, MILP, SAT, constraint programming, graph and metaheuristics. No dependencies. Solvor all your op…☆29Apr 7, 2026Updated 2 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- KACC: A Multi-task Benchmark for Knowledge Abstraction, Concretization and Completion☆12Oct 21, 2021Updated 4 years ago
- This construct builds some elements for you to quickly launch an EMR Serverless application. After submitting the Emr Serverless job, you…☆11Nov 18, 2025Updated 7 months ago
- This is the pipeline of our new article "Enzyme Co-Scientist: Harnessing Large Language Models for Enzyme Kinetic Data Extraction from Li…☆17May 23, 2025Updated last year
- ☆16Feb 19, 2025Updated last year
- DeepVariant-on-Spark is a germline short variant calling pipeline that runs Google DeepVariant on Apache Spark at scale.☆12May 4, 2022Updated 4 years ago
- ☆12Mar 1, 2018Updated 8 years ago
- This repository contains example patterns for storing large objects with DynamoDB.☆13Jun 19, 2024Updated 2 years ago
- Open-source opinionated Galaxy-based framework for microbiota analysis☆13Jan 21, 2021Updated 5 years ago
- Integrative protein sequence design with evolutionary multiobjective optimization.☆13Jul 16, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Code necessary to reproduce experiments in "FloraBERT: cross-species transfer learning with attention-based neural networks for gene expr…☆13Jul 6, 2022Updated 3 years ago
- Knowledge Graph Embeddings (KGE) to implement Explainable Artificial Intelligence. As AI develops users must know how algorithms make the…☆11Apr 17, 2026Updated 2 months ago
- Open source alternate for GeoGuessr☆13Nov 4, 2023Updated 2 years ago
- A tool for phasing and imputing haplotypes in 10k+ low coverage sequencing samples☆10Nov 20, 2020Updated 5 years ago
- ☆11Jun 26, 2026Updated last week
- ☆12Jan 25, 2018Updated 8 years ago
- ☆13Jun 27, 2024Updated 2 years ago
- Simplify Big Data Analytics with Amazon EMR, published by Packt☆13Jan 18, 2023Updated 3 years ago
- An implementation of pneumonia medical X-ray image classification problem using Federated Learning in PySyft.☆13Jun 16, 2019Updated 7 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Files to Build a Docker Image for Facebook Prophet☆13Feb 7, 2019Updated 7 years ago
- The Gen3 Workflow Execution Service☆10Mar 11, 2022Updated 4 years ago
- Python library for creating, validating, and executing machine learning pipelines represented as Executable Knowledge Graphs.☆22Mar 19, 2026Updated 3 months ago
- ☆16Mar 24, 2025Updated last year
- correlationMatrix is a Python powered library for the statistical analysis and visualization of correlations☆14Dec 17, 2024Updated last year
- Time series analysis on AWS, published by Packt☆17Mar 2, 2026Updated 4 months ago
- ☆15Jan 11, 2024Updated 2 years ago