☆24Dec 21, 2020Updated 5 years ago
Alternatives and similar repositories for data-analysis-with-python-and-pyspark
Users that are interested in data-analysis-with-python-and-pyspark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code repository for the "PySpark in Action" book☆217Jun 11, 2025Updated 10 months ago
- Demo code to illustrate the execution of PyTest unit test cases for AWS Glue jobs in AWS CodePipeline using AWS CodeBuild projects☆50Dec 3, 2024Updated last year
- Kyoto Encylopedia of Genes and Genomes (KEGG) NetworkX Topological parser automates downloading, parsing, and converting from a KEGG Mark…☆12May 8, 2024Updated last year
- ☆11Jun 15, 2019Updated 6 years ago
- ☆13Jun 20, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- This repository implements a variety of sequence model architectures from scratch in PyTorch. Effort has been put to make the code well s…☆12Jun 25, 2021Updated 4 years ago
- An open and introductory book for the Python API of Apache Spark (pyspark) 📚📖☆12Sep 19, 2025Updated 7 months ago
- A simple python SDK around PubMed API.☆22Jan 1, 2025Updated last year
- This is a simple demo for integrating Authing in AWS China region to protect API Gateway REST API and other AWS resources such as IoT, Po…☆12Oct 12, 2024Updated last year
- ☆10Feb 22, 2023Updated 3 years ago
- Ascertained Sequentially Markovian Coalescent☆18Oct 22, 2025Updated 6 months ago
- Transform natural language into beautiful, interactive data visualizations using the Model Context Protocol (MCP) with Claude Desktop int…☆19Jun 27, 2025Updated 10 months ago
- Automated Machine Learning for Environmental Data-Driven Genome Prediction☆14Sep 12, 2025Updated 7 months ago
- Optimization solvers in pure Python: LP, MILP, SAT, constraint programming, graph and metaheuristics. No dependencies. Solvor all your op…☆28Apr 7, 2026Updated 3 weeks ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆11Mar 4, 2025Updated last year
- KACC: A Multi-task Benchmark for Knowledge Abstraction, Concretization and Completion☆12Oct 21, 2021Updated 4 years ago
- This construct builds some elements for you to quickly launch an EMR Serverless application. After submitting the Emr Serverless job, you…☆11Nov 18, 2025Updated 5 months ago
- This is the pipeline of our new article "Enzyme Co-Scientist: Harnessing Large Language Models for Enzyme Kinetic Data Extraction from Li…☆17May 23, 2025Updated 11 months ago
- ☆15Feb 19, 2025Updated last year
- DeepVariant-on-Spark is a germline short variant calling pipeline that runs Google DeepVariant on Apache Spark at scale.☆12May 4, 2022Updated 4 years ago
- ☆13Mar 1, 2018Updated 8 years ago
- Complete SQL + Databases Bootcamp: Zero to Mastery [2020]☆31Sep 29, 2020Updated 5 years ago
- This repository contains example patterns for storing large objects with DynamoDB.☆13Jun 19, 2024Updated last year
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Open-source opinionated Galaxy-based framework for microbiota analysis☆13Jan 21, 2021Updated 5 years ago
- Integrative protein sequence design with evolutionary multiobjective optimization.☆12Jul 16, 2024Updated last year
- Code necessary to reproduce experiments in "FloraBERT: cross-species transfer learning with attention-based neural networks for gene expr…☆13Jul 6, 2022Updated 3 years ago
- Knowledge Graph Embeddings (KGE) to implement Explainable Artificial Intelligence. As AI develops users must know how algorithms make the…☆11Apr 17, 2026Updated 2 weeks ago
- A tool for phasing and imputing haplotypes in 10k+ low coverage sequencing samples☆10Nov 20, 2020Updated 5 years ago
- ☆12Jan 25, 2018Updated 8 years ago
- ☆13Jun 27, 2024Updated last year
- simulate sequence data and complicated pedigree structures☆16Apr 5, 2023Updated 3 years ago
- Simplify Big Data Analytics with Amazon EMR, published by Packt☆13Jan 18, 2023Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- An implementation of pneumonia medical X-ray image classification problem using Federated Learning in PySyft.☆13Jun 16, 2019Updated 6 years ago
- Files to Build a Docker Image for Facebook Prophet☆13Feb 7, 2019Updated 7 years ago
- Python library for creating, validating, and executing machine learning pipelines represented as Executable Knowledge Graphs.☆21Mar 19, 2026Updated last month
- Time series analysis on AWS, published by Packt☆16Mar 2, 2026Updated 2 months ago
- ☆15Sep 26, 2019Updated 6 years ago
- Accessibility-ready business WordPress theme.☆15Sep 3, 2025Updated 8 months ago
- This is the PyTorch Implementation for our model VRKG4Rec (WSDM'23)☆20Mar 30, 2023Updated 3 years ago