☆24Dec 21, 2020Updated 5 years ago
Alternatives and similar repositories for data-analysis-with-python-and-pyspark
Users that are interested in data-analysis-with-python-and-pyspark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆19Dec 2, 2025Updated 5 months ago
- Harry Potter Deep Learning Experiment☆10Mar 25, 2023Updated 3 years ago
- Kyoto Encylopedia of Genes and Genomes (KEGG) NetworkX Topological parser automates downloading, parsing, and converting from a KEGG Mark…☆12May 8, 2024Updated 2 years ago
- ☆11Jun 15, 2019Updated 6 years ago
- Chromax is a breeding simulator based on JAX.☆10Jun 6, 2025Updated 11 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- This repository implements a variety of sequence model architectures from scratch in PyTorch. Effort has been put to make the code well s…☆12Jun 25, 2021Updated 4 years ago
- Run EMR workloads on EKS☆13Sep 6, 2021Updated 4 years ago
- Python library & CLI to create, view and edit PFB files☆14Apr 24, 2026Updated last month
- An open and introductory book for the Python API of Apache Spark (pyspark) 📚📖☆12Sep 19, 2025Updated 8 months ago
- ☆16Nov 17, 2023Updated 2 years ago
- A simple python SDK around PubMed API.☆22Jan 1, 2025Updated last year
- ☆10Feb 22, 2023Updated 3 years ago
- Transform natural language into beautiful, interactive data visualizations using the Model Context Protocol (MCP) with Claude Desktop int…☆19Jun 27, 2025Updated 10 months ago
- Automated Machine Learning for Environmental Data-Driven Genome Prediction☆14Sep 12, 2025Updated 8 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Optimization solvers in pure Python: LP, MILP, SAT, constraint programming, graph and metaheuristics. No dependencies. Solvor all your op…☆28Apr 7, 2026Updated last month
- The Genomics Tertiary Analysis and Machine Learning Using Amazon SageMaker solution creates a scalable environment in AWS to develop mach…☆11Jul 7, 2023Updated 2 years ago
- KACC: A Multi-task Benchmark for Knowledge Abstraction, Concretization and Completion☆12Oct 21, 2021Updated 4 years ago
- This construct builds some elements for you to quickly launch an EMR Serverless application. After submitting the Emr Serverless job, you…☆11Nov 18, 2025Updated 6 months ago
- This is the pipeline of our new article "Enzyme Co-Scientist: Harnessing Large Language Models for Enzyme Kinetic Data Extraction from Li…☆17May 23, 2025Updated last year
- ☆16Feb 19, 2025Updated last year
- DeepVariant-on-Spark is a germline short variant calling pipeline that runs Google DeepVariant on Apache Spark at scale.☆12May 4, 2022Updated 4 years ago
- ☆13Mar 1, 2018Updated 8 years ago
- This repository contains example patterns for storing large objects with DynamoDB.☆13Jun 19, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Complete SQL + Databases Bootcamp: Zero to Mastery [2020]☆32Sep 29, 2020Updated 5 years ago
- Integrative protein sequence design with evolutionary multiobjective optimization.☆13Jul 16, 2024Updated last year
- Code necessary to reproduce experiments in "FloraBERT: cross-species transfer learning with attention-based neural networks for gene expr…☆13Jul 6, 2022Updated 3 years ago
- Open source alternate for GeoGuessr☆13Nov 4, 2023Updated 2 years ago
- ☆11May 14, 2026Updated last week
- ☆12Jan 25, 2018Updated 8 years ago
- ☆13Jun 27, 2024Updated last year
- Notes of Reinforcement Learning MOOC by University of Alberta☆23Jul 7, 2020Updated 5 years ago
- simulate sequence data and complicated pedigree structures☆16Apr 5, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Simplify Big Data Analytics with Amazon EMR, published by Packt☆13Jan 18, 2023Updated 3 years ago
- Python library for creating, validating, and executing machine learning pipelines represented as Executable Knowledge Graphs.☆21Mar 19, 2026Updated 2 months ago
- correlationMatrix is a Python powered library for the statistical analysis and visualization of correlations☆14Dec 17, 2024Updated last year
- ☆15Jan 11, 2024Updated 2 years ago
- Modeling the genomic regulatory codes of fly, mouse, worm, and fish with deep learning☆17Apr 6, 2021Updated 5 years ago
- ☆15Sep 26, 2019Updated 6 years ago
- Accessibility-ready business WordPress theme.☆14Sep 3, 2025Updated 8 months ago