☆24Dec 21, 2020Updated 5 years ago
Alternatives and similar repositories for data-analysis-with-python-and-pyspark
Users that are interested in data-analysis-with-python-and-pyspark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- C++ 11 version of my Fortran code gmsh-to-vtk-and-tecplot☆12Jan 14, 2022Updated 4 years ago
- ☆18Dec 2, 2025Updated 3 months ago
- Python library & CLI to create, view and edit PFB files☆12Feb 19, 2026Updated last month
- Harry Potter Deep Learning Experiment☆10Mar 25, 2023Updated 2 years ago
- Demo code to illustrate the execution of PyTest unit test cases for AWS Glue jobs in AWS CodePipeline using AWS CodeBuild projects☆49Dec 3, 2024Updated last year
- Data and code for the college major recommender system I built for my Galvanize final project☆10Mar 9, 2017Updated 9 years ago
- Kyoto Encylopedia of Genes and Genomes (KEGG) NetworkX Topological parser automates downloading, parsing, and converting from a KEGG Mark…☆11May 8, 2024Updated last year
- ☆11Jun 15, 2019Updated 6 years ago
- ☆13Jun 20, 2024Updated last year
- Chromax is a breeding simulator based on JAX.☆10Jun 6, 2025Updated 9 months ago
- Run EMR workloads on EKS☆13Sep 6, 2021Updated 4 years ago
- This is a simple demo for integrating Authing in AWS China region to protect API Gateway REST API and other AWS resources such as IoT, Po…☆12Oct 12, 2024Updated last year
- Ascertained Sequentially Markovian Coalescent☆16Oct 22, 2025Updated 5 months ago
- Optimization solvers in pure Python: LP, MILP, SAT, constraint programming, graph and metaheuristics. No dependencies. Solvor all your op…☆26Feb 1, 2026Updated last month
- ☆11Mar 4, 2025Updated last year
- The Genomics Tertiary Analysis and Machine Learning Using Amazon SageMaker solution creates a scalable environment in AWS to develop mach…☆11Jul 7, 2023Updated 2 years ago
- A self-contained, ready to run Airflow and Kafka project. Can be run locally or within codespaces.☆16Jul 15, 2023Updated 2 years ago
- This construct builds some elements for you to quickly launch an EMR Serverless application. After submitting the Emr Serverless job, you…☆11Nov 18, 2025Updated 4 months ago
- This is the pipeline of our new article "Enzyme Co-Scientist: Harnessing Large Language Models for Enzyme Kinetic Data Extraction from Li…☆16May 23, 2025Updated 10 months ago
- ☆16Feb 19, 2025Updated last year
- DeepVariant-on-Spark is a germline short variant calling pipeline that runs Google DeepVariant on Apache Spark at scale.☆12May 4, 2022Updated 3 years ago
- An AI-powered literature review assistant for researchers☆25Apr 18, 2025Updated 11 months ago
- Open-source opinionated Galaxy-based framework for microbiota analysis☆14Jan 21, 2021Updated 5 years ago
- This repository contains example patterns for storing large objects with DynamoDB.☆13Jun 19, 2024Updated last year
- Code necessary to reproduce experiments in "FloraBERT: cross-species transfer learning with attention-based neural networks for gene expr…☆13Jul 6, 2022Updated 3 years ago
- A tool for phasing and imputing haplotypes in 10k+ low coverage sequencing samples☆10Nov 20, 2020Updated 5 years ago
- Open source alternate for GeoGuessr☆13Nov 4, 2023Updated 2 years ago
- GitHub Action to Test Airflow Dags☆22Feb 5, 2026Updated last month
- ☆13Jun 27, 2024Updated last year
- ☆12Jan 25, 2018Updated 8 years ago
- GEFormer is a genome-wide prediction model for genotype-environment interactions based on a deep learning approach designed to predict ma…☆14Jan 15, 2026Updated 2 months ago
- Notes of Reinforcement Learning MOOC by University of Alberta☆23Jul 7, 2020Updated 5 years ago
- Simplify Big Data Analytics with Amazon EMR, published by Packt☆13Jan 18, 2023Updated 3 years ago
- The Gen3 Workflow Execution Service☆10Mar 11, 2022Updated 4 years ago
- Files to Build a Docker Image for Facebook Prophet☆13Feb 7, 2019Updated 7 years ago
- Improving the development of Spark applications deployed as jobs on AWS services like Glue and EMR☆10Jul 26, 2023Updated 2 years ago
- Python library for Executable Machine Learning Knowledge Graphs☆21Updated this week
- Modeling the genomic regulatory codes of fly, mouse, worm, and fish with deep learning☆17Apr 6, 2021Updated 4 years ago
- ☆15Jan 11, 2024Updated 2 years ago