Population Stratification Analysis on Genomics Data Using Deep Learning
☆25Sep 5, 2016Updated 9 years ago
Alternatives and similar repositories for popstrat
Users that are interested in popstrat are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Ready-to-go Parquet-formatted public 'omics datasets☆30Nov 2, 2015Updated 10 years ago
- Integrate the GA4GH schemas and probably a scala impl of the service.☆14May 20, 2016Updated 10 years ago
- An example of bioinformatics and bigdata tools can playing nicely together☆14May 17, 2016Updated 10 years ago
- Efficient, distributed downloads of large files from S3 to HDFS using Spark.☆17Apr 26, 2017Updated 9 years ago
- Bioinformatics / Explore oligonucleotide composition similarity between assembly contigs or scaffolds to detect contaminant DNA.☆12Feb 27, 2019Updated 7 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Finding a scalable alternative to the VCF File for genomics analysis☆14Jan 5, 2017Updated 9 years ago
- General utility code used across BDG products. Apache 2 licensed.☆18Mar 17, 2026Updated 2 months ago
- Parallel Genomic Analysis Toolkit☆14Feb 11, 2019Updated 7 years ago
- Pipeline for analyzing rare mutations in metagenome-assembled genomes☆10Apr 4, 2025Updated last year
- Open source formats for scalable genomic processing systems using Avro. Apache 2 licensed.☆41Feb 13, 2026Updated 4 months ago
- Spark VCF data source implementation for Dataframes☆15Jul 15, 2022Updated 3 years ago
- python stuff I use☆20Feb 16, 2026Updated 3 months ago
- Integrated Variant Caller☆17Mar 15, 2018Updated 8 years ago
- A genomics pipeline build on top of the GATK Queue framework. Main repository: https://github.com/NationalGenomicsInfrastructure/piper (m…☆21Sep 6, 2016Updated 9 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Bioinformatics Ketrew Pipelines☆28Nov 14, 2021Updated 4 years ago
- Automated CWL and Galaxy XML generation for Python tools that use argparse and click☆11Jul 29, 2019Updated 6 years ago
- Companion repo for ExAC paper, 2015☆33Oct 10, 2016Updated 9 years ago
- Toil workflows for common genomic pipelines☆33Oct 3, 2019Updated 6 years ago
- Core consonance utilities for scheduling, reporting on, and provisioning VMs for workflows☆14Jun 27, 2018Updated 7 years ago
- Reproducing Distributed Systems and Experiments on Cloud☆40Sep 11, 2023Updated 2 years ago
- Data exploration with multiple machine learning algorithms☆14Nov 28, 2021Updated 4 years ago
- A workflow assembler for cancer genome analytics and informatics☆19Nov 16, 2016Updated 9 years ago
- Chromosome Scale Assembler: A high-throughput chromosome scale genome assembly pipeline for vertebrate genomes☆10Oct 16, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A python package from Pacific Biosciences to analyze centromeric sequences☆21Oct 7, 2015Updated 10 years ago
- Miscellaneous functionality for manipulating Apache Spark RDDs.☆22Dec 29, 2018Updated 7 years ago
- variant integration methods for the 1000 Genomes Project☆21Jan 16, 2018Updated 8 years ago
- Distributed execution of bioinformatics tools on Apache Spark. Apache 2 licensed.☆41Mar 17, 2026Updated 2 months ago
- Sample App. Amazon Product Descriptions Wordcloud. Spark Streaming, Algebird, Storehaus, Redis, Scala Scraper, OpenNLP, Play Framework, D…☆12Nov 9, 2015Updated 10 years ago
- This repository implements converters and tools for working with NGS data in HPC or Hadoop cluster☆17Apr 13, 2018Updated 8 years ago
- Do not use - please refer to our newest code: https://github.com/cgat-developers/cgat-apps☆124Nov 8, 2018Updated 7 years ago
- Location of structural errors in a genome assembly and structural variations between a pair of genomes☆11Sep 27, 2019Updated 6 years ago
- Build an index for your BAM Index (BAI)☆17Apr 14, 2015Updated 11 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- VariantSpark is a framework for applying Spark-based Machine Learning methods to whole-genome variant information☆33Sep 28, 2017Updated 8 years ago
- ☆30Oct 6, 2021Updated 4 years ago
- Genomic signature interpretation tool for DNA double-strand break repair mechanism☆11Oct 8, 2025Updated 8 months ago
- Pynocular is a lightweight ORM that lets you query your database using Pydantic models and asyncio☆11May 24, 2022Updated 4 years ago
- A simple tutorial application for working with Twitter4j using Scala.☆14Feb 26, 2013Updated 13 years ago
- ADAM is a genomics analysis platform with specialized file formats built using Apache Avro, Apache Spark, and Apache Parquet. Apache 2 li…☆1,053Mar 17, 2026Updated 2 months ago
- The SnoVault general purpose hybrid object-relational database☆16Mar 7, 2024Updated 2 years ago