criteo/cluster-pack

A library on top of either pex or conda-pack to make your Python code easily available on a cluster

☆47

Alternatives and similar repositories for cluster-pack

Users that are interested in cluster-pack are comparing it to the libraries listed below.

criteo / mlflow-yarn
Backend implementation for running MLFlow projects on Hadoop/YARN.
☆11 · Dec 27, 2022 · Updated 3 years ago
criteo / tf-yarn
Train TensorFlow models on YARN in just a few lines of code!
☆93 · Nov 3, 2023 · Updated 2 years ago
jcrist / skein
A tool and library for easily deploying applications on Apache YARN
☆145 · Mar 12, 2024 · Updated 2 years ago
criteo / deepr
The deepr module provides abstractions (layers, readers, prepro, metrics, config) to help build TensorFlow models on top of TF Estimators
☆53 · Nov 10, 2023 · Updated 2 years ago
jcrist / hadoop-test-cluster
Dockerized setup for testing code on realistic Hadoop clusters
☆26 · Jul 20, 2020 · Updated 6 years ago
joerg-schneider / airtunnel
The sane way of building a data layer in Airflow
☆24 · Dec 5, 2019 · Updated 6 years ago
psygrammer / ParkS
☆16 · Dec 4, 2017 · Updated 8 years ago
ROM-mm / spark-setup
Standalone Apache Spark installer for Linux systems (Debian- and RHEL-based)
☆13 · Dec 10, 2024 · Updated last year
Gabrielcarvfer / Estrutura-de-dados-UnB
Support material for the data structures course
☆13 · Jul 16, 2023 · Updated 3 years ago
broxtronix / spark-gce
A tool for running Spark on Google Compute Engine
☆16 · Jan 20, 2017 · Updated 9 years ago
alarcon7a / redshift_course
Data repository for the Redshift course
☆15 · Sep 4, 2020 · Updated 5 years ago
japila-books / pyspark-internals
The Internals of PySpark
☆28 · Dec 29, 2024 · Updated last year
sergey-serebryakov / tensorflow-internals
An open-source ebook about the TensorFlow kernel and implementation mechanisms.
☆17 · Nov 24, 2018 · Updated 7 years ago
yaooqinn / itachi
A library that brings useful functions from various modern database management systems to Apache Spark
☆63 · Sep 4, 2023 · Updated 2 years ago
bdelbosc / jmxstat
Poll JMX attributes from the command line
☆17 · Sep 4, 2012 · Updated 13 years ago
desertaxle / airbyte-prefect-recipe
Demo of orchestrating Airbyte connections with Prefect
☆11 · Mar 3, 2022 · Updated 4 years ago
lichenran1234 / load-test
☆13 · Apr 24, 2023 · Updated 3 years ago
scalingpythonml / scaling-python-with-dask
A work-in-progress book on Dask
☆12 · Jul 15, 2023 · Updated 3 years ago
layer6ai-labs / TAFA
Code for the RecSys'20 paper "TAFA: Two-headed Attention Fused Autoencoder for Context-Aware Recommendations"
☆19 · Aug 15, 2020 · Updated 5 years ago
ykursadkaya / pyspark-Docker
PySpark in Docker Containers
☆29 · Jun 22, 2022 · Updated 4 years ago
yennanliu / utility_shell
Collection of shell/Bash scripts for various use cases | #SE
☆11 · Jul 10, 2026 · Updated 2 weeks ago
gavin-s-smith / mcrforest
☆11 · Aug 22, 2025 · Updated 11 months ago
kurron / docker-snowsql
Docker image that contains the Snowflake CLI
☆15 · Jan 30, 2021 · Updated 5 years ago
corriebar / statrethinking_reading_group
Material for the Berlin Bayesian reading group covering Statistical Rethinking by Richard McElreath
☆10 · May 7, 2020 · Updated 6 years ago
nguqtruong / tiki-price-watch
Track TIKI product price changes with GitHub Actions
☆14 · Jan 16, 2022 · Updated 4 years ago
ulfaslak / what-the-chat
A small application that summarizes conversations in a Discord channel
☆20 · Oct 27, 2025 · Updated 8 months ago
emsixteeen / IterativeReduce
Iterative Reduce
☆22 · Jun 3, 2014 · Updated 12 years ago
nishansubedi / fastText
Library for fast text representation and classification.
☆10 · Apr 17, 2022 · Updated 4 years ago
hbutani / icebergSQL
Integration of Iceberg table management into Spark SQL
☆11 · Jan 21, 2020 · Updated 6 years ago
alexeygrigorev / cikm-cup-2016-cross-device
Solution for the Cross-Device linking challenge from CIKM CUP 2016
☆24 · Dec 6, 2016 · Updated 9 years ago
scala-infer / scala-infer
Scala embedded universal probabilistic programming language
☆11 · Apr 15, 2021 · Updated 5 years ago
JervyShi / reactor-guide-zh
Chinese translation of the Reactor Guide
☆11 · Nov 9, 2015 · Updated 10 years ago
yugabyte / terraform-aws-yugabyte
A Terraform module to deploy and run YugabyteDB on AWS.
☆21 · Updated this week
allwefantasy / sql-code-intelligence
SQL code autocomplete
☆45 · Sep 2, 2020 · Updated 5 years ago
AdiVarma27 / pyAB
Python package for Bayesian & Frequentist A/B Testing
☆12 · Jul 6, 2023 · Updated 3 years ago
criteo / babar
Profiler for large-scale distributed Java applications (Spark, Scalding, MapReduce, Hive, ...) on YARN.
☆129 · Sep 7, 2018 · Updated 7 years ago
rdblue / s3committer
Hadoop output committers for S3
☆114 · Jul 9, 2020 · Updated 6 years ago
afiaka87 / dalle-pytorch-datasets
☆12 · Jun 14, 2021 · Updated 5 years ago
CausalML / interventions-disparate-impact-responders
Assessing Disparate Impacts of Personalized Interventions: Identifiability and Bounds
☆11 · Oct 28, 2019 · Updated 6 years ago