Dwolla/arbalest

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Dwolla/arbalest)

Dwolla / arbalest

Arbalest is a Python data pipeline orchestration library for Amazon S3 and Amazon Redshift. It automates data import into Redshift and makes data queryable at scale in AWS.

☆39

Alternatives and similar repositories for arbalest

Users that are interested in arbalest are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

SamuraiT / pandas-rs
View on GitHub
Pandas extension for PostgreSQL and RedShift
☆16Nov 13, 2015Updated 10 years ago
JonathanMace / tpcds
View on GitHub
TPC-DS benchmarks including data generation with Spark and queries with Spark
☆15May 8, 2017Updated 9 years ago
uranusjr / django-gunicorn
View on GitHub
Run Django development server with Gunicorn.
☆14Apr 12, 2016Updated 10 years ago
jacobian / slackline
View on GitHub
WIP: Slack Bots using Django Channels
☆19Apr 14, 2016Updated 10 years ago
coursera / dataduct
View on GitHub
DataPipeline for humans.
☆250Jul 21, 2022Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
asandeep / airflow-ecr-plugin
View on GitHub
Airflow AWS ECR integration
☆10Feb 25, 2020Updated 6 years ago
mrafayaleem / simple-crawler
View on GitHub
A super simple webcrawler framework written in Python.
☆24Mar 8, 2016Updated 10 years ago
optimizely / chef-druid
View on GitHub
Chef cookbook for the http://druid.io/
☆10Apr 25, 2016Updated 10 years ago
natgaertner / ersatzpg
View on GitHub
A version of pgloader stripped down to components I found essential. Sacrifices a lot of flexibility, but throws out some overhead
☆18Jun 5, 2013Updated 13 years ago
remeniuk / rubbercube
View on GitHub
Cubes over ElasticSearch. Aggregation library for Business Intelligence
☆20Dec 14, 2014Updated 11 years ago
nahuelcandia / mobile-app-maker
View on GitHub
Open Source CMS and Mobile app maker, that enables non developers (but also developers) to create flexible and powerful mobile applicatio…
☆25Dec 9, 2017Updated 8 years ago
heroku / shh
View on GitHub
Shh - Systems Heuristics Herald
☆39Jun 2, 2026Updated last month
KleinYuan / tf-blocks
View on GitHub
Building blocks of tensorflow architectures
☆11Oct 14, 2019Updated 6 years ago
motiz88 / jsgrest
View on GitHub
Postgres REST API server in JavaScript (a la PostgREST)
☆12Dec 28, 2017Updated 8 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
Wikia / discreETLy
View on GitHub
ETLy is an add-on dashboard service on top of Apache Airflow.
☆69Jul 21, 2023Updated 3 years ago
ychantit / airflow_aws_utils
View on GitHub
A collection of airflow sample workflows for data processing on aws
☆12Dec 1, 2017Updated 8 years ago
keithrozario / S3-71
View on GitHub
Copy millions of objects in minutes
☆12Oct 21, 2019Updated 6 years ago
datamill-co / target-redshift
View on GitHub
A Singer.io Target for Redshift
☆23Jun 1, 2021Updated 5 years ago
openmailold / redshift_show_create_table
View on GitHub
python script, 'show create table' equivalent for aws redshift
☆32Aug 5, 2016Updated 9 years ago
joestubbs / endofday
View on GitHub
Execute pipelines and other workflows of docker containers.
☆11Oct 1, 2016Updated 9 years ago
jmarshallnz / statsNZ
View on GitHub
R package for accessing the StatisticsNZ API
☆10Feb 20, 2023Updated 3 years ago
snowplow-archive / icebucket
View on GitHub
UNRELEASED. An opinionated framework for analytics-on-write on event streams using key-value storage
☆14Sep 10, 2015Updated 10 years ago
jborowitz / x12
View on GitHub
Python code to seasonally adjust data using the census X12-ARIMA program: http://www.census.gov/srd/www/x12a/
☆11Mar 22, 2012Updated 14 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
Urigo / Thinkster-MEAN-Tutorial-in-angular-meteor
View on GitHub
angular-meteor version of Thinkster.io's mean-stack-tutorial
☆10Oct 16, 2017Updated 8 years ago
FreckleIOT / ecs-airflow
View on GitHub
Cloudformation templates for deploying Airflow in ECS
☆40Nov 27, 2018Updated 7 years ago
gotitinc / mongo-bigquery
View on GitHub
Load your MongoDB collection into Google BigQuery. Supports complex JSON structure.
☆28Sep 25, 2016Updated 9 years ago
jimthompson5802 / kaggle-BNP-Paribas
View on GitHub
Kaggle Competition BNP Pairbas Cardif Claims Management: Rank 133 out of 2,926 (Top 5%)
☆14May 10, 2016Updated 10 years ago
criteo / berilia
View on GitHub
Create hadoop cluster in aws ec2 for development
☆11Sep 8, 2017Updated 8 years ago
Newmu / Salary-Prediction
View on GitHub
Code for the Adzuna Salary Prediction Kaggle competition - http://www.kaggle.com/c/job-salary-prediction Placed 10th out of approximately…
☆11Apr 10, 2013Updated 13 years ago
RefugeesOnRails / machina
View on GitHub
Script to set up the Machines for Refugees On Rails
☆11Sep 17, 2017Updated 8 years ago
snowplow / data-models
View on GitHub
⚠️ MAINTENANCE-ONLY MODE: Snowplow maintained SQL data models for working with Snowplow web and mobile behavioral data.
☆42Jan 8, 2025Updated last year
geowurster / NewlineJSON
View on GitHub
Streaming newline delimited JSON I/O.
☆12Jun 30, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
manugarri / Reddit-Recommendation-Engine
View on GitHub
Implementation of a Recommendation Engine for Reddit
☆12Nov 19, 2014Updated 11 years ago
ecerami / hello_flask
View on GitHub
Simplest example of flask, pandas and plotly.
☆16Dec 29, 2015Updated 10 years ago
jpayne0061 / python_crawler
View on GitHub
this script script no longer works due to changes in Amazon's servers
☆10Mar 12, 2017Updated 9 years ago
vsouza / spark-kinesis-redshift
View on GitHub
Example project for consuming AWS Kinesis streamming and save data on Amazon Redshift using Apache Spark
☆11May 22, 2018Updated 8 years ago
weaponsjtu / Kaggle_xBle
View on GitHub
A Swiss Army Knife for Machine Learning Practice, cross validation, model selection, ensemble selection, stacking
☆16May 11, 2016Updated 10 years ago
kassambara / r2excel
View on GitHub
Read, write and format Excel files using R
☆15Sep 29, 2025Updated 10 months ago
symphonyrm / lookml-gen
View on GitHub
Generate LookML with Python code
☆30Apr 13, 2026Updated 3 months ago