Build and deploy a serverless data pipeline on AWS with no effort.
☆111 · Feb 8, 2023 · Updated 3 years ago
Alternatives and similar repositories for datajob
Users that are interested in datajob are comparing it to the libraries listed below.
- Glue VSCode devcontainer setup ☆14 · Jan 31, 2023 · Updated 3 years ago
- Text-to-SQL in the Wild: A Naturally-Occurring Dataset Based on Stack Exchange Data ☆105 · Jun 25, 2021 · Updated 4 years ago
- Fast Data Science, AKA fds, is a CLI for Data Scientists to version control data and code at once, by conveniently wrapping git and dvc ☆391 · Jun 30, 2024 · Updated last year
- control spark-shell from vim ☆11 · Oct 27, 2016 · Updated 9 years ago
- Guide on how to setup Apache Airflow containers using Docker and IBM Bluemix ☆11 · Feb 19, 2018 · Updated 8 years ago
- The elegance of Airflow + the power of AWS ☆51 · Feb 5, 2024 · Updated 2 years ago
- This Guidance demonstrates how you can automate your carbon footprint tracking with the Sustainability Insights Framework (SIF) on AWS ☆29 · Oct 20, 2024 · Updated last year
- A simple NLP library allows profiling datasets with one or more text columns. When given a dataset and a column name containing text data… ☆243 · May 12, 2024 · Updated last year
- Repo contains Jupyter notebooks compiled during my review of the programming books listed. ☆13 · Mar 9, 2022 · Updated 4 years ago
- An easy-to-use Python module that helps you to extract the BERT embeddings for a large text dataset (Bengali/English) efficiently. ☆36 · May 18, 2023 · Updated 2 years ago
- The sample code provides a deploy function and an executable to easily deploy an Amazon Lex bot based on a Lex Schema file. ☆23 · Nov 2, 2023 · Updated 2 years ago
- GLaRA: Graph-based Labeling Rule Augmentation for Weakly Supervised Named Entity Recognition ☆31 · Jan 31, 2022 · Updated 4 years ago
- Exploration of Health-Related Tweets through Topic Modeling & Sentiment Analysis ☆20 · Apr 17, 2024 · Updated last year
- An open-source AutoML Library based on PyTorch ☆309 · Jan 5, 2026 · Updated 2 months ago
- This application "listens" for a ticket creation event from Zendesk, analyses the ticket for negative sentiment, tags the ticket accordin… ☆14 · Mar 10, 2025 · Updated last year
- Multivariate Boosted TRee ☆62 · Oct 3, 2022 · Updated 3 years ago
- A deep learning application to simulate Holi effect for your group pictures. ☆11 · Jan 17, 2021 · Updated 5 years ago
- This construct builds some elements for you to quickly launch an EMR Serverless application. After submitting the Emr Serverless job, you… ☆11 · Nov 18, 2025 · Updated 4 months ago
- All your AWS Stepfunctions at a glance in the terminal! 🧐 ☆28 · May 31, 2022 · Updated 3 years ago
- Blog post on ETL pipelines with Airflow ☆24 · Aug 31, 2025 · Updated 6 months ago
- Search engine for finding and downloading debate evidence ☆40 · Jan 25, 2023 · Updated 3 years ago
- A CLI to convert SQL models across database dialects in your dbt projects. ☆15 · May 6, 2025 · Updated 10 months ago
- AWS Quick Start Team ☆39 · Oct 3, 2024 · Updated last year
- That is an AWS CDK custom construct based on Tony's amazing Prowler Security, Hardening, Best Practises Tool https://github.com/toniblyx/… ☆24 · May 10, 2023 · Updated 2 years ago
- Corresponding code repo for the paper at COLING 2020 - ARGMIN 2020: "DebateSum: A large-scale argument mining and summarization dataset" ☆55 · Dec 2, 2021 · Updated 4 years ago
- Automated Jupyter notebook testing. 📙 ☆41 · Jan 25, 2024 · Updated 2 years ago
- AWS AppSync resolver that provides GraphQL access to Athena databases ☆14 · Oct 19, 2022 · Updated 3 years ago
- Minimal implementation of Denoised Smoothing (https://arxiv.org/abs/2003.01908) in TensorFlow. ☆20 · Aug 4, 2021 · Updated 4 years ago
- Slides and code for the PyData Berlin 2018 tutorial ☆16 · Nov 21, 2022 · Updated 3 years ago
- Resources to share for people attending the Innovate AI & ML 2021 Event. ☆16 · Apr 4, 2025 · Updated 11 months ago
- AWS AppSync Session Manager - a sample AppSync project with Amazon Neptune ☆13 · Aug 16, 2018 · Updated 7 years ago
- A Data Platform built for AWS, powered by Kubernetes. ☆147 · Jul 24, 2023 · Updated 2 years ago
- Self-exploratory Streamlit app to know more about palmer penguins. ☆11 · Jun 26, 2023 · Updated 2 years ago
- QED: A Framework and Dataset for Explanations in Question Answering ☆119 · Aug 3, 2021 · Updated 4 years ago
- Deploying a serverless inference service with Amazon SageMaker Pipelines, AWS Lambda, Amazon API Gateway, and CDK ☆13 · Apr 12, 2021 · Updated 4 years ago
- AWS Glue Libraries are additions and enhancements to Spark for ETL operations. ☆698 · Jan 13, 2026 · Updated 2 months ago