Build and deploy a serverless data pipeline on AWS with no effort.
β111Feb 8, 2023Updated 3 years ago
Alternatives and similar repositories for datajob
Users that are interested in datajob are comparing it to the libraries listed below
Sorting:
- Powerful rapid automatic EDA and feature engineering library with a very easy to use API πβ53Jan 8, 2022Updated 4 years ago
- β19Oct 10, 2020Updated 5 years ago
- β10Dec 26, 2018Updated 7 years ago
- Multivariate Boosted TReeβ63Oct 3, 2022Updated 3 years ago
- β10Jan 3, 2022Updated 4 years ago
- Self-exploratory Streamlit app to know more about palmer penguins.β11Jun 26, 2023Updated 2 years ago
- Search engine for finding and downloading debate evidenceβ40Jan 25, 2023Updated 3 years ago
- GLaRA: Graph-based Labeling Rule Augmentation for Weakly Supervised Named Entity Recognitionβ31Jan 31, 2022Updated 4 years ago
- A comprehensive tool for linguistic analysis of communitiesβ49Oct 1, 2021Updated 4 years ago
- Glue VSCode devcontainer setupβ14Jan 31, 2023Updated 3 years ago
- β10Nov 7, 2020Updated 5 years ago
- On Generating Extended Summaries of Long Documentsβ78Jan 26, 2021Updated 5 years ago
- All your AWS Stepfunctions at a glance in the terminal! π§β28May 31, 2022Updated 3 years ago
- A PyPI package for easy text annotation in a Jupyter Notebook.β29Aug 9, 2021Updated 4 years ago
- A deep learning application to simulate Holi effect for your group pictures.β10Jan 17, 2021Updated 5 years ago
- β13Aug 13, 2020Updated 5 years ago
- View a list of JSON-serializable dictionaries or a 2-D array, in HandsOnTable, in Jupyter Notebook.β13Oct 11, 2018Updated 7 years ago
- An open-source AutoML Library based on PyTorchβ309Jan 5, 2026Updated last month
- Interactive tree-maps with SBERT & Hierarchical Clustering (HAC)β30Dec 31, 2024Updated last year
- The Endatabas Bookβ16Aug 22, 2024Updated last year
- Manage your project and team road maps in YAMLβ15Feb 20, 2026Updated last week
- Operations Research Algorithmsβ19Mar 20, 2024Updated last year
- β16Aug 26, 2021Updated 4 years ago
- Birgitta is a Python ETL test and schema framework, providing automated tests for pyspark notebooks/recipes.β14Nov 9, 2023Updated 2 years ago
- β15Dec 20, 2020Updated 5 years ago
- Corresponding code repo for the paper at COLING 2020 - ARGMIN 2020: "DebateSum: A large-scale argument mining and summarization dataset"β55Dec 2, 2021Updated 4 years ago
- Alternate Implementation for Zero Shot Text Classification: Instead of reframing NLI/XNLI, this reframes the text backbone of CLIP modelsβ¦β37Apr 5, 2022Updated 3 years ago
- Small script for automating mkgendocs and mkdocs filesβ19Apr 14, 2022Updated 3 years ago
- Singularity Global Client for container managementβ15Jun 20, 2023Updated 2 years ago
- Slides and code for the PyData Berlin 2018 tutorialβ16Nov 21, 2022Updated 3 years ago
- Repo contains Jupyter notebooks compiled during my review of the programming books listed.β13Mar 9, 2022Updated 3 years ago
- A Python wrapper for the Australian Bureau of Meteorology's Space Weather API.β17Jan 24, 2025Updated last year
- A Data Platform built for AWS, powered by Kubernetes.β147Jul 24, 2023Updated 2 years ago
- An easy-to-use Python module that helps you to extract the BERT embeddings for a large text dataset (Bengali/English) efficiently.β36May 18, 2023Updated 2 years ago
- β16May 1, 2023Updated 2 years ago
- A serverless datalake project and framework based on AWS S3οΌGlueοΌAthenaοΌMWAA and QuickSight. With a series of best practices, it guides yβ¦β16Nov 22, 2022Updated 3 years ago
- Fine-grained, dynamic control of neural network topology in JAX.β21Jul 23, 2023Updated 2 years ago
- FUSE filesystem for the DNAnexus storage systemβ13Jan 26, 2026Updated last month
- Low-code pre-built pipelines for experiments with huggingface/transformers for Data Scientists in a rush.β16Oct 14, 2020Updated 5 years ago