A parallel implementation of the bzip2 data compressor in python, this data compression pipeline is using algorithms like Burrows–Wheeler transform (BWT) and Move to front (MTF) to improve the Huffman compression. For now, this tool only will be focused on compressing .csv files, and other files on tabular format.
☆13Jun 29, 2022Updated 3 years ago
Alternatives and similar repositories for wbz
Users that are interested in wbz are comparing it to the libraries listed below
Sorting:
- Scheduling Big Data Workloads and Data Pipelines in the Cloud with pyDag☆23Sep 19, 2022Updated 3 years ago
- The goal of this project is to identify students at risk of dropping out the school☆22May 7, 2021Updated 4 years ago
- Challenge Data Engineer☆25Jun 13, 2022Updated 3 years ago
- Dockerizing and Consuming an Apache Livy environment☆13Jun 29, 2022Updated 3 years ago
- The goal of this project is to offer an AWS EMR template using Spot Fleet and On-Demand Instances that you can use quickly. Just focus on…☆28Jun 13, 2022Updated 3 years ago
- Build a Content-Based Movie Recommender System (TF-IDF, BM25, BERT)☆13Jun 13, 2022Updated 3 years ago
- The goal of this project is to track the expenses of Uber Rides and Uber Eats through data Engineering processes using technologies such …☆123Jun 29, 2022Updated 3 years ago
- Dockerizing a Python Script for Web Scraping and consume the scraped data using FastApi (www.metroscubicos.com)☆15Dec 16, 2021Updated 4 years ago
- Find out which countries have won the most medals and how the participation of nations has changed over time, with R☆10Aug 22, 2021Updated 4 years ago
- DataForge helps data teams write functional transformation pipelines by leveraging software engineering principles☆59Feb 27, 2026Updated last week
- Simple chatbot created using Rasa☆10Feb 20, 2021Updated 5 years ago
- Color detection beginner data science project☆13Dec 6, 2020Updated 5 years ago
- ☆12Sep 21, 2023Updated 2 years ago
- This repository contains a visual studio project for training a classifier on the mnist dataset using the libtorch c++ wrapper.☆12Oct 13, 2020Updated 5 years ago
- 🌌 Real-time threat detection for smart contracts☆10May 16, 2023Updated 2 years ago
- Natural Language Processing Project☆11Jul 6, 2021Updated 4 years ago
- A repository for Analysis of Toronto Neighbourhoods (An IBM Data Science Capstone Project)☆10Jan 15, 2021Updated 5 years ago
- StyleGAN2-ADA for generation of synthetic skin lesions☆13Aug 2, 2023Updated 2 years ago
- Predict customer churn with text and interpretability.☆12Sep 20, 2021Updated 4 years ago
- Sentiment Analysis of COVID-19 Vaccine-related Twitter Data☆10May 30, 2021Updated 4 years ago
- This tool designs guides for use with the base editor technology.☆12Aug 11, 2023Updated 2 years ago
- Programming for Biology @ CSHL 2023☆14Oct 17, 2025Updated 4 months ago
- ☆11Oct 11, 2020Updated 5 years ago
- Host Your Own Offline Mapping Server by running the provided Jupyter Notebook.☆11Oct 5, 2021Updated 4 years ago
- ☆12Jan 20, 2024Updated 2 years ago
- A Jupyter project that demonstrates how to access local data from OpenStreetMap to improve your ML models. Demonstrates the use of K-D Tr…☆12Sep 16, 2020Updated 5 years ago
- word4num is a versatile tool for encoding numbers into words, applicable for geolocation, phone numbers, postcodes, IPv4 addresses, and m…☆12Oct 9, 2024Updated last year
- ☆10May 16, 2022Updated 3 years ago
- A model to analyse the performance of soccer players☆12Oct 28, 2020Updated 5 years ago
- An elaborate approach for ABC-XYZ Analysis☆11May 10, 2020Updated 5 years ago
- ☆10Dec 3, 2020Updated 5 years ago
- EfficientNet model is fine-tuned on facial expressions to detect 6 of the basic emotions☆11May 27, 2021Updated 4 years ago
- ClusterV: finding HIV quasispecies and drug resistance from ONT sequencing data☆12Jan 7, 2025Updated last year
- Learn Python Data Analytics By Example: Airline Arrival Delays☆10Apr 3, 2023Updated 2 years ago
- Analyzing the most strategic words to guess on Wordle, based on letter frequency distributions☆11Feb 20, 2022Updated 4 years ago
- This repository is the project page for "Point Anywhere: Directed Object Estimation from Omnidirectional Images", including source code …☆12Aug 25, 2023Updated 2 years ago
- ☆10Dec 14, 2020Updated 5 years ago
- A downloadable pdf containing summary of frequently used pandas operations.☆10Sep 26, 2020Updated 5 years ago
- LAAVA: Long-read AAV Analysis☆13Dec 9, 2025Updated 2 months ago