hauntsaninja / boostedblob
Command line tool and async library to perform basic file operations on local paths, Google Cloud Storage paths and Azure Blob Storage paths.
☆23Updated last month
Related projects ⓘ
Alternatives and complementary repositories for boostedblob
- A library to create and manage configuration files, especially for machine learning projects.☆77Updated 2 years ago
- A sample pattern for running CI tests on Modal☆13Updated last month
- Read Google Cloud Storage, Azure Blobs, and local paths with the same interface☆58Updated 2 months ago
- ☆22Updated last year
- A file utility for accessing both local and remote files through a unified interface.☆36Updated 3 months ago
- A library for squeakily cleaning and filtering language datasets.☆45Updated last year
- One stop shop for all things carp☆58Updated 2 years ago
- Drop-in replacements for Python's map function☆13Updated last year
- Parallel data preprocessing for NLP and ML.☆33Updated last week
- Mechanistic Interpretability for Transformer Models☆49Updated 2 years ago
- ☆58Updated 2 years ago
- A library to instantiate any Python object from configuration files.☆24Updated 2 years ago
- Ludwig benchmark☆19Updated 2 years ago
- ☆29Updated 2 weeks ago
- Code for SaGe subword tokenizer (EACL 2023)☆22Updated last month
- Fast, Modern, Memory Efficient, and Low Precision PyTorch Optimizers☆58Updated 3 months ago
- https://footprints.baulab.info☆12Updated last month
- Minimum Description Length probing for neural network representations☆16Updated last week
- A place to store reusable transformer components of my own creation or found on the interwebs☆43Updated last week
- we got you bro☆32Updated 3 months ago
- A dataset of alignment research and code to reproduce it☆69Updated last year
- Using queues, tqdm-multiprocess supports multiple worker processes, each with multiple tqdm progress bars, displaying them cleanly throug…☆41Updated 3 years ago
- See the issue board for the current status of active and prospective projects!