hauntsaninja / boostedblob
Command line tool and async library to perform basic file operations on local paths, Google Cloud Storage paths and Azure Blob Storage paths.
☆25Updated last month
Alternatives and similar repositories for boostedblob:
Users that are interested in boostedblob are comparing it to the libraries listed below
- A library to create and manage configuration files, especially for machine learning projects.☆76Updated 2 years ago
- ☆20Updated last year
- A sample pattern for running CI tests on Modal☆14Updated 5 months ago
- PyTorch interface for TrueGrad Optimizers☆42Updated last year
- a pipeline for using api calls to agnostically convert unstructured data into structured training data☆29Updated 4 months ago
- A dataset of alignment research and code to reproduce it☆73Updated last year
- One stop shop for all things carp☆59Updated 2 years ago
- ☆22Updated last year
- Mechanistic Interpretability for Transformer Models☆49Updated 2 years ago
- A library for squeakily cleaning and filtering language datasets.☆46Updated last year
- Read Google Cloud Storage, Azure Blobs, and local paths with the same interface☆63Updated 5 months ago
- ☆31Updated 3 weeks ago
- HomebrewNLP in JAX flavour for maintable TPU-Training☆48Updated last year
- Opinionated library for managing hyperparameters and mutable state of machine learning training systems.☆19Updated last year
- Code for SaGe subword tokenizer (EACL 2023)☆22Updated 2 months ago
- Chat Markup Language conversation library☆55Updated last year
- See the issue board for the current status of active and prospective projects!☆65Updated 3 years ago
- ☆24Updated last year
- Ludwig benchmark☆20Updated 2 years ago
- Python tools☆12Updated last year
- A file utility for accessing both local and remote files through a unified interface.☆37Updated last month
- ☆13Updated last year
- ☆51Updated 5 months ago
- ☆58Updated 2 years ago
- A Python library for automatically solving Abstraction and Reasoning Corpus (ARC) challenges using Claude and object-centric modeling.☆20Updated last month
- This repository contains code for cleaning your training data of benchmark data to help combat data snooping.☆25Updated last year
- Code to create bugged python scripts for OpenAssistant Training, maintained by https://twitter.com/Cyndesama☆21Updated last year
- Codebase topic modeling using GNNs(Node aggregation and clustering)☆61Updated last year
- Transformer with Mu-Parameterization, implemented in Jax/Flax. Supports FSDP on TPU pods.☆30Updated 2 months ago
- Parallel data preprocessing for NLP and ML.☆34Updated 3 months ago