rom1504 / any2dataset
Turn any collection of files into a dataset
☆43 · Updated last year
Alternatives and similar repositories for any2dataset:
Users who are interested in any2dataset are comparing it to the repositories listed below:
- Re-implementation of 'Grokking: Generalization beyond overfitting on small algorithmic datasets' ☆38 · Updated 3 years ago
- A library for squeakily cleaning and filtering language datasets. ☆45 · Updated last year
- Load any clip model with a standardized interface ☆21 · Updated 8 months ago
- A client library for LAION's effort to filter CommonCrawl with CLIP, building a large scale image-text dataset. ☆32 · Updated last year
- Implementation of a holodeck, written in Pytorch ☆17 · Updated last year
- A dashboard for exploring timm learning rate schedulers ☆19 · Updated last month
- [WIP] A 🔥 interface for running code in the cloud ☆86 · Updated last year
- ESGD-M is a stochastic non-convex second order optimizer, suitable for training deep learning models, for PyTorch. ☆56 · Updated 2 years ago
- ☆42 · Updated 2 years ago
- Implementation of Metaformer, but in an autoregressive manner ☆23 · Updated 2 years ago
- Latent Diffusion Language Models ☆68 · Updated last year
- ☆15 · Updated last month
- **ARCHIVED** Filesystem interface to 🤗 Hub ☆57 · Updated last year
- 🤗 Disaggregators: Curated data labelers for in-depth analysis. ☆65 · Updated last year
- Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing ☆47 · Updated 2 years ago
- Un-*** 50 billions multimodality dataset ☆24 · Updated 2 years ago
- Exploring an idea where one forgets about efficiency and carries out attention across each edge of the nodes (tokens) ☆44 · Updated 3 months ago
- An open source implementation of CLIP. ☆32 · Updated 2 years ago
- ☆26 · Updated last year
- implementation of https://arxiv.org/pdf/2312.09299 ☆20 · Updated 6 months ago
- JAX implementation ViT-VQGAN ☆56 · Updated 2 years ago
- ☆27 · Updated 3 years ago
- code for paper "Accessing higher dimensions for unsupervised word translation" ☆21 · Updated last year
- Aggregating embeddings over time ☆31 · Updated 2 years ago
- ☆22 · Updated last year