rom1504 / any2dataset
Turn any collection of files into a dataset
☆44Updated 2 years ago
Alternatives and similar repositories for any2dataset:
Users that are interested in any2dataset are comparing it to the libraries listed below
- Load any clip model with a standardized interface☆21Updated 11 months ago
- Implementation of a holodeck, written in Pytorch☆17Updated last year
- ☆43Updated 2 years ago
- DiCE: The Infinitely Differentiable Monte-Carlo Estimator☆31Updated last year
- code for paper "Accessing higher dimensions for unsupervised word translation"☆21Updated last year
- Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing☆48Updated 3 years ago
- ESGD-M is a stochastic non-convex second order optimizer, suitable for training deep learning models, for PyTorch.☆56Updated 2 years ago
- A JAX nn library☆21Updated last month
- A library for squeakily cleaning and filtering language datasets.☆46Updated last year
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data☆21Updated 7 months ago
- Simple python template☆40Updated 11 months ago
- PyTorch interface for TrueGrad Optimizers☆42Updated last year
- An open source implementation of CLIP.☆32Updated 2 years ago
- ☆21Updated 3 months ago
- Exploring an idea where one forgets about efficiency and carries out attention across each edge of the nodes (tokens)☆45Updated last month
- Un-*** 50 billions multimodality dataset☆24Updated 2 years ago
- Latent Diffusion Language Models☆68Updated last year
- A GPT, made only of MLPs, in Jax☆57Updated 3 years ago
- [WIP] A 🔥 interface for running code in the cloud☆86Updated 2 years ago
- ☆26Updated 2 years ago
- Describe the format of image/text datasets☆11Updated 2 years ago
- Re-implementation of 'Grokking: Generalization beyond overfitting on small algorithmic datasets'☆38Updated 3 years ago
- Another attempt at a long-context / efficient transformer by me