catid / dataloader
High-performance tokenized language data-loader for Python C++ extension
☆12Updated 6 months ago
Alternatives and similar repositories for dataloader:
Users that are interested in dataloader are comparing it to the libraries listed below
- Simplifying parsing of large jsonline files in NLP Workflows☆12Updated 3 years ago
- Training hybrid models for dummies.☆18Updated 2 weeks ago
- ☆18Updated 9 months ago
- ☆11Updated 8 months ago
- Implementation of a holodeck, written in Pytorch☆17Updated last year
- Minimum Description Length probing for neural network representations☆18Updated this week
- Describe the format of image/text datasets☆11Updated 2 years ago
- List of awesome JAX resources☆13Updated 2 years ago
- HyPe: Better Pre-trained Language Model Fine-tuning with Hidden Representation Perturbation [ACL 2023]☆14Updated last year
- A sample pattern for running CI tests on Modal☆14Updated 4 months ago
- Implementation of Spectral State Space Models☆16Updated 11 months ago
- An attempt to merge ESBN with Transformers, to endow Transformers with the ability to emergently bind symbols☆14Updated 3 years ago
- A library for simplifying fine tuning with multi gpu setups in the Huggingface ecosystem.☆16Updated 3 months ago
- ☆13Updated last year
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification☆11Updated last year
- Engineering the state of RNN language models (Mamba, RWKV, etc.)☆32Updated 8 months ago
- Flax Image Models - State-of-the-art pre-trained vision backbones for Flax.☆17Updated last year
- Write your code as tree-like expressions, then transform it☆21Updated last year
- You should use PySR to find scaling laws. Here's an example.☆33Updated last year
- ☆17Updated last month
- ☆40Updated 2 months ago
- Code for the paper: https://arxiv.org/pdf/2309.06979.pdf☆18Updated 6 months ago
- DiCE: The Infinitely Differentiable Monte-Carlo Estimator☆31Updated last year
- Utilities for PyTorch distributed☆23Updated last year
- code for paper "Accessing higher dimensions for unsupervised word translation"☆21Updated last year
- Unofficially Implements https://arxiv.org/abs/2112.05682 to get Linear Memory Cost on Attention for PyTorch☆12Updated 3 years ago
- JAX/Flax implementation of the Hyena Hierarchy☆33Updated last year
- A framework for implementing equivariant DL☆10Updated 3 years ago
- ☆20Updated 3 years ago
- Easily serialize dataclasses to and from tensors (PyTorch, NumPy)☆18Updated 3 years ago