KamWithK / PyParquetLoaders
Easy, efficient and Pythonic data loading of Parquet files for PyTorch-based libraries
β23Updated 4 years ago
Alternatives and similar repositories for PyParquetLoaders:
Users that are interested in PyParquetLoaders are comparing it to the libraries listed below
- Implementation of TableFormer, Robust Transformer Modeling for Table-Text Encoding, in Pytorchβ37Updated 2 years ago
- Quickest way to share everything about your research within a single appβ17Updated 11 months ago
- AdamW optimizer for bfloat16 models in pytorch π₯.β31Updated 7 months ago
- Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixingβ47Updated 3 years ago
- My explorations into editing the knowledge and memories of an attention networkβ34Updated 2 years ago
- Implementation of some personal helper functions for Einops, my most favorite tensor manipulation library β€οΈβ53Updated 2 years ago
- A python library for highly configurable transformers - easing model architecture search and experimentation.β49Updated 3 years ago
- Implementation of a Transformer using ReLA (Rectified Linear Attention) from https://arxiv.org/abs/2104.07012β49Updated 2 years ago
- Unofficially Implements https://arxiv.org/abs/2112.05682 to get Linear Memory Cost on Attention for PyTorchβ12Updated 3 years ago
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning Pβ¦β34Updated last year
- Large dataset storage format for Pytorchβ45Updated 3 years ago
- Implementation of the Kalman Filtering Attention proposed in "Kalman Filtering Attention for User Behavior Modeling in CTR Prediction"β57Updated last year
- β32Updated 2 years ago
- Another attempt at a long-context / efficient transformer by meβ37Updated 2 years ago
- A scalable implementation of diffusion and flow-matching with XGBoost models, applied to calorimeter data.β17Updated 2 months ago
- This is the official PyTorch repo for "UNIREX: A Unified Learning Framework for Language Model Rationale Extraction" (ICML 2022).β24Updated last year
- β30Updated this week
- Index of URLs to pdf files all over the internet and scriptsβ21Updated last year
- A collection of Models, Datasets, DataModules, Callbacks, Metrics, Losses and Loggers to better integrate pytorch-lightning with transforβ¦β47Updated last year
- SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields extraction, tokenization, prompting, batchiβ¦β31Updated 8 months ago
- β21Updated 3 years ago
- Utilities for Training Very Large Modelsβ57Updated 4 months ago
- High performance pytorch modulesβ18Updated 2 years ago
- A PyTorch Lightning extension that accelerates and enhances foundation model experimentation with flexible fine-tuning schedules.β60Updated 3 weeks ago
- Implementation of the Remixer Block from the Remixer paper, in Pytorchβ35Updated 3 years ago
- Embedding Recycling for Language modelsβ38Updated last year
- TorchFix - a linter for PyTorch-using code with autofix supportβ122Updated 3 weeks ago
- Large Scale Distributed Model Training strategy with Colossal AI and Lightning AIβ58Updated last year
- Transformers at any scaleβ41Updated last year
- Code for the paper PermuteFormerβ42Updated 3 years ago