sayakpaul / count-tokens-hf-datasets
This project shows how to derive the total number of training tokens from a large text dataset from π€ datasets with Apache Beam and Dataflow.
β25Updated 2 years ago
Alternatives and similar repositories for count-tokens-hf-datasets:
Users that are interested in count-tokens-hf-datasets are comparing it to the libraries listed below
- PyTorch implementation of GLOMβ22Updated 3 years ago
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.β93Updated 2 years ago
- Implementation of TableFormer, Robust Transformer Modeling for Table-Text Encoding, in Pytorchβ37Updated 3 years ago
- Embedding Recycling for Language modelsβ38Updated last year
- A multi-label text classifier to predict the subject areas of arXiv papers from their abstract bodies.β17Updated 3 years ago
- Fastai community entry to 2020 Reproducibility Challengeβ18Updated 2 years ago
- β21Updated 3 years ago
- Implementation of CaiT models in TensorFlow and ImageNet-1k checkpoints. Includes code for inference and fine-tuning.β12Updated last year
- Contains my experiments with the `big_vision` repo to train ViTs on ImageNet-1k.β22Updated 2 years ago
- β28Updated last year
- Official code release for the paper Coder Reviewer Reranking for Code Generation.β42Updated 2 years ago
- Ranking of fine-tuned HF models as base models.β35Updated last year
- Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixingβ48Updated 3 years ago
- Companion Repo for the Vision Language Modelling YouTube series - https://bit.ly/3PsbsC2 - by Prithivi Da. Open to PRs and collaborationsβ14Updated 2 years ago
- This repository hosts the code to port NumPy model weights of BiT-ResNets to TensorFlow SavedModel format.β14Updated 3 years ago
- Official repository with code and data accompanying the NAACL 2021 paper "Hurdles to Progress in Long-form Question Answering" (https://aβ¦β46Updated 2 years ago
- β96Updated last year
- Google's BigBird (Jax/Flax & PyTorch) @ π€Transformersβ49Updated 2 years ago
- This repo contains data and code for the paper "Reasoning over Public and Private Data in Retrieval-Based Systems."β46Updated 8 months ago
- Shows how to do parameter ensembling using differential evolution.β10Updated 3 years ago
- β54Updated 2 years ago
- Implementation of COCO-LM, Correcting and Contrasting Text Sequences for Language Model Pretraining, in Pytorchβ45Updated 4 years ago
- Helper scripts and notes that were used while porting various nlp modelsβ46Updated 3 years ago
- Code for paper "Do Language Models Have Beliefs? Methods for Detecting, Updating, and Visualizing Model Beliefs"β28Updated 2 years ago
- An attempt to merge ESBN with Transformers, to endow Transformers with the ability to emergently bind symbolsβ15Updated 3 years ago
- An implementation of Transformer with Expire-Span, a circuit for learning which memories to retainβ33Updated 4 years ago
- A collection of Models, Datasets, DataModules, Callbacks, Metrics, Losses and Loggers to better integrate pytorch-lightning with transforβ¦β47Updated last year
- β77Updated last year
- Skyformer: Remodel Self-Attention with Gaussian Kernel and Nystr\"om Method (NeurIPS 2021)β60Updated 2 years ago
- My explorations into editing the knowledge and memories of an attention networkβ34Updated 2 years ago