tanmoyio / sahajBERTLinks
☆14Updated 3 years ago
Alternatives and similar repositories for sahajBERT
Users that are interested in sahajBERT are comparing it to the libraries listed below
Sorting:
- An attempt at the implementation of GLOM, Geoffrey Hinton's paper for emergent part-whole hierarchies from data☆37Updated 4 years ago
- ☆39Updated 2 years ago
- HomebrewNLP in JAX flavour for maintable TPU-Training☆50Updated last year
- PyTorch implementation of GLOM☆22Updated 3 years ago
- This repository contains example code to build models on TPUs☆30Updated 2 years ago
- Highly specialized crate to parse and use `google/sentencepiece` 's precompiled_charsmap in `tokenizers`☆18Updated 3 years ago
- Shows how to do parameter ensembling using differential evolution.☆10Updated 3 years ago
- One stop shop for all things carp☆59Updated 2 years ago
- TPU support for the fastai library☆13Updated 4 years ago
- Sequence models in Numpy☆25Updated 4 years ago
- This project shows how to derive the total number of training tokens from a large text dataset from 🤗 datasets with Apache Beam and Data…☆26Updated 2 years ago
- Version control for software 2.0☆64Updated 4 years ago
- Fastai community entry to 2020 Reproducibility Challenge☆18Updated 2 years ago
- Babysit your preemptible TPUs☆85Updated 2 years ago
- A case study of efficient training of large language models using commodity hardware.☆69Updated 2 years ago
- Code for scaling Transformers☆26Updated 4 years ago
- This repository hosts the code to port NumPy model weights of BiT-ResNets to TensorFlow SavedModel format.☆14Updated 3 years ago
- ☆60Updated 3 years ago
- A GPT, made only of MLPs, in Jax☆58Updated 3 years ago
- URL downloader supporting checkpointing and continuous checksumming.☆19Updated last year
- Python Research Framework☆106Updated 2 years ago
- Scripts to convert datasets from various sources to Hugging Face Datasets.☆57Updated 2 years ago
- Memory-efficient transformer. Work in progress.☆19Updated 2 years ago
- A collection of Models, Datasets, DataModules, Callbacks, Metrics, Losses and Loggers to better integrate pytorch-lightning with transfor…☆47Updated 2 years ago
- ☆19Updated 9 months ago
- A Simple but Powerful CNN Trainer For PyTorch☆26Updated 4 years ago
- ☆13Updated 6 years ago
- A queue service for quickly developing scripts that use all your GPUs efficiently☆85Updated 2 years ago
- A collection of reusable, high-performance, well-documented, thorough-tested layers and models in Jax☆15Updated last week
- Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP☆58Updated 2 years ago