richarddwang / hugdatafast
The elegant integration of huggingface/nlp and fastai2 and handy transforms using pure huggingface/nlp
☆19Updated 4 years ago
Alternatives and similar repositories for hugdatafast:
Users that are interested in hugdatafast are comparing it to the libraries listed below
- A package for fine-tuning Transformers with TPUs, written in Tensorflow2.0+☆37Updated 4 years ago
- A collection of Models, Datasets, DataModules, Callbacks, Metrics, Losses and Loggers to better integrate pytorch-lightning with transfor…☆47Updated last year
- Convenient DL serving☆72Updated 3 years ago
- TorchServe+Streamlit for easily serving your HuggingFace NER models☆33Updated 2 years ago
- GNES Hub ship AI/ML models as Docker containers and use Docker containers as plugins.☆34Updated 5 years ago
- TPU index is a package for fast similarity search over large collections of high dimension vectors on TPUs☆17Updated 3 years ago
- The stand-alone training engine module for the ALOHA.eu project.☆15Updated 5 years ago
- Implements EvoNorms B0 and S0 as proposed in Evolving Normalization-Activation Layers.☆11Updated 4 years ago
- A deep learning library based on Pytorch focussed on low resource language research and robustness☆69Updated 3 years ago
- A 🤗-style implementation of BERT using lambda layers instead of self-attention☆69Updated 4 years ago
- DEPRECATED--all functionality moved to nbdev☆15Updated 2 years ago
- No Teacher BART distillation experiment for NLI tasks☆26Updated 4 years ago
- [WIP] Behold, semantic-search, built over sentence-transformers to make it easy for search engineers to evaluate, optimise and deploy mod…☆15Updated last year
- Large Scale BERT Distillation☆32Updated 2 years ago
- SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields extraction, tokenization, prompting, batchi…☆33Updated 10 months ago
- Simple dataset to dataloader library for pytorch☆33Updated 3 months ago
- ☆28Updated last year
- Tokenization across languages. Useful as preprocessing for subword tokenization.☆22Updated 2 years ago
- LM Pretraining with PyTorch/TPU☆134Updated 5 years ago
- This repository contains example code to build models on TPUs☆30Updated 2 years ago
- This project shows how to derive the total number of training tokens from a large text dataset from 🤗 datasets with Apache Beam and Data…☆25Updated 2 years ago
- Large dataset storage format for Pytorch☆45Updated 3 years ago
- Code repo for "Transformer on a Diet" paper☆31Updated 4 years ago
- Using Fastai library to classify Twitter jokes in Spanish☆12Updated 5 years ago
- Code for scaling Transformers☆26Updated 4 years ago
- ELECTRA MODEL NLP☆13Updated 5 years ago
- Transformer based Trigram Blocking implementation in Tensorflow☆11Updated 5 years ago
- Transformer block in tf.keras similar to PyTorch's nn.Transformer block.☆9Updated 4 years ago
- TPU support for the fastai library☆13Updated 4 years ago
- Process Google Dataset is a tool to download and process images for neural networks from a Google Image Search using a Chrome extension a…☆33Updated 3 years ago