huggingface / tflite-android-transformers
DistilBERT / GPT-2 for on-device inference thanks to TensorFlow Lite with Android demo apps
☆390Updated last year
Related projects: ⓘ
- Summarization, translation, sentiment-analysis, text-generation and more at blazing speed using a T5 version implemented in ONNX.☆251Updated last year
- State of the art faster Transformer with Tensorflow 2.0 ( NLP, Computer Vision, Audio ).☆85Updated last year
- 📲 Transformers android examples (Tensorflow Lite & Pytorch Mobile)☆78Updated last year
- ⚡ boost inference speed of T5 models by 5x & reduce the model size by 3x.☆561Updated last year
- Repository for the paper "Optimal Subarchitecture Extraction for BERT"☆469Updated 2 years ago
- FastFormers - highly efficient transformer models for NLU☆702Updated 8 months ago
- TFLite Support is a toolkit that helps users to develop ML and deploy TFLite models onto mobile / ioT devices.☆369Updated 3 weeks ago
- An open clone of the GPT-2 WebText dataset by OpenAI. Still WIP.☆380Updated 5 months ago
- An efficient implementation of the popular sequence models for text generation, summarization, and translation tasks. https://arxiv.org/p…☆432Updated 2 years ago
- EMNLP 2020: "Dialogue Response Ranking Training with Large-Scale Human Feedback Data"☆335Updated last year
- Accelerated NLP pipelines for fast inference on CPU. Built with Transformers and ONNX runtime.☆125Updated 3 years ago
- Project tracking of the "Mobile ML Working Group", for the End-to-End TensorFlow Lite tutorials.☆133Updated 2 years ago
- ☆411Updated 10 months ago
- OpenAI GPT2 pre-training and sequence prediction implementation in Tensorflow 2.0☆257Updated last year
- How to create Selfie2Anime from tflite model to Android.☆74Updated 3 years ago
- Examples for using ONNX Runtime for model training.☆301Updated last month
- Prune a model while finetuning or training.☆393Updated 2 years ago
- Fast Inference Solutions for BLOOM☆556Updated last month
- Interactive Neural Machine Translation-lite (INMT-lite) is a framework to train and develop lite versions (.tflite) of models for neural …☆45Updated 11 months ago
- Guide: Finetune GPT2-XL (1.5 Billion Parameters) and finetune GPT-NEO (2.7 B) on a single GPU with Huggingface Transformers using DeepSpe…☆428Updated last year
- ☆480Updated 7 months ago
- A search engine for ParlAI's BlenderBot project (and probably other ones as well)☆132Updated 2 years ago
- Parallelformers: An Efficient Model Parallelization Toolkit for Deployment☆776Updated last year
- xfspell — the Transformer Spell Checker☆186Updated 4 years ago
- Question Answering using Albert and Electra☆205Updated last year
- XtremeDistil framework for distilling/compressing massive multilingual neural network models to tiny and efficient models for AI at scale☆152Updated 9 months ago
- An awesome list of TensorFlow Lite models, samples, tutorials, tools and learning resources.☆1,167Updated 2 years ago
- Examples of Tensorflow Lite on Android☆66Updated last year
- Accelerate PyTorch models with ONNX Runtime☆353Updated 2 weeks ago
- This repository contains notebooks that show the usage of TensorFlow Lite for quantizing deep neural networks.☆170Updated last year