samsucik / knowledge-distil-bert
Master's thesis project in collaboration with Rasa, focusing on knowledge distillation from BERT into different very small networks and analysis of the students' NLP capabilities.
☆13Updated 2 years ago
Alternatives and similar repositories for knowledge-distil-bert
Users who are interested in knowledge-distil-bert are comparing it to the libraries listed below.
- A simple neural truecaser written in PyTorch and AllenNLP.☆33Updated last year
- Many Natural Language Processing tasks rely on sentence boundary detection (SBD). Although amazing libraries like spaCy provide state of …☆61Updated 4 years ago
- A Benchmark Dataset for Understanding Disfluencies in Question Answering☆63Updated 4 years ago
- Implementation of Z-BERT-A: a zero-shot pipeline for unknown intent detection.☆41Updated 2 years ago
- An asynchronous concurrent pipeline for classifying Common Crawl based on fastText's pipeline.☆86Updated 4 years ago
- Experiments with Hugging Face 🔬 🤗☆44Updated 11 months ago
- ☆49Updated 3 years ago
- Text processing library for sentiment analysis and related tasks☆27Updated 6 years ago
- A lightweight but powerful library to build token indices for NLP tasks, compatible with major Deep Learning frameworks like PyTorch and …☆51Updated 8 months ago
- Language identification and normalisation in code switching data tailored with a three-step decoding process☆24Updated 5 years ago
- BERT models for many languages created from Wikipedia texts☆33Updated 5 years ago
- Multilingual text corpus designed to study multilingual and cross-lingual natural language understanding (NLU) models and the strategies …☆13Updated last month
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler☆24Updated 4 years ago
- A TensorFlow Implementation of Punctuation Restoration.☆18Updated 4 years ago
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages☆13Updated 2 years ago
- zero-vocab or low-vocab embeddings☆18Updated 3 years ago
- Assessing syntactic abilities of BERT☆39Updated 6 years ago
- The repository for the paper "When Do You Need Billions of Words of Pretraining Data?"☆21Updated 4 years ago
- ☆17Updated 11 months ago
- Use BERT to Fill in the Blanks☆83Updated 3 years ago
- Generic framework for information extraction tasks, including recognition of named entities, temporal expressions, spatial expressions an…☆12Updated 2 years ago
- Dual Encoders for State-of-the-art Natural Language Processing.☆61Updated 3 years ago
- Dynamic ensemble decoding with transformer-based models☆29Updated 2 years ago
- This repository contains datasets and code for the paper "HINT3: Raising the bar for Intent Detection in the Wild" accepted at EMNLP-2020…☆33Updated 4 years ago
- This code provides word level language identification tool for identifying language for individual words in Code-Mixed text. e.g. The tex…☆55Updated 4 years ago
- Scalable Attentive Sentence-Pair Modeling via Distilled Sentence Embedding (AAAI 2020) - PyTorch Implementation☆32Updated 2 years ago
- A bunch of scripts exploiting several tools to perform inverse text normalization (ITN)☆21Updated 7 years ago
- Code for "Planning and Generating Natural and Diverse Disfluent Texts as Augmentation for Disfluency Detection"☆15Updated 3 years ago
- A generic library for crafting adversarial NLP examples - WIP☆41Updated 6 years ago
- Tool for Evaluating Multilingual WS-353 and SimLex-999☆10Updated 8 years ago