samsucik / knowledge-distil-bert
Master's thesis project in collaboration with Rasa, focusing on knowledge distillation from BERT into different very small networks and analysis of the students' NLP capabilities.
☆13Updated 2 years ago
Alternatives and similar repositories for knowledge-distil-bert
Users who are interested in knowledge-distil-bert are comparing it to the libraries listed below.
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler ☆24Updated 4 years ago
- A simple neural truecaser written in pytorch and allennlp.☆33Updated last year
- ☆50Updated 3 years ago
- SIGMORPHON 2020 Shared Task: Grapheme-to-Phoneme, Unsupervised Induction of Morphology, and Typologically Diverse Morphological Inflectio…☆36Updated 4 months ago
- ☆15Updated 6 years ago
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages☆13Updated 2 years ago
- Implementation of Z-BERT-A: a zero-shot pipeline for unknown intent detection.☆42Updated 2 years ago
- Fine-tune transformers with pytorch-lightning☆44Updated 3 years ago
- CodeSwitch is an NLP tool that can be used for language identification, POS tagging, named entity recognition, and sentiment analysis of code-mixed dat…☆36Updated 4 years ago
- Many Natural Language Processing tasks rely on sentence boundary detection (SBD). Although amazing libraries like spacy provide state of …☆61Updated 5 years ago
- Experiments with Hugging Face 🔬 🤗☆44Updated last year
- Training an n-gram based Language Model using KenLM toolkit for Deep Speech 2☆114Updated 6 years ago
- An asynchronous concurrent pipeline for classifying Common Crawl based on fastText's pipeline.☆86Updated 4 years ago
- Enable RNNLM lattice rescoring with Pytorch [kaldi]☆12Updated 5 years ago
- Convert words to numbers☆21Updated 3 years ago
- A baseline Automatic Speech Recognition system for Polish based on Kaldi.☆18Updated 3 years ago
- A python true casing utility that restores case information for texts☆89Updated 2 years ago
- A Benchmark Dataset for Understanding Disfluencies in Question Answering☆63Updated 4 years ago
- Normalize text string☆12Updated 6 years ago
- A web application that interfaces two GEC systems. [web instance is down]☆32Updated last year
- MaSS - Multilingual corpus of Sentence-aligned Spoken utterances☆50Updated last year
- Text processing library for sentiment analysis and related tasks☆27Updated 6 years ago
- Language identification and normalisation in code switching data tailored with a three-step decoding process☆24Updated 5 years ago
- ☆22Updated 3 years ago
- BERT models for many languages created from Wikipedia texts☆33Updated 5 years ago
- A toolkit for producing n-gram language models. The highlights are the implementation of Kneser-Ney growing and revised Kneser pruning me…☆41Updated 2 weeks ago
- BotSIM - a data-efficient end-to-end Bot SIMulation toolkit for evaluation, diagnosis, and improvement of commercial chatbots☆116Updated 4 months ago
- Tool for Evaluating Multilingual WS-353 and SimLex-999☆10Updated 8 years ago
- A Word Aligner for English☆11Updated 8 years ago
- NMT-based punctuation prediction system using lexical and acoustic features.☆14Updated 5 years ago