ad-freiburg / whitespace-correction
Fast whitespace correction with Transformers
☆16 Updated 2 months ago
Alternatives and similar repositories for whitespace-correction
Users who are interested in whitespace-correction are comparing it to the repositories listed below
- zero shot NER fine tuning ☆13 Updated 4 months ago
- ☆34 Updated 4 years ago
- This repository contains the implementation of the paper: "Span Classification with Structured Information for Disfluency Detection in Sp… ☆13 Updated 2 years ago
- BotSIM - a data-efficient end-to-end Bot SIMulation toolkit for evaluation, diagnosis, and improvement of commercial chatbots ☆116 Updated 2 months ago
- Implementation of Z-BERT-A: a zero-shot pipeline for unknown intent detection. ☆41 Updated 2 years ago
- zero-vocab or low-vocab embeddings ☆18 Updated 3 years ago
- Repository for Findings of EMNLP 2020 "Context-aware Stand-alone Neural Spelling Correction" ☆18 Updated 4 years ago
- Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode ☆111 Updated 2 years ago
- Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP ☆58 Updated 2 years ago
- Finite-state script normalization and processing utilities ☆40 Updated this week
- This tool helps automatic generation of grammatically valid synthetic Code-mixed data by utilizing linguistic theories such as Equivalenc… ☆55 Updated 11 months ago
- All my experiments with the various transformers and various transformer frameworks available ☆14 Updated 4 years ago
- A package for fine tuning of pretrained NLP transformers using Semi Supervised Learning ☆14 Updated 3 years ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling. ☆27 Updated last year
- A tiny BERT for low-resource monolingual models ☆31 Updated 9 months ago
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler ☆24 Updated 4 years ago
- This code provides word level language identification tool for identifying language for individual words in Code-Mixed text. e.g. The tex… ☆53 Updated 4 years ago
- ☆12 Updated 2 years ago
- Correction of spaces with character-based neural language models. ☆13 Updated 2 years ago
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages ☆13 Updated 2 years ago
- BERT-based joint intent detection and slot filling with intent-slot attention mechanism (INTERSPEECH 2021) ☆87 Updated 11 months ago
- Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recogni… ☆24 Updated 3 years ago
- Data and evaluation code for the paper WikiNEuRal: Combined Neural and Knowledge-based Silver Data Creation for Multilingual NER (EMNLP 2… ☆68 Updated 2 years ago
- BLOOM+1: Adapting BLOOM model to support a new unseen language ☆73 Updated last year
- ☆57 Updated 3 years ago
- Suite for phonetic word embeddings, especially their evaluation and baseline models. ☆30 Updated 4 months ago
- Personal information identification standard ☆21 Updated last year
- Final training script from HuggingFace Whisper Fine tuning event - to get best results on finetuned model. ☆12 Updated 2 years ago
- Whisper finetuned on VinBigdata-VLSP2020-100h + KenLM ☆38 Updated last year
- Transformation spoken text to written text ☆30 Updated last year