alvenirai / punctfix
☆22Updated last year
Alternatives and similar repositories for punctfix:
Users that are interested in punctfix are comparing it to the libraries listed below
- A spaCy custom component that extracts and normalizes temporal expressions☆54Updated 2 years ago
- spaCy-wrap is a wrapper library for spaCy for including fine-tuned transformers from Huggingface in your spaCy pipeline allowing you to i…☆46Updated last year
- DaCy: The State of the Art Danish NLP pipeline using SpaCy☆95Updated 4 months ago
- Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode☆111Updated 2 years ago
- C++ inference engine for running GLiNER (Generalist and Lightweight Named Entity Recognition) models☆28Updated 4 months ago
- A collection of scripts to preprocess ASR datasets and finetune language-specific Wav2Vec2 XLSR models☆31Updated 4 years ago
- A merged version of multiple open-source German speech datasets.☆31Updated last year
- Final training script from HuggingFace Whisper Fine tuning event - to get best results on finetuned model.☆12Updated 2 years ago
- Linguistic processing for Common Voice☆55Updated last year
- Execute arbitrary SQL queries on 🤗 Datasets☆32Updated last year
- 📝An easy-to-use package to restore punctuation of the text.☆115Updated 2 years ago
- ☆56Updated 2 years ago
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.☆82Updated last year
- A PyTorch Lightning Callback for pushing models to the Hugging Face Hub 🤗⚡️☆36Updated 2 years ago
- Text utilities, including beam search decoding, tokenizing, and more, built for use in Flashlight.☆69Updated 2 months ago
- 🧪 Cutting-edge experimental spaCy components and features☆98Updated last year
- Generalist and Lightweight Model for Text Classification☆123Updated this week
- Parse and convert numbers written in French, English, Spanish, Portuguese, German and Catalan into their digit representation.☆106Updated 2 months ago
- This repository contains a demonstrative implementation for pooling-based models, e.g., DeepPyramidion complementing our paper "Sparsifyi…☆14Updated 2 years ago
- SpaCyEx allows the creation of spaCy Matcher patterns with RegEx like syntax.☆59Updated last year
- A Python library aimed at dissecting and augmenting NER training data.☆58Updated last year
- ☆38Updated 3 years ago
- Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/…☆26Updated last year
- This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to…☆45Updated 3 years ago
- GLADIS: A General and Large Acronym Disambiguation Benchmark (EACL 23)☆16Updated 10 months ago
- Using short models to classify long texts☆21Updated 2 years ago
- Repository for fine-tuning Transformers 🤗 based seq2seq speech models in JAX/Flax.☆35Updated 2 years ago
- German small and large versions of GPT2.☆20Updated 2 years ago
- Bicleaner fork that uses neural networks☆40Updated 9 months ago
- ☆43Updated 2 years ago