alvenirai / punctfix
☆22 Updated 9 months ago
Related projects
Alternatives and complementary repositories for punctfix
- DaCy: The State of the Art Danish NLP pipeline using SpaCy ☆93 Updated 3 weeks ago
- spaCy-wrap is a wrapper library for spaCy for including fine-tuned transformers from Huggingface in your spaCy pipeline allowing you to i… ☆46 Updated 7 months ago
- Augmenty is an augmentation library based on spaCy for augmenting texts. ☆151 Updated 5 months ago
- 💫 SpaCy wrapper for ConceptNet 💫 ☆88 Updated last year
- A spaCy custom component that extracts and normalizes temporal expressions ☆52 Updated last year
- An extension package of 🤗 Datasets that provides support for executing arbitrary SQL queries on HF datasets ☆31 Updated 9 months ago
- SpaCyEx allows the creation of spaCy Matcher patterns with RegEx-like syntax. ☆57 Updated 6 months ago
- Asent is a Python library for performing efficient and transparent sentiment analysis using spaCy. ☆115 Updated 7 months ago
- A Python library aimed at dissecting and augmenting NER training data. ☆56 Updated last year
- AfroLID, a powerful neural toolkit for African language identification which covers 517 African languages. ☆28 Updated last year
- A merged version of multiple open-source German speech datasets. ☆30 Updated 6 months ago
- 🧪 Cutting-edge experimental spaCy components and features ☆95 Updated 6 months ago
- The official repository for Toxic Commons and Celadon. Toxicity Classification for public domain data. ☆9 Updated last week
- RaKUn 2.0 - A fast keyword detection algorithm ☆65 Updated 3 months ago
- Scripts to convert datasets from various sources to Hugging Face Datasets. ☆57 Updated 2 years ago
- BERT and ELECTRA models trained on Europeana Newspapers ☆36 Updated 2 years ago
- classy is a simple-to-use library for building high-performance Machine Learning models in NLP. ☆85 Updated last month
- Using short models to classify long texts ☆20 Updated last year
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality … ☆106 Updated 8 months ago
- spaCy match and replace, maintaining conjugation ☆34 Updated last year
- Sentence transformers models for SpaCy ☆105 Updated last year
- Sentiment Corpus for Swedish 🇸🇪 Norwegian 🇳🇴 Danish 🇩🇰 Finnish 🇫🇮 (and English 🏴) ☆15 Updated 3 years ago
- REMERGE - Multi-Word Expression discovery algorithm ☆14 Updated 2 years ago
- This is a neural spell checker ☆60 Updated last year
- Library for fast text representation and classification. ☆28 Updated 10 months ago
- Code for WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models. ☆75 Updated 2 months ago
- A French sequence-to-sequence pretrained model ☆57 Updated 2 years ago
- Annotated corpus + evaluation metrics for text anonymisation ☆51 Updated 9 months ago
- Temporarily remove unused tokens during training to save RAM and improve speed. ☆22 Updated 4 months ago
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy. ☆72 Updated last year