A Python package for text preprocessing tasks in natural language processing.
☆63Sep 27, 2022Updated 3 years ago
Alternatives and similar repositories for text-preprocessing
Users who are interested in text-preprocessing are comparing it to the libraries listed below.
- Source code and data for Like a Good Nearest Neighbor☆30Jan 12, 2025Updated last year
- Fixes contractions such as `you're` to `you are`☆319Nov 15, 2022Updated 3 years ago
- Infographiq, ie intelligent interactive infographics, core JavaScript library☆11Jan 17, 2024Updated 2 years ago
- Arabic News Stance Corpus☆11Feb 5, 2021Updated 5 years ago
- A Python Wrapper To Retrieve Data From The CrowdTangle API☆11Jun 10, 2025Updated 8 months ago
- Notes and samples for Python performance talk☆10Feb 17, 2022Updated 4 years ago
- Back end for producing indicators and loading them into the COVIDcast API.☆12Updated this week
- Modeling methods of System Dynamics – Supply Chain Simulation using the AnyLogic software☆10Jan 8, 2026Updated last month
- Portfolio with data science and machine learning projects I developed during my training in data science.☆10Jan 4, 2021Updated 5 years ago
- LLM Building Blocks for Python Course☆15Nov 17, 2025Updated 3 months ago
- Prototype to detect Spanish hate-speech against women online.☆11Aug 7, 2022Updated 3 years ago
- On the Complementarity between Pre-Training and Back-Translation for Neural Machine Translation (Findings of EMNLP 2021)☆13Nov 21, 2021Updated 4 years ago
- 🔎 Finds fuzzy matches between datasets☆16Jan 26, 2026Updated last month
- A stripped down pattern and formula interpreter for Renoise built for generating note and parameter sequences of a track pattern. Should …☆16Jan 18, 2012Updated 14 years ago
- A Laravel 5 console command to create migration files from a MySQL database☆10Feb 23, 2015Updated 11 years ago
- Code repository for Python for Beginners: Learn Python from Scratch, published by Packt☆16Oct 16, 2023Updated 2 years ago
- The Implementation for the Paper "Time-Stamped Language Model: Teaching Language Models to Understand The Flow of Events"☆11May 6, 2021Updated 4 years ago
- Course project for CS410. Drug Molecular Toxicity Prediction with GCN + Cloud ML Infra.☆10Apr 6, 2021Updated 4 years ago
- Using Selenium Chrome WebDriver, the app searches for a given keyword in TikTok video descriptions, collects video-related data, extracts numbe…☆11Nov 29, 2023Updated 2 years ago
- Cubic splines for Julia☆11May 16, 2022Updated 3 years ago
- Collection of Common Machine Translation Tools☆11Jul 26, 2022Updated 3 years ago
- Command Line Interface for IA² models development, training and deployment.☆10Jun 16, 2023Updated 2 years ago
- A repo to keep all resources about interpretability in NLP organised and up to date☆12Nov 22, 2020Updated 5 years ago
- ☆13Oct 3, 2024Updated last year
- ACL Rolling Review website☆11Feb 24, 2026Updated last week
- Univibe vst3 and au plugin☆11Feb 23, 2024Updated 2 years ago
- Generates a tree of an S3 bucket contents☆10Sep 18, 2020Updated 5 years ago
- A detailed implementation of the TrueSkill algorithm in the Java language.☆11Sep 5, 2015Updated 10 years ago
- Combination of the RapidFuzz library with spaCy PhraseMatcher☆11Sep 29, 2021Updated 4 years ago
- Combining encoder-based language models☆11Nov 11, 2021Updated 4 years ago
- ☆16Aug 6, 2023Updated 2 years ago
- ☆11Jul 15, 2020Updated 5 years ago
- Example of how ExternalProject_Add can be used to load a CEF3 build as a dependency in our own cmake project.☆10Sep 1, 2015Updated 10 years ago
- 10th Place Solution for APTOS 2019 Blindness Detection (efficientnet-b5 part)☆10Apr 23, 2020Updated 5 years ago
- Podium: a framework agnostic Python NLP library for data loading and preprocessing☆60Dec 12, 2022Updated 3 years ago
- Helpful data preprocessing, training, and visualisation code and scripts for a range of Kaggle competitions, supported by Weights & Biase…☆15Oct 11, 2022Updated 3 years ago
- DISCO: Comprehensive and Explainable Disinformation Detection, CIKM 2022☆10May 5, 2023Updated 2 years ago
- Is BERT Robust to Label Noise? A Study on Learning with Noisy Labels in Text Classification☆10May 31, 2022Updated 3 years ago
- Merit Award solution for the 2020 Xiamen International Bank Digital Innovation Finance Cup Modeling Competition☆11Feb 2, 2021Updated 5 years ago