An open-source package for python to clean raw text data
☆79Aug 8, 2023Updated 2 years ago
Alternatives and similar repositories for cleantext
Users that are interested in cleantext are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Ranking and Selecting Multi-Hop Knowledge Paths to Better Predict Human Needs (NAACL 2019)☆16Mar 22, 2021Updated 5 years ago
- R package: Manifold learning in R☆13Apr 7, 2020Updated 6 years ago
- Python parser for the Archie Markup Language (ArchieML)☆12Nov 7, 2021Updated 4 years ago
- Scripts for KGIRNet model for ESWC☆10Jul 6, 2023Updated 2 years ago
- 第一個開放的客語斷詞工具☆13Jun 10, 2018Updated 7 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- GlotEval: a unified evaluation toolkit designed to benchmark multilingual Large Language Models (LLMs) in a language-specific way☆18Nov 4, 2025Updated 6 months ago
- 🧹 Python package for text cleaning☆1,014Updated this week
- US election metadata, packaged as python!☆10Mar 16, 2022Updated 4 years ago
- A quick guide on how to start and setup a Django project with different virtual environments☆11Apr 11, 2018Updated 8 years ago
- ☆16May 14, 2020Updated 6 years ago
- this repo contains the draft, images, and code for the Medium blog post on altair themes.☆12Oct 8, 2018Updated 7 years ago
- All the goto functions you need to handle NLP use-cases, integrated in NLPretext☆143Mar 24, 2025Updated last year
- CommonsenseQA☆10Mar 28, 2020Updated 6 years ago
- Contains different projects about data science.☆14Nov 16, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆10Apr 8, 2023Updated 3 years ago
- A pluggable Django app to serialize chat messages in a Slack channel.☆10Jan 3, 2023Updated 3 years ago
- A lightweight Python script that fetches data from a Google spreadsheet, transforms to JSON, then optionally commits a data file to a Git…☆10Apr 1, 2026Updated last month
- Automatic Context Sensitive Spelling Correction for Bangla Text Using Bert and Levenstein Distance☆21Nov 18, 2024Updated last year
- PyTorch code for NAACL 2022 paper: DialoKG: Knowledge-Structure Aware Task-Oriented Dialogue Generation (https://aclanthology.org/2022.fi…☆16Apr 21, 2026Updated last month
- Sharing a viewer we built for WNYC.☆12May 10, 2011Updated 15 years ago
- Generate beautiful, testable documentation with Jupyter Notebooks☆21Jul 25, 2022Updated 3 years ago
- This is a solution accelerator for creating personalized content recommendations based on user activity.☆13Mar 26, 2024Updated 2 years ago
- Implementation of the paper: Using a KG-Copy Network for Non-Goal Oriented Dialogues☆19Jul 25, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A demo project and template repository showing how I use SpatiaLite with Datasette for quick spatial analysis.☆17Jul 7, 2024Updated last year
- Data and scripts for examining the Department of Defense's 1033 excess equipment program☆16Jun 21, 2022Updated 3 years ago
- Allows users to instantly reveal who donated to any current lawmakers☆10Jun 18, 2015Updated 10 years ago
- ☆14Jul 18, 2024Updated last year
- Interactive and printable bracket for the NCAA basketball tournament☆16May 6, 2015Updated 11 years ago
- Awesome List for Data Operations☆24Aug 14, 2020Updated 5 years ago
- Can LLMs generate code-mixed sentences through zero-shot prompting?☆11Apr 18, 2023Updated 3 years ago
- Basic web and app developer Mac setup in Ansible playbooks.☆21Sep 29, 2015Updated 10 years ago
- This library is for display the XAML code of theme library for WPF (e.g. MaterialDesignInXamlToolkit)☆12Sep 6, 2017Updated 8 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Visualising Sydney bus congestion with Marey charts☆12Nov 23, 2022Updated 3 years ago
- parse uniform crime reporting clearance data☆13Oct 2, 2015Updated 10 years ago
- ☆15Oct 6, 2015Updated 10 years ago
- Tools to create SVG maps with an automatic VML backup for legacy IE browsers using GeoDjango and Raphaël☆18Mar 6, 2012Updated 14 years ago
- Just charts. Really.☆23Sep 3, 2023Updated 2 years ago
- Matplotlib Image labeller for classifying images☆11Apr 6, 2026Updated last month
- A Flexible Deep Learning Approach to Fuzzy String Matching☆151Oct 16, 2024Updated last year