An open-source Python package for cleaning raw text data
☆79Aug 8, 2023Updated 2 years ago
Alternatives and similar repositories for cleantext
Users that are interested in cleantext are comparing it to the libraries listed below.
- Ranking and Selecting Multi-Hop Knowledge Paths to Better Predict Human Needs (NAACL 2019)☆16Mar 22, 2021Updated 5 years ago
- R package: Manifold learning in R☆14Apr 7, 2020Updated 6 years ago
- Python parser for the Archie Markup Language (ArchieML)☆12Nov 7, 2021Updated 4 years ago
- The first open Hakka word segmentation tool☆13Jun 10, 2018Updated 8 years ago
- 🧹 Python package for text cleaning☆1,022May 15, 2026Updated last month
- AWS Lambda FastAPI with Serverless☆17Dec 1, 2024Updated last year
- Web based semantic visualization tool☆12Feb 16, 2017Updated 9 years ago
- US election metadata, packaged as Python!☆10Mar 16, 2022Updated 4 years ago
- ☆10Jun 23, 2018Updated 8 years ago
- ☆16Nov 5, 2018Updated 7 years ago
- Dynamic Topic Modeling and Topic Chains of Reuters News Articles using SCVB0☆24Jan 12, 2017Updated 9 years ago
- Research project on glyph-based Chinese character embedding. Preparing for EMNLP 2019☆11Mar 18, 2019Updated 7 years ago
- This repo contains the draft, images, and code for the Medium blog post on Altair themes.☆12Oct 8, 2018Updated 7 years ago
- Defeasible Natural Language Inference☆14Dec 4, 2020Updated 5 years ago
- All things involving lavaan☆14Mar 13, 2013Updated 13 years ago
- All the go-to functions you need to handle NLP use-cases, integrated in NLPretext☆143Mar 24, 2025Updated last year
- ☆11Jun 3, 2021Updated 5 years ago
- A pluggable Django app to serialize chat messages in a Slack channel.☆10Jan 3, 2023Updated 3 years ago
- ☆10Apr 8, 2023Updated 3 years ago
- Two algorithms based on linear programming to discover classification rules for interpretable learning.☆23Jun 13, 2025Updated last year
- finds a different set of words that sound like the input☆10Feb 24, 2022Updated 4 years ago
- A lightweight Python script that fetches data from a Google spreadsheet, transforms to JSON, then optionally commits a data file to a Git…☆10Apr 1, 2026Updated 2 months ago
- Sharing a viewer we built for WNYC.☆12May 10, 2011Updated 15 years ago
- A Los Angeles Times analysis of water usage after the state eased drought restrictions☆12Mar 19, 2021Updated 5 years ago
- Generate beautiful, testable documentation with Jupyter Notebooks☆21Jul 25, 2022Updated 3 years ago
- This is a solution accelerator for creating personalized content recommendations based on user activity.☆13Mar 26, 2024Updated 2 years ago
- ☆12Apr 1, 2022Updated 4 years ago
- yeoman generator for newsapps.☆15Jun 3, 2015Updated 11 years ago
- Jupyter notebooks - A tool to write and share executable notebooks and data visualization☆10Feb 5, 2026Updated 4 months ago
- A demo project and template repository showing how I use SpatiaLite with Datasette for quick spatial analysis.☆17Jul 7, 2024Updated last year
- Data and scripts for examining the Department of Defense's 1033 excess equipment program☆16Jun 21, 2022Updated 4 years ago
- Allows users to instantly reveal who donated to any current lawmakers☆10Jun 18, 2015Updated 11 years ago
- Notes and activity code for the "Python 3: Data cleaning and visualization with pandas and matplotlib" session at the 2018 NICAR conferen…☆11Jun 1, 2021Updated 5 years ago
- Can LLMs generate code-mixed sentences through zero-shot prompting?☆11Apr 18, 2023Updated 3 years ago
- This library displays the XAML code of theme libraries for WPF (e.g. MaterialDesignInXamlToolkit)☆12Sep 6, 2017Updated 8 years ago
- A cookiecutter template for creating a tvOS project running Python code.☆17Jul 11, 2016Updated 9 years ago
- parse uniform crime reporting clearance data☆13Oct 2, 2015Updated 10 years ago
- A Python module to convert natural language numerics into ints and floats.☆233Sep 26, 2024Updated last year
- Code for extracting data from a large number of PDFs, particularly FCC political ad documents☆15Oct 26, 2017Updated 8 years ago