A tiny library for Python text normalisation. Useful for ad-hoc text processing.
☆156Mar 8, 2026Updated last month
Alternatives and similar repositories for normality
Users that are interested in normality are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Commons of stupid, simple Python micro functions. Pull requests very welcome.☆21Apr 10, 2025Updated last year
- Now included in rigour☆151Nov 24, 2025Updated 4 months ago
- A Python library for defining rule-based overrides on messy data☆18Nov 24, 2025Updated 4 months ago
- Fingerpaint with your data.☆18Feb 5, 2012Updated 14 years ago
- Code for extracting data from a large number of PDFs, particularly FCC political ad documents☆15Oct 26, 2017Updated 8 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Data cleaning and validation functions for names, languages, identifiers, etc.☆57Mar 30, 2026Updated last week
- Text normalization library for Python☆201Mar 26, 2018Updated 8 years ago
- A small repo of notes and scripts for collecting data on U.S. deadly force police incidents☆10Aug 9, 2015Updated 10 years ago
- Provide partial dates and retain the date precision through processing☆14Aug 4, 2025Updated 8 months ago
- etl pipeline, graphical explorer and general toolbox for investigations with follow the money data☆25Jul 15, 2025Updated 8 months ago
- Simple type converters: make ints, floats, bools and dates from your strings!☆11Jul 23, 2016Updated 9 years ago
- Archive of political ad data from the Federal Communications Commission☆20Oct 25, 2017Updated 8 years ago
- A selectable, scrollable list interface for terminal applications built using curses☆10Jun 30, 2015Updated 10 years ago
- A demo project and template repository showing how I use SpatiaLite with Datasette for quick spatial analysis.☆17Jul 7, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Next-gen web application for public finance data warehouses, formerly OpenSpending☆57Jul 6, 2022Updated 3 years ago
- 🧹 Python package for text cleaning☆1,005Jan 28, 2026Updated 2 months ago
- Python scraper to get weekly CDC flu surveillance data☆25Dec 2, 2014Updated 11 years ago
- API client for Aleph, supports bulk entity and document upload.☆29Mar 5, 2026Updated last month
- Lightweight web scraping toolkit for documents and structured data.☆315Jan 10, 2024Updated 2 years ago
- Little JSON object want to be graphs, too!☆17Oct 2, 2015Updated 10 years ago
- parse uniform crime reporting clearance data☆13Oct 2, 2015Updated 10 years ago
- Dump (freeze) SQL query results from a database into a selection of file formats☆92May 8, 2019Updated 6 years ago
- Tools for analyzing the Hillary Clinton emails☆13Apr 24, 2016Updated 9 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Platform for journalists to search, analyse, categorise and share unstructured data☆60Updated this week
- A collaborative collection of structured datasets and document collections that are common to use within "Follow the Money" investigation…☆14Apr 1, 2026Updated last week
- Load CSV files into Postgres without explicit schema creation.☆81Jun 26, 2021Updated 4 years ago
- A collection of cheat sheets for remembering common commands and tips for data journalism work.☆39Oct 12, 2023Updated 2 years ago
- Obtained in December 2014 through a Freedom of Information request☆15Jan 29, 2016Updated 10 years ago
- Simple way to build a new dict based on fields declaration☆15May 7, 2019Updated 6 years ago
- N-grams approximate string matching implementation in pure Python☆26Sep 20, 2010Updated 15 years ago
- Ask questions about government data.☆38Jan 17, 2019Updated 7 years ago
- Scripts as a service. Builds on systemd (for Linux)☆21Mar 10, 2026Updated last month
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Service to scan licenses from source code☆12Aug 14, 2023Updated 2 years ago
- An open database of international sanctions data, persons of interest and politically exposed persons☆707Updated this week
- The open source platform that securely stores large amounts of data and makes it searchable for easy collaboration.☆65Updated this week
- Watch a git repository, mirror it on a web server, and push to S3 with the appropriate commit message.☆17Apr 6, 2016Updated 10 years ago
- Open remote tables, be they CSV, XLSX, HTML, XML, ...☆33Oct 20, 2011Updated 14 years ago
- Generate changelogs from commit tags and shortlogs☆28Nov 2, 2025Updated 5 months ago
- Speech recognition in Python made easy and flexible