A tiny library for Python text normalisation. Useful for ad-hoc text processing.
☆156Mar 8, 2026Updated last week
Alternatives and similar repositories for normality
Users that are interested in normality are comparing it to the libraries listed below
Sorting:
- Commons of stupid, simple Python micro functions. Pull requests very welcome.☆21Apr 10, 2025Updated 11 months ago
- Fingerpaint with your data.☆18Feb 5, 2012Updated 14 years ago
- Code for extracting data from a large number of PDFs, particularly FCC political ad documents☆15Oct 26, 2017Updated 8 years ago
- Data cleaning and validation functions for names, languages, identifiers, etc.☆56Mar 11, 2026Updated last week
- ArchiveKit manages data and documents during ETL processes, either on a local file system or on S3.☆15May 2, 2015Updated 10 years ago
- Text normalization library for Python☆201Mar 26, 2018Updated 7 years ago
- Provide partial dates and retain the date precision through processing☆14Aug 4, 2025Updated 7 months ago
- Simple type converters: make ints, floats, bools and dates from your strings!☆11Jul 23, 2016Updated 9 years ago
- US election metadata, packaged as python!☆10Mar 16, 2022Updated 4 years ago
- Archive of political ad data from the Federal Communications Commission☆20Oct 25, 2017Updated 8 years ago
- A selectable, scrollable list interface for terminal applications built using curses☆10Jun 30, 2015Updated 10 years ago
- A demo project and template repository showing how I use SpatiaLite with Datasette for quick spatial analysis.☆17Jul 7, 2024Updated last year
- Next-gen web application for public finance data warehouses, formerly OpenSpending☆57Jul 6, 2022Updated 3 years ago
- Python scraper to get weekly CDC flu surveillance data☆25Dec 2, 2014Updated 11 years ago
- API client for Aleph, supports bulk entity and document upload.☆29Mar 5, 2026Updated 2 weeks ago
- Lightweight web scraping toolkit for documents and structured data.☆315Jan 10, 2024Updated 2 years ago
- Little JSON object want to be graphs, too!☆17Oct 2, 2015Updated 10 years ago
- parse uniform crime reporting clearance data☆13Oct 2, 2015Updated 10 years ago
- Dump (freeze) SQL query results from a database into a selection of file formats☆92May 8, 2019Updated 6 years ago
- Platform for journalists to search, analyse, categorise and share unstructured data☆58Updated this week
- A collaborative collection of structured datasets and document collections that are common to use within "Follow the Money" investigation…☆15Updated this week
- Load CSV files into Postgres without explicit schema creation.☆81Jun 26, 2021Updated 4 years ago
- Obtained in December 2014 through a Freedom of Information request☆15Jan 29, 2016Updated 10 years ago
- Simple way to build a new dict based on fields declaration☆15May 7, 2019Updated 6 years ago
- N-grams approximate string matching implementation in pure Python☆26Sep 20, 2010Updated 15 years ago
- Ask questions about government data.☆38Jan 17, 2019Updated 7 years ago
- Scripts as a service. Builds on systemd (for Linux)☆21Mar 10, 2026Updated last week
- Service to scan licenses from source code☆12Aug 14, 2023Updated 2 years ago
- An open database of international sanctions data, persons of interest and politically exposed persons☆694Updated this week
- Watch a git repository, mirror it on a web server, and push to S3 with the appropriate commit message.☆17Apr 6, 2016Updated 9 years ago
- Open remote tables, be they CSV, XLSX, HTML, XML, ...☆33Oct 20, 2011Updated 14 years ago
- Generate changelogs from commit tags and shortlogs☆28Nov 2, 2025Updated 4 months ago
- Speech recognition in Python made easy and flexible☆11Sep 12, 2015Updated 10 years ago
- buildstrap: when buildout+pip=♥☆16Sep 23, 2016Updated 9 years ago
- A basic analysis of aircraft suspected to be operated by the FBI for the purpose of surveillance.☆21Jun 3, 2015Updated 10 years ago
- Preprocessing Library for Natural Language Processing☆164Dec 6, 2022Updated 3 years ago
- a python library for parsing unstructured western names into name components.☆617May 15, 2025Updated 10 months ago
- Data on Digital Media and Technology Expenditures in the United States Congress☆10Jul 17, 2017Updated 8 years ago
- Framework and command-line tools for integrating FollowTheMoney data streams from multiple sources☆235Updated this week