A tiny library for Python text normalisation. Useful for ad-hoc text processing.
☆157Mar 8, 2026Updated 3 months ago
Alternatives and similar repositories for normality
Users that are interested in normality are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Commons of stupid, simple Python micro functions. Pull requests very welcome.☆21Jun 20, 2026Updated 2 weeks ago
- Now included in rigour☆150Nov 24, 2025Updated 7 months ago
- A Python library for defining rule-based overrides on messy data☆18Nov 24, 2025Updated 7 months ago
- Fingerpaint with your data.☆18Feb 5, 2012Updated 14 years ago
- Code for extracting data from a large number of PDFs, particularly FCC political ad documents☆15Oct 26, 2017Updated 8 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Data cleaning and validation functions for names, languages, identifiers, etc.☆63Jun 25, 2026Updated last week
- ArchiveKit manages data and documents during ETL processes, either on a local file system or on S3.☆15May 2, 2015Updated 11 years ago
- Text normalization library for Python☆201Mar 26, 2018Updated 8 years ago
- A small repo of notes and scripts for collecting data on U.S. deadly force police incidents☆10Aug 9, 2015Updated 10 years ago
- Provide partial dates and retain the date precision through processing☆14Aug 4, 2025Updated 11 months ago
- etl pipeline, graphical explorer and general toolbox for investigations with follow the money data☆27Jul 15, 2025Updated 11 months ago
- Simple type converters: make ints, floats, bools and dates from your strings!☆11Jul 23, 2016Updated 9 years ago
- A selectable, scrollable list interface for terminal applications built using curses☆10Jun 30, 2015Updated 11 years ago
- Archive of political ad data from the Federal Communications Commission☆21Oct 25, 2017Updated 8 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A demo project and template repository showing how I use SpatiaLite with Datasette for quick spatial analysis.☆17Jul 7, 2024Updated last year
- Next-gen web application for public finance data warehouses, formerly OpenSpending☆57Jul 6, 2022Updated 3 years ago
- 🧹 Python package for text cleaning☆1,024May 15, 2026Updated last month
- API client for Aleph, supports bulk entity and document upload.☆30Mar 5, 2026Updated 3 months ago
- Little JSON object want to be graphs, too!☆17Oct 2, 2015Updated 10 years ago
- parse uniform crime reporting clearance data☆13Oct 2, 2015Updated 10 years ago
- Dump (freeze) SQL query results from a database into a selection of file formats☆91May 8, 2019Updated 7 years ago
- Tools for analyzing the Hillary Clinton emails☆13Apr 24, 2016Updated 10 years ago
- A collaborative collection of structured datasets and document collections that are common to use within "Follow the Money" investigation…☆16May 13, 2026Updated last month
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Load CSV files into Postgres without explicit schema creation.☆80Jun 26, 2021Updated 5 years ago
- A collection of cheat sheets for remembering common commands and tips for data journalism work.☆39Oct 12, 2023Updated 2 years ago
- Obtained in December 2014 through a Freedom of Information request☆15Jan 29, 2016Updated 10 years ago
- Simple way to build a new dict based on fields declaration☆15May 7, 2019Updated 7 years ago
- N-grams approximate string matching implementation in pure Python☆26Sep 20, 2010Updated 15 years ago
- Scripts as a service. Builds on systemd (for Linux)☆21Mar 10, 2026Updated 3 months ago
- An open database of international sanctions data, persons of interest and politically exposed persons☆762Updated this week
- Watch a git repository, mirror it on a web server, and push to S3 with the appropriate commit message.☆17Apr 6, 2016Updated 10 years ago
- Generate changelogs from commit tags and shortlogs☆28Nov 2, 2025Updated 8 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Open remote tables, be they CSV, XLSX, HTML, XML, ...☆33Oct 20, 2011Updated 14 years ago
- Speech recognition in Python made easy and flexible☆11Sep 12, 2015Updated 10 years ago
- buildstrap: when buildout+pip=♥☆16Sep 23, 2016Updated 9 years ago
- ☆32Aug 27, 2018Updated 7 years ago
- Preprocessing Library for Natural Language Processing☆164Dec 6, 2022Updated 3 years ago
- a python library for parsing unstructured western names into name components.☆621May 15, 2025Updated last year
- Data on Digital Media and Technology Expenditures in the United States Congress☆10Jul 17, 2017Updated 8 years ago