A tiny library for Python text normalisation. Useful for ad-hoc text processing.
☆157 · Mar 8, 2026 · Updated last month
Alternatives and similar repositories for normality
Users that are interested in normality are comparing it to the libraries listed below.
- Commons of stupid, simple Python micro functions. Pull requests very welcome. ☆21 · Apr 14, 2026 · Updated 2 weeks ago
- Now included in rigour ☆150 · Nov 24, 2025 · Updated 5 months ago
- A Python library for defining rule-based overrides on messy data ☆18 · Nov 24, 2025 · Updated 5 months ago
- Fingerpaint with your data. ☆18 · Feb 5, 2012 · Updated 14 years ago
- Code for extracting data from a large number of PDFs, particularly FCC political ad documents ☆15 · Oct 26, 2017 · Updated 8 years ago
- ArchiveKit manages data and documents during ETL processes, either on a local file system or on S3. ☆15 · May 2, 2015 · Updated 10 years ago
- Text normalization library for Python ☆201 · Mar 26, 2018 · Updated 8 years ago
- Provide partial dates and retain the date precision through processing ☆14 · Aug 4, 2025 · Updated 8 months ago
- etl pipeline, graphical explorer and general toolbox for investigations with follow the money data ☆25 · Jul 15, 2025 · Updated 9 months ago
- US election metadata, packaged as python! ☆10 · Mar 16, 2022 · Updated 4 years ago
- Archive of political ad data from the Federal Communications Commission ☆20 · Oct 25, 2017 · Updated 8 years ago
- A selectable, scrollable list interface for terminal applications built using curses ☆10 · Jun 30, 2015 · Updated 10 years ago
- A demo project and template repository showing how I use SpatiaLite with Datasette for quick spatial analysis. ☆17 · Jul 7, 2024 · Updated last year
- Next-gen web application for public finance data warehouses, formerly OpenSpending ☆57 · Jul 6, 2022 · Updated 3 years ago
- 🧹 Python package for text cleaning ☆1,010 · Jan 28, 2026 · Updated 3 months ago
- Python scraper to get weekly CDC flu surveillance data ☆25 · Dec 2, 2014 · Updated 11 years ago
- API client for Aleph, supports bulk entity and document upload. ☆29 · Mar 5, 2026 · Updated last month
- Lightweight web scraping toolkit for documents and structured data. ☆315 · Jan 10, 2024 · Updated 2 years ago
- Little JSON object want to be graphs, too! ☆17 · Oct 2, 2015 · Updated 10 years ago
- parse uniform crime reporting clearance data ☆13 · Oct 2, 2015 · Updated 10 years ago
- Dump (freeze) SQL query results from a database into a selection of file formats ☆92 · May 8, 2019 · Updated 6 years ago
- Tools for analyzing the Hillary Clinton emails ☆13 · Apr 24, 2016 · Updated 10 years ago
- Platform for journalists to search, analyse, categorise and share unstructured data ☆59 · Updated this week
- A collaborative collection of structured datasets and document collections that are common to use within "Follow the Money" investigation… ☆15 · Apr 14, 2026 · Updated 2 weeks ago
- Load CSV files into Postgres without explicit schema creation. ☆81 · Jun 26, 2021 · Updated 4 years ago
- Obtained in December 2014 through a Freedom of Information request ☆15 · Jan 29, 2016 · Updated 10 years ago
- Simple way to build a new dict based on fields declaration ☆15 · May 7, 2019 · Updated 6 years ago
- N-grams approximate string matching implementation in pure Python ☆26 · Sep 20, 2010 · Updated 15 years ago
- Ask questions about government data. ☆38 · Jan 17, 2019 · Updated 7 years ago
- Scripts as a service. Builds on systemd (for Linux) ☆21 · Mar 10, 2026 · Updated last month
- Service to scan licenses from source code ☆12 · Aug 14, 2023 · Updated 2 years ago
- Watch a git repository, mirror it on a web server, and push to S3 with the appropriate commit message. ☆17 · Apr 6, 2016 · Updated 10 years ago
- Open remote tables, be they CSV, XLSX, HTML, XML, ... ☆33 · Oct 20, 2011 · Updated 14 years ago
- Generate changelogs from commit tags and shortlogs ☆28 · Nov 2, 2025 · Updated 5 months ago
- Speech recognition in Python made easy and flexible ☆11 · Sep 12, 2015 · Updated 10 years ago
- The open source platform that securely stores large amounts of data and makes it searchable for easy collaboration. ☆75 · Updated this week
- buildstrap: when buildout+pip=♥ ☆16 · Sep 23, 2016 · Updated 9 years ago
- A basic analysis of aircraft suspected to be operated by the FBI for the purpose of surveillance. ☆21 · Jun 3, 2015 · Updated 10 years ago
- Preprocessing Library for Natural Language Processing ☆164 · Dec 6, 2022 · Updated 3 years ago